simcadi: Similarity Indices for Categorical Distributions

11 Pages Posted: 17 Nov 2016 Last revised: 23 Nov 2016

See all articles by Stephan Huber

Stephan Huber

University of Regensburg; Hochschule Fresenius - University of Applied Sciences

Date Written: November 16, 2016

Abstract

In this article we introduce the Stata command simcadi. It helps to calculate indicators for the similarity of categorically ordered variables and distribution, respectively. In particular, it permits the calculation of the Cosine index, and indices introduced by Finger and Kreinin (1979), Bray and Curtis (1957), Dice (1945), Sørenson (1948), Jaccard (1912), Grubel and Lloyd (1971), Ruzicka (1958), and Gower (1971). Moreover, it allows us to compute the development of a distribution over time. The command offers various options for an efficient handling of datasets, because it permits the calculation of benchmarks of comparison automatically, and the incoporation of complex weighting schemes.

Keywords: Categorical Distributions, Indicators, Similarity, Distance Measurements

JEL Classification: C1, C8

Suggested Citation

Huber, Stephan and Huber, Stephan, simcadi: Similarity Indices for Categorical Distributions (November 16, 2016). Available at SSRN: https://ssrn.com/abstract=2870834 or http://dx.doi.org/10.2139/ssrn.2870834

Stephan Huber (Contact Author)

Hochschule Fresenius - University of Applied Sciences ( email )

Limburger Str. 2
Idstein, 65510
Germany

University of Regensburg ( email )

93040 Regensburg
D-93040 Regensburg, 93053
Germany

Do you have negative results from your research you’d like to share?

Paper statistics

Downloads
178
Abstract Views
1,037
Rank
308,143
PlumX Metrics