Two Simple Measures of Variability for Categorical Data

Allaj, E. (2018). Two simple measures of variability for categorical data. Journal of applied statistics, 45(8), 1497-1516.

20 Pages Posted: 3 Jan 2017 Last revised: 18 Apr 2024

See all articles by Erindi Allaj

Erindi Allaj

Parma University, Department of Economics and Management

Date Written: December 31, 2016

Abstract

This paper proposes two new variability measures for categorical data. The first variability measure is obtained as one minus the square root of the sum of the squares of the relative frequencies of the different categories. The second measure is obtained by standardizing the first measure. The measures proposed are functions of the variability measure proposed by Gini (1912) and approximate the coefficient of nominal variation (CNV) introduced by Kvalseth (1995) when the number of categories increases. Different mathematical properties of the proposed variability measures are studied and analyzed. Several examples illustrate how the variability measures can be interpreted and used in practice.

Keywords: Categorical Data; Variability Measure; Relative Frequencies; Euclidean Distance; Categorical Variable

Suggested Citation

Allaj, Erindi, Two Simple Measures of Variability for Categorical Data (December 31, 2016). Allaj, E. (2018). Two simple measures of variability for categorical data. Journal of applied statistics, 45(8), 1497-1516., Available at SSRN: https://ssrn.com/abstract=2892097 or http://dx.doi.org/10.2139/ssrn.2892097

Erindi Allaj (Contact Author)

Parma University, Department of Economics and Management ( email )

Via J.F. Kennedy, 6
Parma, 43125
Italy

Do you have a job opening that you would like to promote on SSRN?

Paper statistics

Downloads
519
Abstract Views
2,636
Rank
134,337
PlumX Metrics