Two Simple Measures of Variability for Categorical Data
Allaj, E. (2018). Two simple measures of variability for categorical data. Journal of applied statistics, 45(8), 1497-1516.
20 Pages Posted: 3 Jan 2017 Last revised: 18 Apr 2024
Date Written: December 31, 2016
Abstract
This paper proposes two new variability measures for categorical data. The first variability measure is obtained as one minus the square root of the sum of the squares of the relative frequencies of the different categories. The second measure is obtained by standardizing the first measure. The measures proposed are functions of the variability measure proposed by Gini (1912) and approximate the coefficient of nominal variation (CNV) introduced by Kvalseth (1995) when the number of categories increases. Different mathematical properties of the proposed variability measures are studied and analyzed. Several examples illustrate how the variability measures can be interpreted and used in practice.
Keywords: Categorical Data; Variability Measure; Relative Frequencies; Euclidean Distance; Categorical Variable
Suggested Citation: Suggested Citation