Statistical Problems and Solutions in Onomastic Research - Exemplified by a Comparison of Given Name Distributions in Germany Throughout the 20th Century
36 Pages
Posted: 9 Dec 2010
Date Written: November 1, 2010
Abstract
The German Socio-Economic Panel Study (SOEP) offers the rare opportunity to examine patterns of given names in a representative sample of more than 50,000 people born since 1900. This paper develops an exemplary picture of typical frequency distributions of given names and their development over time. First, we discuss the advantages and limitations of the various databases that have been widely used to study the distribution of given names. Second, we address the problem that name distributions are typically characterized by a "Large Number of Rare Events" (LNRE) zone, focusing on the difficulties this creates for comparing name distributions. Third, we apply measures of the concentration of distributions borrowed from other lines of research (economics and computational linguistics). Finally, we highlight the problem of assessing the statistical significance of differences between name distributions that are based on samples.
Keywords: Given names, large number of rare events (LNRE), concentration of distributions, SOEP
JEL Classification: C49, C83, Y8
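The abstract does not say which concentration measures the paper applies. As a purely illustrative sketch, the Python snippet below computes two commonly used candidates for a list of given names: the Herfindahl-Hirschman index from economics and the share of names occurring exactly once (hapax legomena), a quantity often used in computational linguistics to describe the LNRE zone. The function name, the returned keys, and the sample data are assumptions made for this illustration and are not taken from the paper or from SOEP.

```python
from collections import Counter


def concentration_summary(names):
    """Summarize how concentrated a list of given names is.

    Illustrative only: the measures chosen here are assumptions,
    not necessarily the ones applied in the paper.
    """
    counts = Counter(names)
    n_tokens = sum(counts.values())
    if n_tokens == 0:
        raise ValueError("need at least one name")

    # Relative frequency of each distinct name.
    shares = [c / n_tokens for c in counts.values()]

    # Herfindahl-Hirschman index: sum of squared shares.
    # It ranges from 1/len(counts) (all names equally frequent) to 1 (a single name).
    hhi = sum(s * s for s in shares)

    # Names that occur exactly once; a high proportion signals a large LNRE zone.
    hapax = sum(1 for c in counts.values() if c == 1)

    return {
        "tokens": n_tokens,                        # number of people
        "types": len(counts),                      # number of distinct names
        "hhi": hhi,
        "hapax_share_of_types": hapax / len(counts),
        "top_share": max(shares),                  # share of the most frequent name
    }


# Hypothetical toy cohort, not SOEP data.
sample = ["Anna", "Anna", "Anna", "Karl", "Karl", "Emma", "Otto", "Ida"]
summary = concentration_summary(sample)
print(summary)
```

Both measures depend on sample size, because larger samples reveal more rare names. This is one reason comparisons between cohorts of different sizes require the care that the abstract calls for.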