Inferring Gender From Name: A Large Scale Performance Evaluation Study

19 Pages Posted: 12 Oct 2023

See all articles by Kriste Krstovski

Kriste Krstovski

Columbia University

Yao Lu

Columbia University in the City of New York

Ye Xu

Columbia University in the City of New York

Date Written: September 16, 2023

Abstract

A person's gender is a crucial piece of information when performing research across a wide range of scientific disciplines, such as medicine, sociology, political science, and economics, to name a few. However, in increasing instances, especially given the proliferation of big data, gender information is not readily available. In such cases researchers need to infer gender from readily available information, primarily from persons' names. While inferring gender from name may raise some ethical questions, the lack of viable alternatives means that researchers have to resort to such approaches when the goal justifies the means - in the majority of such studies the goal is to examine patterns and determinants of gender disparities. The necessity of name-to-gender inference has generated an ever-growing domain of algorithmic approaches and software products. These approaches have been used throughout the world in academia, industry, governmental and non-governmental organizations. Nevertheless, the existing approaches have yet to be systematically evaluated and compared, making it challenging to determine the optimal approach for future research. In this work, we conducted a large scale performance evaluation of existing approaches for name-to-gender inference. Analysis are performed using a variety of large annotated datasets of names. We further propose two new hybrid approaches that achieve better performance than any single existing approach.

Keywords: gender inference from name, machine learning, natural language processing

Suggested Citation

Krstovski, Kriste and Lu, Yao and Xu, Ye, Inferring Gender From Name: A Large Scale Performance Evaluation Study (September 16, 2023). Available at SSRN: https://ssrn.com/abstract=4572589 or http://dx.doi.org/10.2139/ssrn.4572589

Kriste Krstovski (Contact Author)

Columbia University ( email )

New York
United States

Yao Lu

Columbia University in the City of New York

Ye Xu

Columbia University in the City of New York

Do you have a job opening that you would like to promote on SSRN?

Paper statistics

Downloads
65
Abstract Views
305
Rank
690,744
PlumX Metrics