Using machine learning and qualitative interviews to design a five-question women's agency index

49 Pages Posted: 26 Mar 2021

See all articles by Seema Jayachandran

Seema Jayachandran

Northwestern University - Department of Economics

Monica Biradavolu

QualAnalytics

Jan Cooper

Harvard University T.H. Chan School of Public Health

Date Written: March 23, 2021

Abstract

We propose a new method to design a short survey measure of a complex concept such as women's agency. The approach combines mixed-methods data collection and machine learning. We select the best survey questions based on how strongly correlated they are with a "gold standard" measure of the concept derived from qualitative interviews. In our application, we measure agency for 209 women in Haryana, India, first, through a semi-structured interview and, second, through a large set of close-ended questions. We use qualitative coding methods to score each woman's agency based on the interview, which we treat as her true agency. To identify the close-ended questions most predictive of the "truth," we apply statistical algorithms that build on LASSO and random forest but constrain how many variables are selected for the model (five in our case). The resulting five-question index is as strongly correlated with the coded qualitative interview as is an index that uses all of the candidate questions. This approach of selecting survey questions based on their statistical correspondence to coded qualitative interviews could be used to design short survey modules for many other latent constructs.

Keywords: women's empowerment, survey design, feature selection, psychometrics

JEL Classification: C83, D13, J16, O12

Suggested Citation

Jayachandran, Seema and Biradavolu, Monica and Cooper, Jan, Using machine learning and qualitative interviews to design a five-question women's agency index (March 23, 2021). Global Poverty Research Lab Working Paper No. 21-104, Available at SSRN: https://ssrn.com/abstract=3811783 or http://dx.doi.org/10.2139/ssrn.3811783

Seema Jayachandran (Contact Author)

Northwestern University - Department of Economics ( email )

2003 Sheridan Road
Evanston, IL 60208
United States

Monica Biradavolu

QualAnalytics ( email )

Jan Cooper

Harvard University T.H. Chan School of Public Health ( email )

Boston, MA 02115
United States

Do you have a job opening that you would like to promote on SSRN?

Paper statistics

Downloads
111
Abstract Views
832
Rank
489,111
PlumX Metrics