On the Validity of Using Census Geocode Characteristics to Proxy Individual Socioeconomic Characteristics

28 Pages. Posted: 20 Sep 2000. Last revised: 8 Jan 2023

Arline T. Geronimus

University of Michigan at Ann Arbor - School of Public Health

John Bound

University of Michigan; National Bureau of Economic Research (NBER)

Lisa J. Neidert

University of Michigan at Ann Arbor

Date Written: December 1995

Abstract

Investigators of social differentials in health outcomes commonly augment incomplete micro data by appending socioeconomic characteristics of residential areas (such as median income in a zip code) to proxy for individual characteristics. However, little empirical attention has been paid to how well this aggregate information serves as a proxy for the individual characteristics of interest. We build on recent work addressing the biases inherent in proxies and consider two health-related examples within a statistical framework that illuminate the nature and sources of biases. Data from the Panel Study of Income Dynamics and the National Maternal and Infant Health Survey are linked to census data. We assess the validity of using the aggregate census information as a proxy for individual information when estimating main effects, and when controlling for potential confounding between socioeconomic and sociodemographic factors in measures of general health status and infant mortality. We find a general, but not universal, tendency for aggregate proxies to exaggerate the effects of micro-level variables and to do more poorly than micro-level variables at controlling for confounding. The magnitude and direction of these biases, however, vary across samples. Our statistical framework and empirical findings suggest the difficulties in and limits to interpreting proxies derived from aggregate census data as if they were micro-level variables. The statistical framework we outline for our study of health outcomes should be generally applicable to other situations where researchers have merged aggregate data with micro data samples.
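The tendency described above can be illustrated with a small simulation. This is a minimal sketch under assumptions that are not taken from the paper: the data-generating process, the parameter names (beta, gamma), and the helper functions ols_slope and simulate are all hypothetical. Individual income is an area mean plus individual noise, and the outcome depends on individual income (beta) and, separately, on an area-level factor that moves with the area mean (gamma). Under these assumptions the population slope on the individual variable is beta + gamma/2, while the slope on the area-mean proxy is beta + gamma, so the proxy overstates the micro-level effect whenever gamma is positive and matches it when gamma is zero.

```python
import random


def ols_slope(x, y):
    """Slope from a simple one-regressor least-squares fit of y on x."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    return sxy / sxx


def simulate(beta=1.0, gamma=0.5, n_areas=2000, n_per_area=20, seed=0):
    """Return (micro_slope, proxy_slope) for a hypothetical two-level model.

    Assumed model (illustrative only, not the authors' framework):
        x_ij = m_j + e_ij                      individual income
        y_ij = beta * x_ij + gamma * m_j + u_ij   health outcome
    where m_j, e_ij, and u_ij are independent standard normal draws.
    """
    rng = random.Random(seed)
    micro, proxy, outcome = [], [], []
    for _ in range(n_areas):
        area_mean = rng.gauss(0.0, 1.0)
        for _ in range(n_per_area):
            x = area_mean + rng.gauss(0.0, 1.0)
            y = beta * x + gamma * area_mean + rng.gauss(0.0, 1.0)
            micro.append(x)          # individual-level variable
            proxy.append(area_mean)  # aggregate proxy appended to each record
            outcome.append(y)
    return ols_slope(micro, outcome), ols_slope(proxy, outcome)


if __name__ == "__main__":
    micro_slope, proxy_slope = simulate()
    print(f"slope on individual income: {micro_slope:.3f}")
    print(f"slope on area-mean proxy:   {proxy_slope:.3f}")
```

Setting gamma=0.0 in simulate removes the area-level factor, and the two slopes then coincide up to sampling noise. This matches the abstract's point that the bias is general but not universal, since it depends on how area characteristics relate to the outcome in a given sample.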

Suggested Citation

Geronimus, Arline T. and Bound, John and Neidert, Lisa J., On the Validity of Using Census Geocode Characteristics to Proxy Individual Socioeconomic Characteristics (December 1995). NBER Working Paper No. t0189, Available at SSRN: https://ssrn.com/abstract=225098

Arline T. Geronimus

University of Michigan at Ann Arbor - School of Public Health

109 S. Observatory
M5142 SPH II
Ann Arbor, MI 48109-2029
United States
(734) 763-7379 (Phone)
(734) 936-0929 (Fax)

John Bound (Contact Author)

University of Michigan

611 Tappan Street
Ann Arbor, MI 48109-1220
United States
313-998-7149 (Phone)
313-998-7415 (Fax)

National Bureau of Economic Research (NBER)

1050 Massachusetts Avenue
Cambridge, MA 02138
United States

Lisa J. Neidert

University of Michigan at Ann Arbor

500 S. State Street
Ann Arbor, MI 48109
United States