References (34)


Citations (1)



An Automated Snowball Census of the Political Web

Abe Gong

University of Michigan at Ann Arbor - Gerald R. Ford School of Public Policy

August 9, 2011

This paper solves a persistent methodological problem for social scientists studying the political web: representative sampling. Virtually all existing studies of the political web are based on incomplete samples, and therefore lack generalizability. In this paper, I combine methods from computer science and sampling theory to conduct an automated snowball census of the political web and constructs an all-but-complete index of English political websites. I check the robustness of this index, use it to generate descriptive statistics for the entire political web, and demonstrate that studies based on ad hoc sampling strategies are likely to be biased in important ways. In future research, this bias can be eliminated by using this index as a sampling universe. In addition, the methods and open-source software presented here can be used to creating similar sampling frames for other online content domains.

Number of Pages in PDF File: 34

Keywords: sampling theory, web mining, text classification, computational social science

Open PDF in Browser Download This Paper

Date posted: May 9, 2011 ; Last revised: August 19, 2014

Suggested Citation

Gong, Abe, An Automated Snowball Census of the Political Web (August 9, 2011). Available at SSRN: http://ssrn.com/abstract=1832024 or http://dx.doi.org/10.2139/ssrn.1832024

Contact Information

Abe Gong (Contact Author)
University of Michigan at Ann Arbor - Gerald R. Ford School of Public Policy ( email )
735 South State Street, Weill Hall
Ann Arbor, MI 48109
United States
Feedback to SSRN

Paper statistics
Abstract Views: 1,142
Downloads: 135
Download Rank: 160,816
References:  34
Citations:  1

© 2016 Social Science Electronic Publishing, Inc. All Rights Reserved.  FAQ   Terms of Use   Privacy Policy   Copyright   Contact Us
This page was processed by apollobot1 in 0.188 seconds