Abstract

http://ssrn.com/abstract=1832024
 
 

References (34)



 
 

Citations (1)



 


 



An Automated Snowball Census of the Political Web


Abe Gong


University of Michigan at Ann Arbor - Gerald R. Ford School of Public Policy

August 9, 2011


Abstract:     
This paper solves a persistent methodological problem for social scientists studying the political web: representative sampling. Virtually all existing studies of the political web are based on incomplete samples, and therefore lack generalizability. In this paper, I combine methods from computer science and sampling theory to conduct an automated snowball census of the political web and constructs an all-but-complete index of English political websites. I check the robustness of this index, use it to generate descriptive statistics for the entire political web, and demonstrate that studies based on ad hoc sampling strategies are likely to be biased in important ways. In future research, this bias can be eliminated by using this index as a sampling universe. In addition, the methods and open-source software presented here can be used to creating similar sampling frames for other online content domains.

Number of Pages in PDF File: 34

Keywords: sampling theory, web mining, text classification, computational social science

working papers series


Download This Paper

Date posted: May 9, 2011 ; Last revised: August 19, 2014

Suggested Citation

Gong, Abe, An Automated Snowball Census of the Political Web (August 9, 2011). Available at SSRN: http://ssrn.com/abstract=1832024 or http://dx.doi.org/10.2139/ssrn.1832024

Contact Information

Abe Gong (Contact Author)
University of Michigan at Ann Arbor - Gerald R. Ford School of Public Policy ( email )
735 South State Street, Weill Hall
Ann Arbor, MI 48109
United States
Feedback to SSRN


Paper statistics
Abstract Views: 921
Downloads: 110
Download Rank: 147,443
References:  34
Citations:  1

© 2014 Social Science Electronic Publishing, Inc. All Rights Reserved.  FAQ   Terms of Use   Privacy Policy   Copyright   Contact Us
This page was processed by apollo5 in 0.281 seconds