Web Data Collection

Forthcoming in: Handbook of Quantitative Methods in Sociology, ed. Ulf Liebe, Edward Elgar Publishing.

29 pages. Posted: 19 Dec 2024

Nicole Schwitter

University of Mannheim - Mannheim Centre for European Social Research (MZES); University of Warwick

Omer Faruk Yalcin

University of Massachusetts Amherst

Date Written: October 27, 2024

Abstract

The digital revolution and the widespread use of the internet have profoundly affected both people's everyday lives and empirical social science research. The resulting digital landscape holds unprecedented volumes of data, which sociologists increasingly use. Popular sources of web data include social media sites, news sites, digitised archives, crowd-sourced information, and online markets. How can this data be harnessed? This chapter provides a hands-on introduction to collecting web data directly from platforms, with code examples in two popular programming languages, R and Python.

Keywords: API, application programming interface, data collection, digital trace data, web data, web scraping

Suggested Citation

Schwitter, Nicole and Yalcin, Omer Faruk, Web Data Collection (October 27, 2024). Forthcoming in: Handbook of Quantitative Methods in Sociology, ed. Ulf Liebe, Edward Elgar Publishing. Available at SSRN: https://ssrn.com/abstract=5009050 or http://dx.doi.org/10.2139/ssrn.5009050

Nicole Schwitter (Contact Author)

University of Mannheim - Mannheim Centre for European Social Research (MZES)

D-68131 Mannheim
Germany

University of Warwick

Gibbet Hill Rd.
Coventry, West Midlands CV4 8UW
United Kingdom

Omer Faruk Yalcin

University of Massachusetts Amherst
