Topic Modeling for Media and Communication Research: A Short Primer

20 pages. Posted: 9 Sep 2016. Last revised: 13 Sep 2016.

Cornelius Puschmann

Alexander von Humboldt Institute for Internet and Society

Tatjana Scheffler

University of Potsdam

Date Written: August 25, 2016

Abstract

A variety of powerful tools for the automated and semi-automated analysis of textual data are increasingly at the disposal of media and communication researchers. Among these methods, the family of techniques known as topic modeling has recently attracted particular interest. What utility does one popular type of topic model, latent Dirichlet allocation (LDA), have for media and communication research? This paper illustrates some distinct strengths and weaknesses of LDA. We first briefly introduce its conceptual foundations, along with a selection of studies from the social sciences that apply it to different types of content, from newspapers and scientific publications to literary texts and social media. We then present a case study of news coverage of the Syrian civil war. After describing our data, we turn to two facets of the results in particular: the relation between terms and topics, and the proportions of topics in documents, aggregated by month. We make the case for contrastive (rather than descriptive) uses of topic modeling that build broader analyses on the initial output of a model, rather than concluding with a list of terms.
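The monthly aggregation mentioned in the abstract can be illustrated with a minimal sketch. It assumes that an LDA model has already been fitted and that each document comes with a publication date and a vector of topic proportions. The function name `monthly_topic_proportions` and the toy data are hypothetical and are not taken from the paper; averaging within months is one plausible aggregation, not necessarily the one the authors use.

```python
from collections import defaultdict
from datetime import date


def monthly_topic_proportions(docs):
    """Average per-document topic proportions within each calendar month.

    `docs` is an iterable of (publication_date, proportions) pairs, where
    `proportions` is one document's topic distribution (it sums to 1).
    Returns {"YYYY-MM": [mean proportion of topic 0, topic 1, ...]}.
    """
    sums = {}
    counts = defaultdict(int)
    for pub_date, proportions in docs:
        month = pub_date.strftime("%Y-%m")
        if month not in sums:
            sums[month] = [0.0] * len(proportions)
        # Accumulate each topic's proportion for this month.
        for k, p in enumerate(proportions):
            sums[month][k] += p
        counts[month] += 1
    # Divide by the number of documents to get the monthly mean per topic.
    return {m: [s / counts[m] for s in sums[m]] for m in sorted(sums)}


# Hypothetical toy data: three articles, two topics (values are made up).
toy_docs = [
    (date(2013, 8, 3), [0.8, 0.2]),
    (date(2013, 8, 21), [0.4, 0.6]),
    (date(2013, 9, 5), [0.1, 0.9]),
]
monthly = monthly_topic_proportions(toy_docs)
```

Because each document's proportions sum to 1, the monthly means do as well, so the months can be compared directly. This is the kind of contrastive reading the abstract argues for, as opposed to stopping at a list of top terms per topic.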

Keywords: Methods, Content analysis, Topic modeling, LDA, News

Suggested Citation

Puschmann, Cornelius and Scheffler, Tatjana, Topic Modeling for Media and Communication Research: A Short Primer (August 25, 2016). HIIG Discussion Paper Series No. 2016-05, Available at SSRN: https://ssrn.com/abstract=2836478 or http://dx.doi.org/10.2139/ssrn.2836478

Cornelius Puschmann (Contact Author)

Alexander von Humboldt Institute for Internet and Society

Französische Str. 9
Berlin, 10117
Germany
+49 30 2007 6082 (Phone)

HOME PAGE: http://www.hiig.de

Tatjana Scheffler

University of Potsdam

Karl-Liebknecht-Str. 24-25
Potsdam, DE 14476
Germany
