How do Twitter Conversations Differ based on Geography, Time, and Subject? A Framework and Analysis of Topical Conversations in Microblogging

Automatic discovery of how members of social media are discussing different thoughts on particular topics would provide a unique insight into how people perceive different topics. However, identifying trending terms/words within a topical conversation is a difficult task. We take an information retrieval approach and use tf-idf (term frequency-inverse document frequency) to identify words that are more frequent in a focal conversation compared to other conversations on Twitter. This requires a query set of tweets on a particular topic (used for term frequency) and a control set of conversations to use for comparison (used for inverse document frequency). The terms identified as most important within a topical conversation are greatly affected by the particular control set used. There is no clear metric for whether one control set is better than another, since that is determined by the needs of the user, but we can investigate the stability properties of topics given different control sets. We propose a method for doing this, and show that some topics of conversation are more stable than other topics, and that this stability is also affected by whether only the most frequent terms are of interest (top-50), or if all words (full-vocabulary) are being examined. We end with a set of guidelines for how to build better topic analysis tools based on these results.

Keywords: social media, microblogging, trend identification, topic stability, language usage, ranking, Twitter

Suggested Citation: Suggested Citation

Lai, Victoria and Rand, William, How do Twitter Conversations Differ based on Geography, Time, and Subject? A Framework and Analysis of Topical Conversations in Microblogging (August 15, 2013). 2013 ASE/IEEE International Conference on Social Computing, Robert H. Smith School Research Paper, Available at SSRN: https://ssrn.com/abstract=2231823 or http://dx.doi.org/10.2139/ssrn.2231823

Victoria Lai (Contact Author)

University of Maryland - College of Computer, Mathematical and Natural Sciences ( email )

2300 Symons Hall,
University of Maryland
College Park, MD 20742-3255
United States

William Rand

North Carolina State University ( email )

Raleigh, NC 27695
United States

Download This Paper

Open PDF in Browser

Do you have a job opening that you would like to promote on SSRN?

Place Job Opening

Paper statistics

Downloads

117

Abstract Views

1,250

Rank

518,204

16 References

PlumX Metrics

Feedback

How do Twitter Conversations Differ based on Geography, Time, and Subject? A Framework and Analysis of Topical Conversations in Microblogging

Victoria Lai

William Rand

Abstract

Victoria Lai (Contact Author)

University of Maryland - College of Computer, Mathematical and Natural Sciences ( email )

William Rand

North Carolina State University ( email )

Do you have a job opening that you would like to promote on SSRN?

Paper statistics

Related eJournals

Economics of Networks eJournal

Social & Political Philosophy eJournal

Consumer Social Responsibility eJournal

Political Behavior: Cognition, Psychology, & Behavior eJournal

Economic & Social Impacts of Innovation eJournal

Philosophy of Language eJournal

Linguistic Anthropology eJournal

Sociology of Innovation eJournal

Psychology of Innovation eJournal

Social & Personality Psychology eJournal

Psychology Research Methods eJournal

Computational & Quantitative Research in Communication eJournal

Visual Anthropology, Media Studies & Performance eJournal