19th Workshop on Information Technologies and Systems, pp. 175-180, December 2009
8 Pages Posted: 7 Mar 2010 Last revised: 6 Aug 2014
Date Written: December 8, 2009
Data quality in the Wikipedia is debatable. On the one hand, existing research indicates that not only are people willing to contribute articles but the quality of those articles is close to that found in conventional encyclopedias. On the other hand, the public has never stopped criticizing the quality of Wikipedia articles, and critics never have trouble finding low quality Wikipedia articles. Why do Wikipedia articles vary widely in quality? We investigate the relationship between collaboration and data quality. We show that the quality of Wikipedia articles is not only dependent on the different types of contributors but also on how they collaborate. Based on an empirical study, we classify contributors based on their roles in editing individual Wikipedia articles. We identify various patterns of collaboration based on the provenance or, more specifically, who does what to Wikipedia articles. Our research helps identify collaboration patterns that are preferable or detrimental for data quality, thus providing insights for improving data quality in Wikipedia.
Keywords: Wikipedia, data quality, collaboration pattern, provenance
JEL Classification: L15
Suggested Citation: Suggested Citation
Liu, Jun and Ram, Sudha, Who Does What: Collaboration Patterns in the Wikipedia and Their Impact on Data Quality (December 8, 2009). 19th Workshop on Information Technologies and Systems, pp. 175-180, December 2009. Available at SSRN: https://ssrn.com/abstract=1565682