SSRN Home Search and Download Papers Browse Abstract and Paper Submission Subscribe to Networks View Briefcase Top Papers Top Authors Top Institutions

 

Abstract

 
 

Citations (19)

Beta

 
 

Footnotes (16)

Beta

 


 


Download | Share | Email | Add to Briefcase | Buy Hard Copy

Yahoo! for Amazon: Sentiment Parsing from Small Talk on the Web

Sanjiv Ranjan Das
Santa Clara University - Leavey School of Business

Mike Y. Chen
University of California, Berkeley


August 5, 2001

EFA 2001 Barcelona Meetings

Abstract:     
The internet has made it feasible to tap a continuous stream of public sentiment from the world wide web, quite literally permitting one to "feel the pulse" of any issue under consideration. We present a methodology for real time sentiment extraction in the domain of finance. With the advent of the web, there has been a sharp increase in the influence of individuals on the stock market via web-based trading and the posting of sentiment to stock message boards. While it is important
to capture this "sentiment" of small investors, as yet, no index of sentiment has been compiled. This paper comprises (a) a technology for extracting small investor sentiment from web sources to create an index, and (b) illustrative applications of the methodology. We make use of computerized natural language and statistical algorithms for the automated classification of messages posted on the web. We design a suite of classification algorithms, each of different theoretical content, with a view to characterizing the sentiment of any single posting to a message board. The use of multiple methods allows imposition of voting rules in the classification process. It also enables elimination of "fuzzy" messages which are better off uninterpreted. A majority rule across algorithms vastly improves classification accuracy, but also leads to a natural increase in the number of messages classified as "fuzzy". The classifier achieves an accuracy of 62% (versus a random classification accuracy of 33%), and compares favorably against human agreement on message classification, which was 72%. The technology is computationally efficient, allowing the access and interpretations of thousands of messages within minutes. Our illustrative applications show evidence of a strong link between market movements and sentiment. Based on approximately 25,000 messages for the last quarter of 2000, we found evidence that sentiment is based on stock movements.

Working Paper Series

Date posted: July 11, 2001 ; Last revised: October 27, 2009

Suggested Citation

Das, Sanjiv Ranjan and Chen, Mike Y., Yahoo! for Amazon: Sentiment Parsing from Small Talk on the Web (August 5, 2001). EFA 2001 Barcelona Meetings. Available at SSRN: http://ssrn.com/abstract=276189 or doi:10.2139/ssrn.276189


Export to: Export Citation What's this?

Contact Information

Sanjiv Ranjan Das (Contact Author)
Santa Clara University - Leavey School of Business ( email )
Department of Finance
208 Kenna Hall
Santa Clara, CA 95053
United States
Mike Y. Chen
University of California, Berkeley ( email )
Computer Science Division
Berkeley, CA 94720
United States
Feedback to SSRN (Beta)


Paper statistics
Abstract Views: 6,034
Downloads: 853
Download Rank: 6,506
Citations: 19
Footnotes: 16

© 2009 Social Science Electronic Publishing, Inc. All Rights Reserved. Terms of Use  Privacy Policy
This page was served by apollo2 in 0.953 seconds.