A Random Forest Approach for Real-Time Sentiment Analysis of Twitter Data

5 Pages Posted: 6 May 2025

See all articles by Darab Khan

Darab Khan

Galgotias University

Zeeshan Ahmed

Galgotias University

Date Written: February 05, 2025

Abstract

Twitter sentiment analysis has become pivotal in understanding public emotions and trends. This study presents a comprehensive framework leveraging the Random Forest algorithm to classify sentiments expressed in tweets into positive, negative, and neutral categories. The dataset comprises 75,681 entries, including sentiment labels and text, with 74,995 unique tweet texts. Extensive preprocessing and feature extraction techniques, including tokenization and lemmatization, ensure data readiness for machine learning. The deployment of a Streamlit based application enhances usability by providing real-time analysis and visualization. Experimental results demonstrate high classification accuracy, emphasizing the effectiveness of the Random Forest model for sentiment analysis tasks. The integration of machine learning and web technologies highlights the potential of this system in real-world applications such as brand management, political analysis, and customer feedback monitoring.

Keywords: Sentiment Analysis, Random Forest, Machine Learning

Suggested Citation

Khan, Darab and Ahmed, Zeeshan, A Random Forest Approach for Real-Time Sentiment Analysis of Twitter Data (February 05, 2025). Available at SSRN: https://ssrn.com/abstract=5205637 or http://dx.doi.org/10.2139/ssrn.5205637

Darab Khan

Galgotias University ( email )

Zeeshan Ahmed (Contact Author)

Galgotias University ( email )

Do you have a job opening that you would like to promote on SSRN?

Paper statistics

Downloads
13
Abstract Views
32
PlumX Metrics