Classify Authentic & Fraudulent Reviews Using Supervised Machine Learning

13 Pages Posted: 23 Dec 2022

See all articles by Sowjanya Jindam

Sowjanya Jindam

Maturi Venkata Subba Rao Engineering College

Pranav Reddy Loka

Maturi Venkata Subba Rao (MVSR) Engineering College, Department of Information Technology

Vineela Munigala

Maturi Venkata Subba Rao (MVSR) Engineering College, Department of Information Technology

Jaimini Keerthan Mannem

Maturi Venkata Subba Rao (MVSR) Engineering College, Department of Information Technology

Date Written: December 6, 2022

Abstract

Earlier, if the customer has to buy a product, then that customer has to visit the showroom. For the suggestions and reviews,he would take help from salespersons, friends or family who has already purchased the product. In recent times e-commerce has taken a leap with the availability of technology for everyone and online shopping has become lucrative with the offers provided by the e-commerce sites. Sales of products mostly tend to depend on reviews given by the customer who purchased products previously. Large part of purchasers read reviews of products given by customers or stores before making the decision of what, or from where to buy and whether to buy the product or not. So, in order to retain customers and increase sales with monitory gain, there has been a huge increase in deceptive opinion spam on online review websites.

Basically, fake review or fraudulent review is an untruthful review. Our work is aimed at identifying whether a review and ratings given are fake or truthful one by using supervised machine learning approaches. It analyses the existing feature extraction techniques. It also compares the performance of several experiments done on Amazon Review data set, which is extensive and reputable. We have compared the performance of machine learning classifiers like K-Nearest Neighbors (KNN), Naïve Bayes (NB), Random Forest, Support Vector Model (SVM),Decision tree, XGBoost and Logistic Regression.The results reveal that Logistic Regression outperforms the rest of classifiers in terms of accuracy. The results show that the system has better ability to detect a review as fake or original.

Keywords: Supervised Machine Learning, Fake Reviews, Fake Review Detection, Logistic Regression, Naive Bayes (NB), K-Nearest Neighbors (KNN), Random Forest, Decision tree, XGBoost, Support Vector Model (SVM)

Suggested Citation

Jindam, Sowjanya and Loka, Pranav Reddy and Munigala, Vineela and Mannem, Jaimini Keerthan, Classify Authentic & Fraudulent Reviews Using Supervised Machine Learning (December 6, 2022). Available at SSRN: https://ssrn.com/abstract=4294478 or http://dx.doi.org/10.2139/ssrn.4294478

Sowjanya Jindam (Contact Author)

Maturi Venkata Subba Rao Engineering College ( email )

Nadergul
Balapur Mandal
Hyderabad, India 500060
India

HOME PAGE: http://www.mvsrec.edu.in

Pranav Reddy Loka

Maturi Venkata Subba Rao (MVSR) Engineering College, Department of Information Technology ( email )

Vineela Munigala

Maturi Venkata Subba Rao (MVSR) Engineering College, Department of Information Technology ( email )

Jaimini Keerthan Mannem

Maturi Venkata Subba Rao (MVSR) Engineering College, Department of Information Technology ( email )

Do you have a job opening that you would like to promote on SSRN?

Paper statistics

Downloads
74
Abstract Views
309
Rank
703,116
PlumX Metrics