A Big Data Approach to Black Friday Sales

M. J. Awan, M. Shafry, H. Nobanee, A. Yasin, O. I. Khalaf et al., "A big data approach to black friday sales," Intelligent Automation & Soft Computing, vol. 27, no.3, pp. 785–797, 2021.

13 Pages Posted: 15 Apr 2021

See all articles by Mazhar Javed Awan

Mazhar Javed Awan

Department of Software Engineering, University of Management and Technology, Lahore, Pakistan

Mohd Shafry Mohd Rahim

Universiti Teknologi Malaysia (UTM)

Haitham Nobanee

University of Oxford; Abu Dhabi University; University of Liverpool

Awais Yasin

National University of Technology - Department of Computer Engineering

Osamah Ibrahim Khalaf

Nahrain University - AlNahrain Nanorenewable Energy Research Centre

Date Written: March 01, 2021

Abstract

Retail companies recognize the need to analyze and predict their sales and customer behavior against their products and product categories. Our study aims to help retail companies create personalized deals and promotions for their customers, even during the COVID-19 pandemic, through a big data framework that allows them to handle massive sales volumes with more efficient models. In this paper, we used Black Friday sales data taken from a dataset on the Kaggle website, which contains nearly 550,000 observations analyzed with 10 features: qualitative and quantitative. The class label is purchases and sales (in U.S. dollars). Because the predictor label is continuous, regression models are suited in this case. Using the Apache Spark big data framework, which uses the MLlib machine learning library, we trained two machine learning models: linear regression and random forest. These machine learning algorithms were used to predict future pricing and sales. We first implemented a linear regression model and a random forest model without using the Spark framework and achieved accuracies of 68% and 74%, respectively. Then, we trained these models on the Spark machine learning big data framework where we achieved an accuracy of 72% for the linear regression model and 81% for the random forest model.

Keywords: Big data; correlation and regression analysis; machine learning; numerical algorithms; performance; prediction; Black Friday sales; cloud

Suggested Citation

Javed Awan, Mazhar and Mohd Rahim, Mohd Shafry and Nobanee, Haitham and Nobanee, Haitham and Yasin, Awais and Khalaf, Osamah Ibrahim, A Big Data Approach to Black Friday Sales (March 01, 2021). M. J. Awan, M. Shafry, H. Nobanee, A. Yasin, O. I. Khalaf et al., "A big data approach to black friday sales," Intelligent Automation & Soft Computing, vol. 27, no.3, pp. 785–797, 2021., Available at SSRN: https://ssrn.com/abstract=3827077

Mazhar Javed Awan (Contact Author)

Department of Software Engineering, University of Management and Technology, Lahore, Pakistan ( email )

Johar Town - Phase 1
Lahore, Punjab 54770
Pakistan

Mohd Shafry Mohd Rahim

Universiti Teknologi Malaysia (UTM) ( email )

81310 Sekolah Agama
Johor Bahru, Johor
Malaysia

Haitham Nobanee

Abu Dhabi University ( email )

Abu Dhabi
United Arab Emirates

University of Oxford ( email )

Mansfield Road
Oxford, Oxfordshire OX1 4AU
United Kingdom

University of Liverpool ( email )

Chatham Street
Brownlow Hill
Liverpool, L69 7ZA
United Kingdom

Awais Yasin

National University of Technology - Department of Computer Engineering ( email )

Pakistan

Osamah Ibrahim Khalaf

Nahrain University - AlNahrain Nanorenewable Energy Research Centre

Iraq

Do you have a job opening that you would like to promote on SSRN?

Paper statistics

Downloads
1,336
Abstract Views
3,150
Rank
23,445
PlumX Metrics