Performance Analysis of Under-Sampling and Over-Sampling Techniques for Solving Class Imbalance Problem

11 Pages Posted: 14 Jun 2019

See all articles by Rekha G

Rekha G

Koneru Lakshmaiah Education Foundation

Amit Kumar Tyagi

Lingaya's University - Department of Computer Science and Engineering

V. Krishna Reddy

Koneru Lakshmaiah Education Foundation - Department of Computer Science and Engineering

Date Written: March 20, 2019

Abstract

Most of the traditional classification algorithms assume their training data to be well-balanced in terms of class distribution. Real-world datasets, however, are imbalanced in nature thus degrade the performance of the traditional classifiers. An imbalance data-set typically make prediction accuracy difficult. Data pre-processing approaches discuss this issue by using random under-sampling or oversampling techniques. To solve this problem, many strategies are adopted to balance the class distribution at the data level. The data level methods balance the imbalance distribution between majority and minority classes using either oversampling or under-sampling techniques. In this paper, we present the performance analysis of under-sampling method and oversampling methods. The methods are implemented with 5 conventional classifiers like C4.5 Decision Tree (DT), k-Nearest Neighbor (k-NN), Multilayer Perceptron (MLP), Support Vector Machine (SVM), and Naive Bayes (NB) on 15 real life data sets. The experimental results show comparative study of under-sampling and over sampling technique.

Suggested Citation

G, Rekha and Tyagi, Amit Kumar and Reddy, V. Krishna, Performance Analysis of Under-Sampling and Over-Sampling Techniques for Solving Class Imbalance Problem (March 20, 2019). Proceedings of International Conference on Sustainable Computing in Science, Technology and Management (SUSCOM), Amity University Rajasthan, Jaipur - India, February 26-28, 2019, Available at SSRN: https://ssrn.com/abstract=3356374 or http://dx.doi.org/10.2139/ssrn.3356374

Rekha G (Contact Author)

Koneru Lakshmaiah Education Foundation ( email )

Amit Kumar Tyagi

Lingaya's University - Department of Computer Science and Engineering ( email )

India

V. Krishna Reddy

Koneru Lakshmaiah Education Foundation - Department of Computer Science and Engineering ( email )

Do you have a job opening that you would like to promote on SSRN?

Paper statistics

Downloads
325
Abstract Views
1,939
Rank
231,953
PlumX Metrics