Enhancing Risk Assessment in Auto Insurance with Data-Driven Insights using Machine Learning

7 Pages Posted: 28 May 2025

See all articles by Laxmana Murthy Karaka

Laxmana Murthy Karaka

Code Ace Solutions, Inc.

Rahul Vadisetty

Wayne State University

Vasu Velaga

Cintas Corporation; IBM India Private Limited

Kishankumar Routhu

Automatic Data Processing, Inc. (ADP)

GANGADHAR SADARAM

affiliation not provided to SSRN

Srikanth Reddy Vangala

University of Bridgeport

Suneel Babu Boppana

isite Technologies

Date Written: November 28, 2023

Abstract

The insurance firms, detecting auto insurance fraud is a major difficulty that may result in large financial losses. Insurance customers suffer substantial monetary losses, including higher premiums, as a result of claims that are forged. Manual inspections and rule-based techniques are the foundation of conventional fraud detection techniques, which are ineffective and unable to keep up with changing fraud trends. Machine learning (ML) is used in this investigation technique to improve fraud detection accuracy by analyzing vehicle insurance claim data. Extensive data preprocessing was applied, including handling missing values, feature selection, one-hot encoding, Min-Max normalization, and oversampling to address the severe class imbalance. A Random Forest (RF) classifier accomplished the uppermost accuracy (97.5%), outperforming Logistic Regression (LR) (87.1%) and EXtreme Gradient Boosting (XGBoost) (77.61%). Random Forest (RF) also showed superior precision (95.6%), recall (99.5%), and F1-score (97.5%), with an AUC of 0.98 from the ROC analysis, confirming its effectiveness. Despite its strong performance, limitations include dataset age and synthetic data from oversampling. The proposed approach offers an automated, scalable, and efficient fraud detection system, enhancing decision-making in the business of protection. Using machine learning (ML), this research offers an inexpensive and effective way to improve automobile insurance fraud detection.

Keywords: Auto Insurance, Risk Assessment, Insurance Fraud Prediction, Fraud Detection, Machine Learning (ML), vehicle insurance data

Suggested Citation

Karaka, Laxmana Murthy and Vadisetty, Rahul and Velaga, Vasu and Routhu, Kishankumar and SADARAM, GANGADHAR and Vangala, Srikanth Reddy and Boppana, Suneel Babu, Enhancing Risk Assessment in Auto Insurance with Data-Driven Insights using Machine Learning (November 28, 2023). Available at SSRN: https://ssrn.com/abstract=5254541 or http://dx.doi.org/10.2139/ssrn.5254541

Laxmana Murthy Karaka (Contact Author)

Code Ace Solutions, Inc. ( email )

Rahul Vadisetty

Wayne State University ( email )

Vasu Velaga

Cintas Corporation ( email )

6800 Cintas Blvd
Mason, OH 45040
United States

IBM India Private Limited ( email )

Embassy Tech Zone
Rajeev Gandhi IT Park, Phase III, Hinjawadi
Gurgaon, MS 411057
India

Kishankumar Routhu

Automatic Data Processing, Inc. (ADP) ( email )

Roseland, NJ
United States

GANGADHAR SADARAM

affiliation not provided to SSRN

Srikanth Reddy Vangala

University of Bridgeport ( email )

126 Park Avenue
Bridgeport, CT 06601
United States

Suneel Babu Boppana

isite Technologies ( email )

379 princeton Hightstown RD
East Windsor, NJ 08512
United States

Do you have a job opening that you would like to promote on SSRN?

Paper statistics

Downloads
14
Abstract Views
67
PlumX Metrics