Predictive Analytics For Healthcare Insurance Risk Assessment Using Ensemble Learning Models

Posted: 23 Jan 2025

Date Written: June 22, 2024

Abstract

Accurate healthcare insurance risk assessment is essential in designing costeffective and personalized insurance plans. The study proposes the Dynamic Ensemble Risk Stratification Algorithm (DERISA), a new approach using advanced ensemble learning techniques for predictive analytics in healthcare insurance. With Random Forest, Gradient Boosting Machine (GBM), and XGBoost models integrated within a dynamically weighted ensemble framework, DERISA predicts insurance risks with high precision. Feature engineering techniques such as PCA and mutual information are followed to extract and optimize relevant features from the dataset, including historical claims, demographic attributes, and medical histories. Experimental evaluation on real data for healthcare insurance reveals that the efficacy of DERISA is better than traditional machine learning. Accuracy is 95.2%, precision is 94.1%, recall is 93.8%, and AUC-ROC scores 0.97 is better than individual ensemble models. The algorithm stratifies policyholders into well-defined risk tiers, accurately capturing high-risk profiles with higher precision that can help in providing premium premium plans and preventive health incentives.

Keywords: Healthcare Insurance Risk Assessment, Ensemble Learning, Dynamic Risk Stratification, Predictive Analytics, Feature Engineering

Suggested Citation

1983, Narsimhan.s, Predictive Analytics For Healthcare Insurance Risk Assessment Using Ensemble Learning Models (June 22, 2024). Available at SSRN: https://ssrn.com/abstract=5034056

Narsimhan.s 1983 (Contact Author)

Independent Researcher ( email )

Do you have a job opening that you would like to promote on SSRN?

Paper statistics

Abstract Views
167
PlumX Metrics