A New Coal Seam Gas Content Prediction Model Based on Feature Selection and Machine Learning Fusion in Sparse Datasets

48 Pages Posted: 4 Dec 2024

See all articles by Sheng Su

Sheng Su

affiliation not provided to SSRN

Liang Wang

affiliation not provided to SSRN

Songwei Wu

affiliation not provided to SSRN

Yuechen Zhao

affiliation not provided to SSRN

Chenghao Wang

affiliation not provided to SSRN

Longyong Shu

affiliation not provided to SSRN

Abstract

Coal seam gas content is a key factor for mine safety and methane resource utilization. To improve prediction accuracy, we collected gas content data from 27 groups on the left flank of the Huaibei Taoyuan Coal Mining area. The main factors affecting gas content were selected based on geological structure, coal quality characteristics, and the conditions of coal occurrence. Subsequently, various algorithms were used to identify the main controlling factors of gas content, and different regression prediction models were applied based on the selected feature combinations. The Genetic Algorithm with Ant Colony Optimization (GA-ACO) was introduced and compared with several traditional feature selection algorithms, including All Subset Regression (ASR), Forward Selection Regression (FSR), Backward Selection Regression (BSR), Random Forest Algorithm (RFA), and Lasso regression, to construct the main control factor set for gas content. Additionally, The Particle Swarm Optimization Support Vector Regression (PSO-SVR) to build a regression prediction model. Various regression models such as Multivariate Linear Regression (MLR), Multi-Layer Perceptron (MLP), Gradient Boosting Regression Tree (GBRT) and SVR were compared to verify the reliability of the model. The dataset was randomly divided into training and validation sets in an 8:2 ratio, and the prediction performance was evaluated using metrices. The results show that the use of feature selection and machine learning effectively predicts the gas content of coal seams. PSO-SVR model based on ASR achieved the best prediction performance with an R2of 0.99 on the training set and 0.978 on the test set. The maximum error was 0.077. Finally, Using the constructed model to predict regional gas content and plotted a regional gas content distribution map. This study demonstrates the potential of combining feature selection with machine learning for regression prediction in coal bed geology, offering a feasible and promising approach for research in this field.

Keywords: Coal and gas outburst, Coal seam gas content, GA-ACO algorithm, PSO-SVR model

Suggested Citation

Su, Sheng and Wang, Liang and Wu, Songwei and Zhao, Yuechen and Wang, Chenghao and Shu, Longyong, A New Coal Seam Gas Content Prediction Model Based on Feature Selection and Machine Learning Fusion in Sparse Datasets. Available at SSRN: https://ssrn.com/abstract=5043991 or http://dx.doi.org/10.2139/ssrn.5043991

Sheng Su

affiliation not provided to SSRN ( email )

No Address Available

Liang Wang (Contact Author)

affiliation not provided to SSRN ( email )

No Address Available

Songwei Wu

affiliation not provided to SSRN ( email )

No Address Available

Yuechen Zhao

affiliation not provided to SSRN ( email )

No Address Available

Chenghao Wang

affiliation not provided to SSRN ( email )

No Address Available

Longyong Shu

affiliation not provided to SSRN ( email )

No Address Available

Do you have a job opening that you would like to promote on SSRN?

Paper statistics

Downloads
25
Abstract Views
148
PlumX Metrics