header

Tree-Based Machine Learning Models for Predicting the Maximum Depth of Corrosion Defects Based on Historical In-Line Inspection Data

28 Pages Posted: 5 Feb 2025 Publication Status: Under Review

See all articles by Eyad Abdullah Alshaye

Eyad Abdullah Alshaye

King Fahd University of Petroleum & Minerals (KFUPM)

Atif AlZahrani

King Fahd University of Petroleum & Minerals (KFUPM)

Abduljabar Al-Sayoud

King Fahd University of Petroleum & Minerals (KFUPM)

Md Shafiullah

King Fahd University of Petroleum & Minerals (KFUPM)

Abstract

Oil and gas pipelines are the primary means of fluid transportation in the industry due to their efficiency, reliability, and cost-effectiveness. However, pipeline corrosion poses significant risks, leading to loss of containment, operational interruptions, and potential loss of life. The introduction of novel and corrosive fluid services, such as hydrogen and carbon dioxide (CO2), is expected to exacerbate corrosion-related issues. Consequently, pipeline inspection techniques, particularly In-Line Inspection (ILI), are increasingly vital for corrosion monitoring. This paper establishes a framework for utilizing ILI data to develop machine learning models for corrosion prediction. Four tree-based machine learning techniques—eXtreme Gradient Boosting (XGBoost), Dropouts meet multiple Additive Regression Trees (DART), Light Gradient-Boosting Machine (LightGBM) with linear trees, and random forests—were employed to predict the maximum depth of corrosion defects based exclusively on historical ILI data. All models significantly outperformed the Naïve forecasting benchmark, with the best model achieving a root mean square error (RMSE) of 0.368 mm on the test set, surpassing the benchmark by 41.5%. Furthermore, the accuracy of these models exceeded that of most service-based corrosion prediction models in the literature. This success was achieved using an ILI dataset that exhibited extreme variations in maximum depth distribution and changes in reporting criteria. These findings indicate that the proposed framework is a superior alternative to service-based models, leveraging the vast amount of ILI data available to pipeline operators. Additionally, the service-agnostic framework supports the integration of process and service parameters, further enhancing the efficacy of machine learning models for corrosion prediction.

Keywords: In-line inspection, machine learning, Supervised learning, Pipeline corrosion, Maximum depth prediction, Statistical analysis, Sustainability

Suggested Citation

Alshaye, Eyad Abdullah and AlZahrani, Atif and Al-Sayoud, Abduljabar and Shafiullah, Md, Tree-Based Machine Learning Models for Predicting the Maximum Depth of Corrosion Defects Based on Historical In-Line Inspection Data. Available at SSRN: https://ssrn.com/abstract=5106767 or http://dx.doi.org/10.2139/ssrn.5106767

Eyad Abdullah Alshaye (Contact Author)

King Fahd University of Petroleum & Minerals (KFUPM) ( email )

Atif AlZahrani

King Fahd University of Petroleum & Minerals (KFUPM) ( email )

Abduljabar Al-Sayoud

King Fahd University of Petroleum & Minerals (KFUPM) ( email )

Md Shafiullah

King Fahd University of Petroleum & Minerals (KFUPM) ( email )

Do you have a job opening that you would like to promote on SSRN?

Paper statistics

Downloads
42
Abstract Views
211
PlumX Metrics