TOC Prediction from Well Logs Using Gradient Boosting and Neural Network in the Santos Basin, SE Brazil

40 Pages Posted: 10 Jun 2025

See all articles by Bernardo Chede

Bernardo Chede

Universidade Federal Fluminense

Andre Belem

UFF - Universidade Federal Fluminense - School of Engineering

ANA LUIZA S. ALBUQUERQUE

University of São Paulo (USP)

Luis Henrique Cordeiro

Universidade Federal Fluminense

IGOR M. VENANCIO

University of Bremen - Center for Marine Environmental Sciences

Victor Carreira

Universidade Federal do Acre (UFAC)

Kristoffer Hallam

Universidade Federal Fluminense

Lara de Paula Hercolano

Universidade Federal Fluminense

Rodrigo Sobrinho

Universidade Federal Fluminense

Andre Luiz Durante Spigolon

Petrobras

Pedro Afonso

Universidade Federal Fluminense

Ulrich G. Wortmann

University of Toronto

Date Written: April 09, 2025

Abstract

Accurate prediction of total organic carbon (TOC) in subsurface formations is crucial for evaluating source rock quality and optimizing exploration strategies in hydrocarbon prolific basins. Traditional methods like the ΔlogR technique often require local calibration and may fail to capture the non-linear relationships between well-log parameters and TOC, leading to inaccuracies. This study applies three machine learning (ML) models-Gradient Boosting Decision Trees (GBDT), Extreme Gradient Boosting (XGBoost), and Multi-Layer Perceptron (MLP)-to predict TOC from well-log data in the Santos Basin, Brazil's largest offshore basin. We employed robust data preprocessing techniques, including outlier detection using Density-Based Spatial Clustering and feature reduction through Principal Component Analysis. Bayesian optimization was utilized for hyperparameter tuning to enhance model performance. The results indicate that all ML models outperformed the traditional ΔlogR method, with GBDT achieving the highest prediction accuracy. Reducing the dataset from five wells to three more homogeneous wells significantly improved the GBDT model's performance, underscoring the importance of data quality and relevance. This study demonstrates the potential of ML models in capturing complex, non-linear relationships in geophysical data and highlights the challenges of generalizing these models across diverse geological settings. The findings contribute to improved TOC estimation and can enhance exploration strategies in similar geological contexts.

Keywords: Total Organic Carbon, Gradient Boosting, Neural Network, Supervised Machine Learning, Unsupervised Machine Learning, Santos Basin

Suggested Citation

Chede, Bernardo and Belem, Andre and ALBUQUERQUE, ANA LUIZA S. and Cordeiro, Luis Henrique and VENANCIO, IGOR M. and Carreira, Victor and Hallam, Kristoffer and Hercolano, Lara de Paula and Sobrinho, Rodrigo and Spigolon, Andre Luiz Durante and Afonso, Pedro and Wortmann, Ulrich G., TOC Prediction from Well Logs Using Gradient Boosting and Neural Network in the Santos Basin, SE Brazil (April 09, 2025). Available at SSRN: https://ssrn.com/abstract=5211368 or http://dx.doi.org/10.2139/ssrn.5211368

Bernardo Chede (Contact Author)

Universidade Federal Fluminense ( email )

Andre Belem

UFF - Universidade Federal Fluminense - School of Engineering ( email )

Brazil

Ana Luiza S. Albuquerque

University of São Paulo (USP) ( email )

Luis Henrique Cordeiro

Universidade Federal Fluminense ( email )

Igor M. Venancio

University of Bremen - Center for Marine Environmental Sciences ( email )

Universitaetsallee GW I
Bremen, D-28334
Germany

Victor Carreira

Universidade Federal do Acre (UFAC) ( email )

Kristoffer Hallam

Universidade Federal Fluminense ( email )

Lara De Paula Hercolano

Universidade Federal Fluminense ( email )

Rodrigo Sobrinho

Universidade Federal Fluminense ( email )

Andre Luiz Durante Spigolon

Petrobras ( email )

Av Chile, 65, room 401
Rio de Janeiro, 20031-912
Brazil

Pedro Afonso

Universidade Federal Fluminense ( email )

Ulrich G. Wortmann

University of Toronto

Do you have a job opening that you would like to promote on SSRN?

Paper statistics

Downloads
65
Abstract Views
451
Rank
933,592
PlumX Metrics