A Hybrid Physics-Xgboost Framework for Dam Breach Parameter Prediction with Missing Data

30 Pages Posted: 20 May 2025

See all articles by Meiman Zhang

Meiman Zhang

Hohai University

Lingling Wang

Hohai University

Jianjian Cui

affiliation not provided to SSRN

Youming Zhang

affiliation not provided to SSRN

Tianyu Lei

affiliation not provided to SSRN

Mengtian Wu

Hohai University

Jianjun Han

Hohai University

Jin Xu

Hohai University

Multiple version iconThere are 2 versions of this paper

Abstract

Study RegionGlobally, dam breach-prone regions, often characterized by aging infrastructure, densely populated downstream areas, and increasing extreme weather events, pose significant challenges to accurate risk assessment and sustainable mitigation strategies.Study focusAccurate prediction of dam breach parameters is critical for disaster risk mitigation, yet conventional methods face limitations due to data scarcity and nonlinear hydromechanical interactions. This study proposes a hybrid framework integrating the physical mechanisms of the BREACH model with data-driven machine learning (ML) to address missing-data challenges. Leveraging Ward’s clustering method, key parameters (e.g., dam height, reservoir storage, breach width) are weighted based on their hydro-mechanical coupling effects derived from the BREACH model, enabling the development of a physics-driven empirical formula for missing data imputation. The framework combines reconstructed parameters from 152 incomplete cases with 40 complete dam-break cases, forming a robust dataset of 192 samples.New hydrological insights for the regionValidated against traditional approaches, the hybrid Physics-XGBoost model achieves superior performance: it improves peak discharge prediction accuracy by 22% (R²=0.915 vs. 0.73 for mean imputation) and reduces errors by 16% compared to purely physics-based empirical formulas. The integration of BREACH-derived physical weights and clustering-driven relationships enhances interpretability while resolving the accuracy-efficiency trade-off in dam breach simulations. This advancement supports real-time emergency decision-making and infrastructure resilience enhancement, offering a paradigm shift in data-scarce hydraulic disaster modeling.

Keywords: Dam breach parameters, hybrid framework, physics-driven, empirical formula, XGBoost, Ward's clustering method, missing data

Suggested Citation

Zhang, Meiman and Wang, Lingling and Cui, Jianjian and Zhang, Youming and Lei, Tianyu and Wu, Mengtian and Han, Jianjun and Xu, Jin, A Hybrid Physics-Xgboost Framework for Dam Breach Parameter Prediction with Missing Data. Available at SSRN: https://ssrn.com/abstract=5261596 or http://dx.doi.org/10.2139/ssrn.5261596

Meiman Zhang

Hohai University ( email )

8 Focheng West Road
Jiangning District
Nanjing, 211100
China

Lingling Wang

Hohai University ( email )

Jianjian Cui

affiliation not provided to SSRN ( email )

No Address Available

Youming Zhang

affiliation not provided to SSRN ( email )

No Address Available

Tianyu Lei

affiliation not provided to SSRN ( email )

No Address Available

Mengtian Wu

Hohai University ( email )

8 Focheng West Road
Jiangning District
Nanjing, 211100
China

Jianjun Han

Hohai University ( email )

Jin Xu (Contact Author)

Hohai University ( email )

Do you have a job opening that you would like to promote on SSRN?

Paper statistics

Downloads
3
Abstract Views
45
PlumX Metrics