Method for Establishing Predictive Models for Total Organic Halogen Based on Piecewise Interpolation and Machine Learning

32 Pages. Posted: 18 Sep 2022

Yinan Bu

Hainan University

Liangliang Shi

Hainan University

Bin Ma

Hainan University

There are 2 versions of this paper.

Abstract

In disinfection by-product (DBP) research, total organic halogen (TOX) is an important aggregate parameter: it reports the total content of halogenated DBPs in water and is determined in a single experimental procedure. TOX modeling can facilitate the prediction, diagnosis, and control of the drinking water disinfection process. Such models are often built on the reaction mechanisms of the disinfection process. However, these mechanisms are complex and nonlinear, so building an accurate TOX model is difficult, and the many simplifications made during modeling result in poor adaptability of the TOX model in practical applications. Machine learning algorithms are data-driven modeling methods that can achieve high prediction accuracy and are simple and convenient to apply. In practical experiments, however, the TOX dataset is often small (usually < 10 points), which makes TOX modeling with machine learning algorithms particularly difficult. To address this issue, this study established a method that uses piecewise interpolation to expand the TOX dataset and then applies machine learning algorithms to build the model. Three common machine learning algorithms (backpropagation neural network, radial basis function neural network, and support vector machine) were used to evaluate the data expansion method. The modeling of TOX for a chloramination and chlorination disinfection process shows that this method achieves satisfactory sensitivity and accuracy. All the models provided favorable predictions, with high correlation coefficients (> 0.99) and low mean square errors (< 5.31 × 10⁻⁵).

Keywords: Total organic halogen, disinfection by-product, piecewise interpolation, machine learning, prediction
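
The data expansion step described in the abstract can be illustrated with a minimal sketch. The abstract does not specify the interpolation scheme, so piecewise linear interpolation is assumed here; the function names (`piecewise_linear_expand`, `mean_squared_error`, `correlation_coefficient`) and the sample values are hypothetical and are not taken from the paper. The sketch densifies a small dataset and defines the two evaluation metrics the abstract reports; fitting a neural network or support vector machine to the expanded set is left out.

```python
from math import sqrt


def piecewise_linear_expand(xs, ys, points_per_segment=10):
    """Densify a small dataset by linear interpolation between neighbours.

    Assumption: linear pieces. The paper's exact scheme is not given
    in the abstract.
    """
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need at least two (x, y) samples of equal length")
    pairs = sorted(zip(xs, ys))  # order samples by x
    new_x, new_y = [], []
    for (x0, y0), (x1, y1) in zip(pairs, pairs[1:]):
        for k in range(points_per_segment):
            t = k / points_per_segment  # fractional position in the segment
            new_x.append(x0 + t * (x1 - x0))
            new_y.append(y0 + t * (y1 - y0))
    # Close the last segment with the final measured sample.
    new_x.append(pairs[-1][0])
    new_y.append(pairs[-1][1])
    return new_x, new_y


def mean_squared_error(y_true, y_pred):
    """Mean of squared differences between targets and predictions."""
    return sum((a - b) ** 2 for a, b in zip(y_true, y_pred)) / len(y_true)


def correlation_coefficient(a, b):
    """Pearson correlation coefficient of two equal-length sequences."""
    ma, mb = sum(a) / len(a), sum(b) / len(b)
    cov = sum((x - ma) * (y - mb) for x, y in zip(a, b))
    var_a = sum((x - ma) ** 2 for x in a)
    var_b = sum((y - mb) ** 2 for y in b)
    return cov / sqrt(var_a * var_b)


# Synthetic, illustrative values only (NOT data from the paper):
# contact time in hours versus a normalized TOX reading.
time_h = [0.0, 1.0, 2.0, 4.0, 8.0, 24.0]
tox = [0.00, 0.21, 0.35, 0.52, 0.70, 0.93]

t_dense, tox_dense = piecewise_linear_expand(time_h, tox, points_per_segment=10)
print(len(t_dense))  # 5 segments * 10 points + 1 endpoint = 51
```

The expanded pairs (`t_dense`, `tox_dense`) would then serve as training data for a regression model, and the two metric functions would be applied to its predictions. Note that interpolated points carry no new experimental information, so a model scored only on such points may look more accurate than it is on independent measurements.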

Suggested Citation

Bu, Yinan and Shi, Liangliang and Ma, Bin, Method for Establishing Predictive Models for Total Organic Halogen Based on Piecewise Interpolation and Machine Learning. Available at SSRN: https://ssrn.com/abstract=4220101 or http://dx.doi.org/10.2139/ssrn.4220101

Yinan Bu

Hainan University

No. 58, Renmin Avenue
Haikou, Hainan Province 570228
P.R. China

Liangliang Shi

Hainan University

No. 58, Renmin Avenue
Haikou, Hainan Province 570228
P.R. China

Bin Ma (Contact Author)

Hainan University

No. 58, Renmin Avenue
Haikou, Hainan Province 570228
P.R. China
