TY - JOUR
T1 - Enhanced understanding of osmotic membrane bioreactors through machine learning modeling of water flux and salinity
AU - Chang, Hau Ming
AU - Xu, Yanran
AU - Chen, Shiao Shing
AU - He, Zhen
N1 - Publisher Copyright:
© 2022 Elsevier B.V.
PY - 2022/9/10
Y1 - 2022/9/10
N2 - Mathematical modeling can be helpful to understand and optimize osmotic membrane bioreactors (OMBR), a promising technology for sustainable wastewater treatment with simultaneous water recovery. Herein, seven machine learning (ML) algorithms were employed to model both water flux and salinity of a lab-scale OMBR. Through the optimum hyperparameters tuning and 5-fold cross-validation, the ML models have achieved more accurate results without obvious overfitting and bias. The median R2 scores of water flux modeling were all over the 0.95 and the most of median R2 scores from total dissolved solids (TDS) modeling were higher than 0.90. During model testing, random forest (RF) algorithm presented the highest R2 score of 0.987 with the lowest root mean square error (RMSE = 0.044) for the water flux modeling, and extreme gradient boosting (XGB) algorithm exhibited the best results (R2 = 0.97; RMSE = 0.234) in the TDS modeling. The Shapley Additive exPlanations (SHAP) analysis found that the phosphorus concentration was a critical input feature for both water flux and TDS modeling. Finally, the selected ML models were used to predict water flux and salinity affected by two input features and the predication results confirmed the importance of the phosphate concentration. The results of this study have demonstrated the promise of ML modeling for investigating OMBR systems.
AB - Mathematical modeling can be helpful to understand and optimize osmotic membrane bioreactors (OMBR), a promising technology for sustainable wastewater treatment with simultaneous water recovery. Herein, seven machine learning (ML) algorithms were employed to model both water flux and salinity of a lab-scale OMBR. Through the optimum hyperparameters tuning and 5-fold cross-validation, the ML models have achieved more accurate results without obvious overfitting and bias. The median R2 scores of water flux modeling were all over the 0.95 and the most of median R2 scores from total dissolved solids (TDS) modeling were higher than 0.90. During model testing, random forest (RF) algorithm presented the highest R2 score of 0.987 with the lowest root mean square error (RMSE = 0.044) for the water flux modeling, and extreme gradient boosting (XGB) algorithm exhibited the best results (R2 = 0.97; RMSE = 0.234) in the TDS modeling. The Shapley Additive exPlanations (SHAP) analysis found that the phosphorus concentration was a critical input feature for both water flux and TDS modeling. Finally, the selected ML models were used to predict water flux and salinity affected by two input features and the predication results confirmed the importance of the phosphate concentration. The results of this study have demonstrated the promise of ML modeling for investigating OMBR systems.
KW - Artificial neural network
KW - Machine learning
KW - Modeling
KW - Osmotic membrane bioreactor
KW - SHAP analysis
KW - Water and wastewater treatment
UR - https://www.scopus.com/pages/publications/85130570411
U2 - 10.1016/j.scitotenv.2022.156009
DO - 10.1016/j.scitotenv.2022.156009
M3 - Article
C2 - 35595138
AN - SCOPUS:85130570411
SN - 0048-9697
VL - 838
JO - Science of the Total Environment
JF - Science of the Total Environment
M1 - 156009
ER -