Source identification and prediction of nitrogen and phosphorus pollution of Lake Taihu by an ensemble machine learning technique
Effective control of lake eutrophication necessitates a full understanding of the complicated nitrogen and phosphorus pollution sources, for which mathematical modeling is commonly adopted. In contrast to the conventional knowledge-based models that usually perform poorly due to insufficient knowledge of pollutant geochemical cycling, we employed an ensemble machine learning (ML) model to identify the key nitrogen and phosphorus sources of lakes. Six ML models were developed based on 13 years of historical data of Lake Taihu’s water quality, environmental input, and meteorological conditions, among which the XGBoost model stood out as the best model for total nitrogen (TN) and total phosphorus (TP) prediction. The results suggest that the lake TN is mainly affected by the endogenous load and inflow river water quality, while the lake TP is predominantly from endogenous sources. The prediction of the lake TN and TP concentration changes in response to these key feature variations suggests that endogenous source control is a highly desirable option for lake eutrophication control. Finally, one-month-ahead prediction of lake TN and TP concentrations (R² of 0.85 and 0.95, respectively) was achieved based on this model with sliding time window lengths of 9 and 6 months, respectively. Our work demonstrates the great potential of using ensemble ML models for lake pollution source tracking and prediction, which may provide valuable references for early warning and rational control of lake eutrophication.
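The sliding-time-window setup mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' code: the helper names `make_sliding_windows` and `r_squared` are hypothetical, the series is placeholder data rather than Lake Taihu measurements, and the abstract does not state how the windows or the R² score were actually computed. The sketch only shows the general idea of using the previous `window` months as features to predict the following month, and the standard coefficient of determination for scoring.

```python
def make_sliding_windows(series, window):
    """Build (features, target) pairs from a monthly series.

    Each sample uses `window` consecutive months as features and the
    month immediately after them as the target (one-month-ahead).
    """
    if window < 1:
        raise ValueError("window must be at least 1")
    features, targets = [], []
    for start in range(len(series) - window):
        features.append(list(series[start:start + window]))
        targets.append(series[start + window])
    return features, targets


def r_squared(observed, predicted):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean_obs = sum(observed) / len(observed)
    ss_res = sum((o - p) ** 2 for o, p in zip(observed, predicted))
    ss_tot = sum((o - mean_obs) ** 2 for o in observed)
    return 1.0 - ss_res / ss_tot


# Placeholder monthly values (assumption: illustrative only, not study data).
monthly_values = [float(v) for v in range(12)]

# A 9-month window, the window length the abstract reports for TN.
X, y = make_sliding_windows(monthly_values, window=9)
```

Each row of `X` could then be passed to any regressor (the abstract reports XGBoost as the best of six models), and `r_squared` compares its predictions against the held-out targets.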
Front. Environ. Sci. Eng.
Hu, Yirong (Author) / Du, Wenjie (Author) / Yang, Cheng (Author) / Wang, Yang (Author) / Huang, Tianyin (Author) / Xu, Xiaoyi (Author) / Li, Wenwei (Author)
2023-05-01
Article (Journal)
Electronic Resource
English
Spatio-Temporal Patterns and Source Identification of Water Pollution in Lake Taihu (China)
DOAJ | 2016
British Library Online Contents | 2012
Phosphorus fractionation and bio-availability in Taihu Lake (China) sediments
Online Contents | 2005