Cargando…

Prediction of Geosmin at Different Depths of Lake Using Machine Learning Techniques

Geosmin is a major concern in the management of water sources worldwide. Thus, we predicted concentration categories of geosmin at three different depths of lakes (i.e., surface, middle, and bottom), and analyzed relationships between geosmin concentration and factors such as phytoplankton abundance...

Descripción completa

Detalles Bibliográficos
Autores principales: Kwon, Yong-Su, Cho, In-Hwan, Kim, Ha-Kyung, Byun, Jeong-Hwan, Bae, Mi-Jung, Kim, Baik-Ho
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8508407/
https://www.ncbi.nlm.nih.gov/pubmed/34639603
http://dx.doi.org/10.3390/ijerph181910303
_version_ 1784582089984704512
author Kwon, Yong-Su
Cho, In-Hwan
Kim, Ha-Kyung
Byun, Jeong-Hwan
Bae, Mi-Jung
Kim, Baik-Ho
author_facet Kwon, Yong-Su
Cho, In-Hwan
Kim, Ha-Kyung
Byun, Jeong-Hwan
Bae, Mi-Jung
Kim, Baik-Ho
author_sort Kwon, Yong-Su
collection PubMed
description Geosmin is a major concern in the management of water sources worldwide. Thus, we predicted concentration categories of geosmin at three different depths of lakes (i.e., surface, middle, and bottom), and analyzed relationships between geosmin concentration and factors such as phytoplankton abundance and environmental variables. Data were collected monthly from three major lakes (Uiam, Cheongpyeong, and Paldang lakes) in Korea from May 2014 to December 2015. Before predicting geosmin concentration, we categorized it into four groups based on the boxplot method, and multivariate adaptive regression splines, classification and regression trees, and random forest (RF) were applied to identify the most appropriate modelling to predict geosmin concentration. Overall, using environmental variables was more accurate than using phytoplankton abundance to predict the four categories of geosmin concentration based on AUC and accuracy in all three models as well as each layer. The RF model had the highest predictive power among the three SDMs. When predicting geosmin in the three water layers, the relative importance of environmental variables and phytoplankton abundance in the sensitivity analysis was different for each layer. Water temperature and abundance of Cyanophyceae were the most important factors for predicting geosmin concentration categories in the surface layer, whereas total abundance of phytoplankton exhibited relatively higher importance in the bottom layer.
format Online
Article
Text
id pubmed-8508407
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-85084072021-10-13 Prediction of Geosmin at Different Depths of Lake Using Machine Learning Techniques Kwon, Yong-Su Cho, In-Hwan Kim, Ha-Kyung Byun, Jeong-Hwan Bae, Mi-Jung Kim, Baik-Ho Int J Environ Res Public Health Article Geosmin is a major concern in the management of water sources worldwide. Thus, we predicted concentration categories of geosmin at three different depths of lakes (i.e., surface, middle, and bottom), and analyzed relationships between geosmin concentration and factors such as phytoplankton abundance and environmental variables. Data were collected monthly from three major lakes (Uiam, Cheongpyeong, and Paldang lakes) in Korea from May 2014 to December 2015. Before predicting geosmin concentration, we categorized it into four groups based on the boxplot method, and multivariate adaptive regression splines, classification and regression trees, and random forest (RF) were applied to identify the most appropriate modelling to predict geosmin concentration. Overall, using environmental variables was more accurate than using phytoplankton abundance to predict the four categories of geosmin concentration based on AUC and accuracy in all three models as well as each layer. The RF model had the highest predictive power among the three SDMs. When predicting geosmin in the three water layers, the relative importance of environmental variables and phytoplankton abundance in the sensitivity analysis was different for each layer. Water temperature and abundance of Cyanophyceae were the most important factors for predicting geosmin concentration categories in the surface layer, whereas total abundance of phytoplankton exhibited relatively higher importance in the bottom layer. MDPI 2021-09-30 /pmc/articles/PMC8508407/ /pubmed/34639603 http://dx.doi.org/10.3390/ijerph181910303 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Kwon, Yong-Su
Cho, In-Hwan
Kim, Ha-Kyung
Byun, Jeong-Hwan
Bae, Mi-Jung
Kim, Baik-Ho
Prediction of Geosmin at Different Depths of Lake Using Machine Learning Techniques
title Prediction of Geosmin at Different Depths of Lake Using Machine Learning Techniques
title_full Prediction of Geosmin at Different Depths of Lake Using Machine Learning Techniques
title_fullStr Prediction of Geosmin at Different Depths of Lake Using Machine Learning Techniques
title_full_unstemmed Prediction of Geosmin at Different Depths of Lake Using Machine Learning Techniques
title_short Prediction of Geosmin at Different Depths of Lake Using Machine Learning Techniques
title_sort prediction of geosmin at different depths of lake using machine learning techniques
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8508407/
https://www.ncbi.nlm.nih.gov/pubmed/34639603
http://dx.doi.org/10.3390/ijerph181910303
work_keys_str_mv AT kwonyongsu predictionofgeosminatdifferentdepthsoflakeusingmachinelearningtechniques
AT choinhwan predictionofgeosminatdifferentdepthsoflakeusingmachinelearningtechniques
AT kimhakyung predictionofgeosminatdifferentdepthsoflakeusingmachinelearningtechniques
AT byunjeonghwan predictionofgeosminatdifferentdepthsoflakeusingmachinelearningtechniques
AT baemijung predictionofgeosminatdifferentdepthsoflakeusingmachinelearningtechniques
AT kimbaikho predictionofgeosminatdifferentdepthsoflakeusingmachinelearningtechniques