Cargando…
Association between the Classification of the Genus of Batrachospermaceae (Rhodophyta) and the Environmental Factors Based on Machine Learning
Batrachospermaceae is the largest family of freshwater red algae, widely distributed around the world, and plays an important role in maintaining the balance of spring and creek ecosystems. The deterioration of the current global ecological environment has also destroyed the habitat of Batrachosperm...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9781829/ https://www.ncbi.nlm.nih.gov/pubmed/36559598 http://dx.doi.org/10.3390/plants11243485 |
_version_ | 1784857170090655744 |
---|---|
author | Yang, Qiqin Nan, Fangru Liu, Xudong Liu, Qi Lv, Junping Feng, Jia Wang, Fei Xie, Shulian |
author_facet | Yang, Qiqin Nan, Fangru Liu, Xudong Liu, Qi Lv, Junping Feng, Jia Wang, Fei Xie, Shulian |
author_sort | Yang, Qiqin |
collection | PubMed |
description | Batrachospermaceae is the largest family of freshwater red algae, widely distributed around the world, and plays an important role in maintaining the balance of spring and creek ecosystems. The deterioration of the current global ecological environment has also destroyed the habitat of Batrachospermaceae. The research on the environmental factors of Batrachospermaceae and the accurate classification of the genus is necessary for the protection, restoration, excavation, and utilization of Batrachospermaceae resources. In this paper, the database of geographical distribution and environmental factors of Batrachospermaceae was sorted out, and the relationship between the classification of genus and environmental factors in Batrachospermaceae was analyzed based on two machine learning methods, random forest and XGBoost. The result shows: (1) The models constructed by the two machine learning methods can effectively distinguish the genus of Batrachospermaceae based on environmental factors; (2) The overall AUC score of the random forest model for the classification and prediction of the genus of Batrachospermaceae reached 90.41%, and the overall AUC score of the taxonomic prediction of each genus of Batrachospermaceae reached 85.85%; (3) Combining the two methods, it is believed that the environmental factors that affect the distinction of the genus of Batrachospermaceae are mainly altitude, average relative humidity, average temperature, and minimum temperature, among which altitude has the greatest influence. The results can further clarify the taxonomy of the genus in Batrachospermaceae and enrich the research on the differences in environmental factors of Batrachospermaceae. |
format | Online Article Text |
id | pubmed-9781829 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-97818292022-12-24 Association between the Classification of the Genus of Batrachospermaceae (Rhodophyta) and the Environmental Factors Based on Machine Learning Yang, Qiqin Nan, Fangru Liu, Xudong Liu, Qi Lv, Junping Feng, Jia Wang, Fei Xie, Shulian Plants (Basel) Article Batrachospermaceae is the largest family of freshwater red algae, widely distributed around the world, and plays an important role in maintaining the balance of spring and creek ecosystems. The deterioration of the current global ecological environment has also destroyed the habitat of Batrachospermaceae. The research on the environmental factors of Batrachospermaceae and the accurate classification of the genus is necessary for the protection, restoration, excavation, and utilization of Batrachospermaceae resources. In this paper, the database of geographical distribution and environmental factors of Batrachospermaceae was sorted out, and the relationship between the classification of genus and environmental factors in Batrachospermaceae was analyzed based on two machine learning methods, random forest and XGBoost. The result shows: (1) The models constructed by the two machine learning methods can effectively distinguish the genus of Batrachospermaceae based on environmental factors; (2) The overall AUC score of the random forest model for the classification and prediction of the genus of Batrachospermaceae reached 90.41%, and the overall AUC score of the taxonomic prediction of each genus of Batrachospermaceae reached 85.85%; (3) Combining the two methods, it is believed that the environmental factors that affect the distinction of the genus of Batrachospermaceae are mainly altitude, average relative humidity, average temperature, and minimum temperature, among which altitude has the greatest influence. The results can further clarify the taxonomy of the genus in Batrachospermaceae and enrich the research on the differences in environmental factors of Batrachospermaceae. MDPI 2022-12-13 /pmc/articles/PMC9781829/ /pubmed/36559598 http://dx.doi.org/10.3390/plants11243485 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Yang, Qiqin Nan, Fangru Liu, Xudong Liu, Qi Lv, Junping Feng, Jia Wang, Fei Xie, Shulian Association between the Classification of the Genus of Batrachospermaceae (Rhodophyta) and the Environmental Factors Based on Machine Learning |
title | Association between the Classification of the Genus of Batrachospermaceae (Rhodophyta) and the Environmental Factors Based on Machine Learning |
title_full | Association between the Classification of the Genus of Batrachospermaceae (Rhodophyta) and the Environmental Factors Based on Machine Learning |
title_fullStr | Association between the Classification of the Genus of Batrachospermaceae (Rhodophyta) and the Environmental Factors Based on Machine Learning |
title_full_unstemmed | Association between the Classification of the Genus of Batrachospermaceae (Rhodophyta) and the Environmental Factors Based on Machine Learning |
title_short | Association between the Classification of the Genus of Batrachospermaceae (Rhodophyta) and the Environmental Factors Based on Machine Learning |
title_sort | association between the classification of the genus of batrachospermaceae (rhodophyta) and the environmental factors based on machine learning |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9781829/ https://www.ncbi.nlm.nih.gov/pubmed/36559598 http://dx.doi.org/10.3390/plants11243485 |
work_keys_str_mv | AT yangqiqin associationbetweentheclassificationofthegenusofbatrachospermaceaerhodophytaandtheenvironmentalfactorsbasedonmachinelearning AT nanfangru associationbetweentheclassificationofthegenusofbatrachospermaceaerhodophytaandtheenvironmentalfactorsbasedonmachinelearning AT liuxudong associationbetweentheclassificationofthegenusofbatrachospermaceaerhodophytaandtheenvironmentalfactorsbasedonmachinelearning AT liuqi associationbetweentheclassificationofthegenusofbatrachospermaceaerhodophytaandtheenvironmentalfactorsbasedonmachinelearning AT lvjunping associationbetweentheclassificationofthegenusofbatrachospermaceaerhodophytaandtheenvironmentalfactorsbasedonmachinelearning AT fengjia associationbetweentheclassificationofthegenusofbatrachospermaceaerhodophytaandtheenvironmentalfactorsbasedonmachinelearning AT wangfei associationbetweentheclassificationofthegenusofbatrachospermaceaerhodophytaandtheenvironmentalfactorsbasedonmachinelearning AT xieshulian associationbetweentheclassificationofthegenusofbatrachospermaceaerhodophytaandtheenvironmentalfactorsbasedonmachinelearning |