Association between the Classification of the Genus of Batrachospermaceae (Rhodophyta) and the Environmental Factors Based on Machine Learning

Batrachospermaceae is the largest family of freshwater red algae, widely distributed around the world, and plays an important role in maintaining the balance of spring and creek ecosystems. The deterioration of the current global ecological environment has also destroyed the habitat of Batrachosperm...

Bibliographic Details
Main Authors: Yang, Qiqin; Nan, Fangru; Liu, Xudong; Liu, Qi; Lv, Junping; Feng, Jia; Wang, Fei; Xie, Shulian
Format: Online Article Text
Language: English
Published: MDPI 2022
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9781829/
https://www.ncbi.nlm.nih.gov/pubmed/36559598
http://dx.doi.org/10.3390/plants11243485
author Yang, Qiqin
Nan, Fangru
Liu, Xudong
Liu, Qi
Lv, Junping
Feng, Jia
Wang, Fei
Xie, Shulian
author_sort Yang, Qiqin
collection PubMed
description Batrachospermaceae is the largest family of freshwater red algae, widely distributed around the world, and plays an important role in maintaining the balance of spring and creek ecosystems. The deterioration of the current global ecological environment has also destroyed the habitat of Batrachospermaceae. The research on the environmental factors of Batrachospermaceae and the accurate classification of the genus is necessary for the protection, restoration, excavation, and utilization of Batrachospermaceae resources. In this paper, the database of geographical distribution and environmental factors of Batrachospermaceae was sorted out, and the relationship between the classification of genus and environmental factors in Batrachospermaceae was analyzed based on two machine learning methods, random forest and XGBoost. The result shows: (1) The models constructed by the two machine learning methods can effectively distinguish the genus of Batrachospermaceae based on environmental factors; (2) The overall AUC score of the random forest model for the classification and prediction of the genus of Batrachospermaceae reached 90.41%, and the overall AUC score of the taxonomic prediction of each genus of Batrachospermaceae reached 85.85%; (3) Combining the two methods, it is believed that the environmental factors that affect the distinction of the genus of Batrachospermaceae are mainly altitude, average relative humidity, average temperature, and minimum temperature, among which altitude has the greatest influence. The results can further clarify the taxonomy of the genus in Batrachospermaceae and enrich the research on the differences in environmental factors of Batrachospermaceae.
format Online
Article
Text
id pubmed-9781829
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-9781829 2022-12-24 Plants (Basel) Article
MDPI 2022-12-13 /pmc/articles/PMC9781829/ /pubmed/36559598 http://dx.doi.org/10.3390/plants11243485 Text en © 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
title Association between the Classification of the Genus of Batrachospermaceae (Rhodophyta) and the Environmental Factors Based on Machine Learning
topic Article