Cargando…

Machine learning approaches identify male body size as the most accurate predictor of species richness 

BACKGROUND: A major challenge in biodiversity science is to understand the factors contributing to the variability of species richness –the number of different species in a community or region - among comparable taxonomic lineages. Multiple biotic and abiotic factors have been hypothesized to have a...

Descripción completa

Detalles Bibliográficos
Autores principales: Čandek, Klemen, Pristovšek Čandek, Urška, Kuntner, Matjaž
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7453550/
https://www.ncbi.nlm.nih.gov/pubmed/32854698
http://dx.doi.org/10.1186/s12915-020-00835-y
_version_ 1783575374291009536
author Čandek, Klemen
Pristovšek Čandek, Urška
Kuntner, Matjaž
author_facet Čandek, Klemen
Pristovšek Čandek, Urška
Kuntner, Matjaž
author_sort Čandek, Klemen
collection PubMed
description BACKGROUND: A major challenge in biodiversity science is to understand the factors contributing to the variability of species richness –the number of different species in a community or region - among comparable taxonomic lineages. Multiple biotic and abiotic factors have been hypothesized to have an effect on species richness and have been used as its predictors, but identifying accurate predictors is not straightforward. Spiders are a highly diverse group, with some 48,000 species in 120 families; yet nearly 75% of all species are found within just the ten most speciose families. Here we use a Random Forest machine learning algorithm to test the predictive power of different variables hypothesized to affect species richness of spider genera. RESULTS: We test the predictive power of 22 variables from spiders’ morphological, genetic, geographic, ecological and behavioral landscapes on species richness of 45 genera selected to represent the phylogenetic and biological breath of Araneae. Among the variables, Random Forest analyses find body size (specifically, minimum male body size) to best predict species richness. Multiple Correspondence analysis confirms this outcome through a negative relationship between male body size and species richness. Multiple Correspondence analyses furthermore establish that geographic distribution of congeneric species is positively associated with genus diversity, and that genera from phylogenetically older lineages are species poorer. Of the spider-specific traits, neither the presence of ballooning behavior, nor sexual size dimorphism, can predict species richness. CONCLUSIONS: We show that machine learning analyses can be used in deciphering the factors associated with diversity patterns. Since no spider-specific biology could predict species richness, but the biologically universal body size did, we believe these conclusions are worthy of broader biological testing. Future work on other groups of organisms will establish whether the detected associations of species richness with small body size and wide geographic ranges hold more broadly.
format Online
Article
Text
id pubmed-7453550
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-74535502020-08-28 Machine learning approaches identify male body size as the most accurate predictor of species richness  Čandek, Klemen Pristovšek Čandek, Urška Kuntner, Matjaž BMC Biol Research Article BACKGROUND: A major challenge in biodiversity science is to understand the factors contributing to the variability of species richness –the number of different species in a community or region - among comparable taxonomic lineages. Multiple biotic and abiotic factors have been hypothesized to have an effect on species richness and have been used as its predictors, but identifying accurate predictors is not straightforward. Spiders are a highly diverse group, with some 48,000 species in 120 families; yet nearly 75% of all species are found within just the ten most speciose families. Here we use a Random Forest machine learning algorithm to test the predictive power of different variables hypothesized to affect species richness of spider genera. RESULTS: We test the predictive power of 22 variables from spiders’ morphological, genetic, geographic, ecological and behavioral landscapes on species richness of 45 genera selected to represent the phylogenetic and biological breath of Araneae. Among the variables, Random Forest analyses find body size (specifically, minimum male body size) to best predict species richness. Multiple Correspondence analysis confirms this outcome through a negative relationship between male body size and species richness. Multiple Correspondence analyses furthermore establish that geographic distribution of congeneric species is positively associated with genus diversity, and that genera from phylogenetically older lineages are species poorer. Of the spider-specific traits, neither the presence of ballooning behavior, nor sexual size dimorphism, can predict species richness. CONCLUSIONS: We show that machine learning analyses can be used in deciphering the factors associated with diversity patterns. Since no spider-specific biology could predict species richness, but the biologically universal body size did, we believe these conclusions are worthy of broader biological testing. Future work on other groups of organisms will establish whether the detected associations of species richness with small body size and wide geographic ranges hold more broadly. BioMed Central 2020-08-28 /pmc/articles/PMC7453550/ /pubmed/32854698 http://dx.doi.org/10.1186/s12915-020-00835-y Text en © The Author(s) 2020 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research Article
Čandek, Klemen
Pristovšek Čandek, Urška
Kuntner, Matjaž
Machine learning approaches identify male body size as the most accurate predictor of species richness 
title Machine learning approaches identify male body size as the most accurate predictor of species richness 
title_full Machine learning approaches identify male body size as the most accurate predictor of species richness 
title_fullStr Machine learning approaches identify male body size as the most accurate predictor of species richness 
title_full_unstemmed Machine learning approaches identify male body size as the most accurate predictor of species richness 
title_short Machine learning approaches identify male body size as the most accurate predictor of species richness 
title_sort machine learning approaches identify male body size as the most accurate predictor of species richness 
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7453550/
https://www.ncbi.nlm.nih.gov/pubmed/32854698
http://dx.doi.org/10.1186/s12915-020-00835-y
work_keys_str_mv AT candekklemen machinelearningapproachesidentifymalebodysizeasthemostaccuratepredictorofspeciesrichness
AT pristovsekcandekurska machinelearningapproachesidentifymalebodysizeasthemostaccuratepredictorofspeciesrichness
AT kuntnermatjaz machinelearningapproachesidentifymalebodysizeasthemostaccuratepredictorofspeciesrichness