Cargando…

Machine learning accurately predicts the multivariate performance phenotype from morphology in lizards

Completing the genotype-to-phenotype map requires rigorous measurement of the entire multivariate organismal phenotype. However, phenotyping on a large scale is not feasible for many kinds of traits, resulting in missing data that can also cause problems for comparative analyses and the assessment o...

Descripción completa

Detalles Bibliográficos
Autores principales: Lailvaux, Simon P., Mishra, Avdesh, Pun, Pooja, Ul Kabir, Md Wasi, Wilson, Robbie S., Herrel, Anthony, Hoque, Md Tamjidul
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8782310/
https://www.ncbi.nlm.nih.gov/pubmed/35061733
http://dx.doi.org/10.1371/journal.pone.0261613
_version_ 1784638284491653120
author Lailvaux, Simon P.
Mishra, Avdesh
Pun, Pooja
Ul Kabir, Md Wasi
Wilson, Robbie S.
Herrel, Anthony
Hoque, Md Tamjidul
author_facet Lailvaux, Simon P.
Mishra, Avdesh
Pun, Pooja
Ul Kabir, Md Wasi
Wilson, Robbie S.
Herrel, Anthony
Hoque, Md Tamjidul
author_sort Lailvaux, Simon P.
collection PubMed
description Completing the genotype-to-phenotype map requires rigorous measurement of the entire multivariate organismal phenotype. However, phenotyping on a large scale is not feasible for many kinds of traits, resulting in missing data that can also cause problems for comparative analyses and the assessment of evolutionary trends across species. Measuring the multivariate performance phenotype is especially logistically challenging, and our ability to predict several performance traits from a given morphology is consequently poor. We developed a machine learning model to accurately estimate multivariate performance data from morphology alone by training it on a dataset containing performance and morphology data from 68 lizard species. Our final, stacked model predicts missing performance data accurately at the level of the individual from simple morphological measures. This model performed exceptionally well, even for performance traits that were missing values for >90% of the sampled individuals. Furthermore, incorporating phylogeny did not improve model fit, indicating that the phenotypic data alone preserved sufficient information to predict the performance based on morphological information. This approach can both significantly increase our understanding of performance evolution and act as a bridge to incorporate performance into future work on phenomics.
format Online
Article
Text
id pubmed-8782310
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-87823102022-01-22 Machine learning accurately predicts the multivariate performance phenotype from morphology in lizards Lailvaux, Simon P. Mishra, Avdesh Pun, Pooja Ul Kabir, Md Wasi Wilson, Robbie S. Herrel, Anthony Hoque, Md Tamjidul PLoS One Research Article Completing the genotype-to-phenotype map requires rigorous measurement of the entire multivariate organismal phenotype. However, phenotyping on a large scale is not feasible for many kinds of traits, resulting in missing data that can also cause problems for comparative analyses and the assessment of evolutionary trends across species. Measuring the multivariate performance phenotype is especially logistically challenging, and our ability to predict several performance traits from a given morphology is consequently poor. We developed a machine learning model to accurately estimate multivariate performance data from morphology alone by training it on a dataset containing performance and morphology data from 68 lizard species. Our final, stacked model predicts missing performance data accurately at the level of the individual from simple morphological measures. This model performed exceptionally well, even for performance traits that were missing values for >90% of the sampled individuals. Furthermore, incorporating phylogeny did not improve model fit, indicating that the phenotypic data alone preserved sufficient information to predict the performance based on morphological information. This approach can both significantly increase our understanding of performance evolution and act as a bridge to incorporate performance into future work on phenomics. Public Library of Science 2022-01-21 /pmc/articles/PMC8782310/ /pubmed/35061733 http://dx.doi.org/10.1371/journal.pone.0261613 Text en © 2022 Lailvaux et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Lailvaux, Simon P.
Mishra, Avdesh
Pun, Pooja
Ul Kabir, Md Wasi
Wilson, Robbie S.
Herrel, Anthony
Hoque, Md Tamjidul
Machine learning accurately predicts the multivariate performance phenotype from morphology in lizards
title Machine learning accurately predicts the multivariate performance phenotype from morphology in lizards
title_full Machine learning accurately predicts the multivariate performance phenotype from morphology in lizards
title_fullStr Machine learning accurately predicts the multivariate performance phenotype from morphology in lizards
title_full_unstemmed Machine learning accurately predicts the multivariate performance phenotype from morphology in lizards
title_short Machine learning accurately predicts the multivariate performance phenotype from morphology in lizards
title_sort machine learning accurately predicts the multivariate performance phenotype from morphology in lizards
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8782310/
https://www.ncbi.nlm.nih.gov/pubmed/35061733
http://dx.doi.org/10.1371/journal.pone.0261613
work_keys_str_mv AT lailvauxsimonp machinelearningaccuratelypredictsthemultivariateperformancephenotypefrommorphologyinlizards
AT mishraavdesh machinelearningaccuratelypredictsthemultivariateperformancephenotypefrommorphologyinlizards
AT punpooja machinelearningaccuratelypredictsthemultivariateperformancephenotypefrommorphologyinlizards
AT ulkabirmdwasi machinelearningaccuratelypredictsthemultivariateperformancephenotypefrommorphologyinlizards
AT wilsonrobbies machinelearningaccuratelypredictsthemultivariateperformancephenotypefrommorphologyinlizards
AT herrelanthony machinelearningaccuratelypredictsthemultivariateperformancephenotypefrommorphologyinlizards
AT hoquemdtamjidul machinelearningaccuratelypredictsthemultivariateperformancephenotypefrommorphologyinlizards