Cargando…

Realistic scenarios of missing taxa in phylogenetic comparative methods and their effects on model selection and parameter estimation

Model-based analyses of continuous trait evolution enable rich evolutionary insight. These analyses require a phylogenetic tree and a vector of trait values for the tree’s terminal taxa, but rarely do a tree and dataset include all taxa within a clade. Because the probability that a taxon is include...

Descripción completa

Detalles Bibliográficos
Autor principal: Marcondes, Rafael S.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: PeerJ Inc. 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6791351/
https://www.ncbi.nlm.nih.gov/pubmed/31616606
http://dx.doi.org/10.7717/peerj.7917
_version_ 1783458967831183360
author Marcondes, Rafael S.
author_facet Marcondes, Rafael S.
author_sort Marcondes, Rafael S.
collection PubMed
description Model-based analyses of continuous trait evolution enable rich evolutionary insight. These analyses require a phylogenetic tree and a vector of trait values for the tree’s terminal taxa, but rarely do a tree and dataset include all taxa within a clade. Because the probability that a taxon is included in a dataset depends on ecological traits that have phylogenetic signal, missing taxa in real datasets should be expected to be phylogenetically clumped or correlated to the modelled trait. I examined whether those types of missing taxa represent a problem for model selection and parameter estimation. I simulated univariate traits under a suite of Brownian Motion and Ornstein-Uhlenbeck models, and assessed the performance of model selection and parameter estimation under absent, random, clumped or correlated missing taxa. I found that those analyses perform well under almost all scenarios, including situations with very sparsely sampled phylogenies. The only notable biases I detected were in parameter estimation under a very high percentage (90%) of correlated missing taxa. My results offer a degree of reassurance for studies of continuous trait evolution with missing taxa, but the problem of missing taxa in phylogenetic comparative methods still demands much further investigation. The framework I have described here might provide a starting point for future work.
format Online
Article
Text
id pubmed-6791351
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher PeerJ Inc.
record_format MEDLINE/PubMed
spelling pubmed-67913512019-10-15 Realistic scenarios of missing taxa in phylogenetic comparative methods and their effects on model selection and parameter estimation Marcondes, Rafael S. PeerJ Biodiversity Model-based analyses of continuous trait evolution enable rich evolutionary insight. These analyses require a phylogenetic tree and a vector of trait values for the tree’s terminal taxa, but rarely do a tree and dataset include all taxa within a clade. Because the probability that a taxon is included in a dataset depends on ecological traits that have phylogenetic signal, missing taxa in real datasets should be expected to be phylogenetically clumped or correlated to the modelled trait. I examined whether those types of missing taxa represent a problem for model selection and parameter estimation. I simulated univariate traits under a suite of Brownian Motion and Ornstein-Uhlenbeck models, and assessed the performance of model selection and parameter estimation under absent, random, clumped or correlated missing taxa. I found that those analyses perform well under almost all scenarios, including situations with very sparsely sampled phylogenies. The only notable biases I detected were in parameter estimation under a very high percentage (90%) of correlated missing taxa. My results offer a degree of reassurance for studies of continuous trait evolution with missing taxa, but the problem of missing taxa in phylogenetic comparative methods still demands much further investigation. The framework I have described here might provide a starting point for future work. PeerJ Inc. 2019-10-11 /pmc/articles/PMC6791351/ /pubmed/31616606 http://dx.doi.org/10.7717/peerj.7917 Text en ©2019 Marcondes https://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ) and either DOI or URL of the article must be cited.
spellingShingle Biodiversity
Marcondes, Rafael S.
Realistic scenarios of missing taxa in phylogenetic comparative methods and their effects on model selection and parameter estimation
title Realistic scenarios of missing taxa in phylogenetic comparative methods and their effects on model selection and parameter estimation
title_full Realistic scenarios of missing taxa in phylogenetic comparative methods and their effects on model selection and parameter estimation
title_fullStr Realistic scenarios of missing taxa in phylogenetic comparative methods and their effects on model selection and parameter estimation
title_full_unstemmed Realistic scenarios of missing taxa in phylogenetic comparative methods and their effects on model selection and parameter estimation
title_short Realistic scenarios of missing taxa in phylogenetic comparative methods and their effects on model selection and parameter estimation
title_sort realistic scenarios of missing taxa in phylogenetic comparative methods and their effects on model selection and parameter estimation
topic Biodiversity
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6791351/
https://www.ncbi.nlm.nih.gov/pubmed/31616606
http://dx.doi.org/10.7717/peerj.7917
work_keys_str_mv AT marcondesrafaels realisticscenariosofmissingtaxainphylogeneticcomparativemethodsandtheireffectsonmodelselectionandparameterestimation