Cargando…

Can comprehensive background knowledge be incorporated into substitution models to improve phylogenetic analyses? A case study on major arthropod relationships

BACKGROUND: Whenever different data sets arrive at conflicting phylogenetic hypotheses, only testable causal explanations of sources of errors in at least one of the data sets allow us to critically choose among the conflicting hypotheses of relationships. The large (28S) and small (18S) subunit rRN...

Descripción completa

Detalles Bibliográficos
Autores principales: von Reumont, Björn M, Meusemann, Karen, Szucsich, Nikolaus U, Dell'Ampio, Emiliano, Gowri-Shankar, Vivek, Bartel, Daniela, Simon, Sabrina, Letsch, Harald O, Stocsits, Roman R, Luan, Yun-xia, Wägele, Johann Wolfgang, Pass, Günther, Hadrys, Heike, Misof, Bernhard
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2695459/
https://www.ncbi.nlm.nih.gov/pubmed/19473484
http://dx.doi.org/10.1186/1471-2148-9-119
_version_ 1782168196728487936
author von Reumont, Björn M
Meusemann, Karen
Szucsich, Nikolaus U
Dell'Ampio, Emiliano
Gowri-Shankar, Vivek
Bartel, Daniela
Simon, Sabrina
Letsch, Harald O
Stocsits, Roman R
Luan, Yun-xia
Wägele, Johann Wolfgang
Pass, Günther
Hadrys, Heike
Misof, Bernhard
author_facet von Reumont, Björn M
Meusemann, Karen
Szucsich, Nikolaus U
Dell'Ampio, Emiliano
Gowri-Shankar, Vivek
Bartel, Daniela
Simon, Sabrina
Letsch, Harald O
Stocsits, Roman R
Luan, Yun-xia
Wägele, Johann Wolfgang
Pass, Günther
Hadrys, Heike
Misof, Bernhard
author_sort von Reumont, Björn M
collection PubMed
description BACKGROUND: Whenever different data sets arrive at conflicting phylogenetic hypotheses, only testable causal explanations of sources of errors in at least one of the data sets allow us to critically choose among the conflicting hypotheses of relationships. The large (28S) and small (18S) subunit rRNAs are among the most popular markers for studies of deep phylogenies. However, some nodes supported by this data are suspected of being artifacts caused by peculiarities of the evolution of these molecules. Arthropod phylogeny is an especially controversial subject dotted with conflicting hypotheses which are dependent on data set and method of reconstruction. We assume that phylogenetic analyses based on these genes can be improved further i) by enlarging the taxon sample and ii) employing more realistic models of sequence evolution incorporating non-stationary substitution processes and iii) considering covariation and pairing of sites in rRNA-genes. RESULTS: We analyzed a large set of arthropod sequences, applied new tools for quality control of data prior to tree reconstruction, and increased the biological realism of substitution models. Although the split-decomposition network indicated a high noise content in the data set, our measures were able to both improve the analyses and give causal explanations for some incongruities mentioned from analyses of rRNA sequences. However, misleading effects did not completely disappear. CONCLUSION: Analyses of data sets that result in ambiguous phylogenetic hypotheses demand for methods, which do not only filter stochastic noise, but likewise allow to differentiate phylogenetic signal from systematic biases. Such methods can only rely on our findings regarding the evolution of the analyzed data. Analyses on independent data sets then are crucial to test the plausibility of the results. Our approach can easily be extended to genomic data, as well, whereby layers of quality assessment are set up applicable to phylogenetic reconstructions in general.
format Text
id pubmed-2695459
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-26954592009-06-12 Can comprehensive background knowledge be incorporated into substitution models to improve phylogenetic analyses? A case study on major arthropod relationships von Reumont, Björn M Meusemann, Karen Szucsich, Nikolaus U Dell'Ampio, Emiliano Gowri-Shankar, Vivek Bartel, Daniela Simon, Sabrina Letsch, Harald O Stocsits, Roman R Luan, Yun-xia Wägele, Johann Wolfgang Pass, Günther Hadrys, Heike Misof, Bernhard BMC Evol Biol Research Article BACKGROUND: Whenever different data sets arrive at conflicting phylogenetic hypotheses, only testable causal explanations of sources of errors in at least one of the data sets allow us to critically choose among the conflicting hypotheses of relationships. The large (28S) and small (18S) subunit rRNAs are among the most popular markers for studies of deep phylogenies. However, some nodes supported by this data are suspected of being artifacts caused by peculiarities of the evolution of these molecules. Arthropod phylogeny is an especially controversial subject dotted with conflicting hypotheses which are dependent on data set and method of reconstruction. We assume that phylogenetic analyses based on these genes can be improved further i) by enlarging the taxon sample and ii) employing more realistic models of sequence evolution incorporating non-stationary substitution processes and iii) considering covariation and pairing of sites in rRNA-genes. RESULTS: We analyzed a large set of arthropod sequences, applied new tools for quality control of data prior to tree reconstruction, and increased the biological realism of substitution models. Although the split-decomposition network indicated a high noise content in the data set, our measures were able to both improve the analyses and give causal explanations for some incongruities mentioned from analyses of rRNA sequences. However, misleading effects did not completely disappear. CONCLUSION: Analyses of data sets that result in ambiguous phylogenetic hypotheses demand for methods, which do not only filter stochastic noise, but likewise allow to differentiate phylogenetic signal from systematic biases. Such methods can only rely on our findings regarding the evolution of the analyzed data. Analyses on independent data sets then are crucial to test the plausibility of the results. Our approach can easily be extended to genomic data, as well, whereby layers of quality assessment are set up applicable to phylogenetic reconstructions in general. BioMed Central 2009-05-27 /pmc/articles/PMC2695459/ /pubmed/19473484 http://dx.doi.org/10.1186/1471-2148-9-119 Text en Copyright © 2009 von Reumont et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
von Reumont, Björn M
Meusemann, Karen
Szucsich, Nikolaus U
Dell'Ampio, Emiliano
Gowri-Shankar, Vivek
Bartel, Daniela
Simon, Sabrina
Letsch, Harald O
Stocsits, Roman R
Luan, Yun-xia
Wägele, Johann Wolfgang
Pass, Günther
Hadrys, Heike
Misof, Bernhard
Can comprehensive background knowledge be incorporated into substitution models to improve phylogenetic analyses? A case study on major arthropod relationships
title Can comprehensive background knowledge be incorporated into substitution models to improve phylogenetic analyses? A case study on major arthropod relationships
title_full Can comprehensive background knowledge be incorporated into substitution models to improve phylogenetic analyses? A case study on major arthropod relationships
title_fullStr Can comprehensive background knowledge be incorporated into substitution models to improve phylogenetic analyses? A case study on major arthropod relationships
title_full_unstemmed Can comprehensive background knowledge be incorporated into substitution models to improve phylogenetic analyses? A case study on major arthropod relationships
title_short Can comprehensive background knowledge be incorporated into substitution models to improve phylogenetic analyses? A case study on major arthropod relationships
title_sort can comprehensive background knowledge be incorporated into substitution models to improve phylogenetic analyses? a case study on major arthropod relationships
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2695459/
https://www.ncbi.nlm.nih.gov/pubmed/19473484
http://dx.doi.org/10.1186/1471-2148-9-119
work_keys_str_mv AT vonreumontbjornm cancomprehensivebackgroundknowledgebeincorporatedintosubstitutionmodelstoimprovephylogeneticanalysesacasestudyonmajorarthropodrelationships
AT meusemannkaren cancomprehensivebackgroundknowledgebeincorporatedintosubstitutionmodelstoimprovephylogeneticanalysesacasestudyonmajorarthropodrelationships
AT szucsichnikolausu cancomprehensivebackgroundknowledgebeincorporatedintosubstitutionmodelstoimprovephylogeneticanalysesacasestudyonmajorarthropodrelationships
AT dellampioemiliano cancomprehensivebackgroundknowledgebeincorporatedintosubstitutionmodelstoimprovephylogeneticanalysesacasestudyonmajorarthropodrelationships
AT gowrishankarvivek cancomprehensivebackgroundknowledgebeincorporatedintosubstitutionmodelstoimprovephylogeneticanalysesacasestudyonmajorarthropodrelationships
AT barteldaniela cancomprehensivebackgroundknowledgebeincorporatedintosubstitutionmodelstoimprovephylogeneticanalysesacasestudyonmajorarthropodrelationships
AT simonsabrina cancomprehensivebackgroundknowledgebeincorporatedintosubstitutionmodelstoimprovephylogeneticanalysesacasestudyonmajorarthropodrelationships
AT letschharaldo cancomprehensivebackgroundknowledgebeincorporatedintosubstitutionmodelstoimprovephylogeneticanalysesacasestudyonmajorarthropodrelationships
AT stocsitsromanr cancomprehensivebackgroundknowledgebeincorporatedintosubstitutionmodelstoimprovephylogeneticanalysesacasestudyonmajorarthropodrelationships
AT luanyunxia cancomprehensivebackgroundknowledgebeincorporatedintosubstitutionmodelstoimprovephylogeneticanalysesacasestudyonmajorarthropodrelationships
AT wagelejohannwolfgang cancomprehensivebackgroundknowledgebeincorporatedintosubstitutionmodelstoimprovephylogeneticanalysesacasestudyonmajorarthropodrelationships
AT passgunther cancomprehensivebackgroundknowledgebeincorporatedintosubstitutionmodelstoimprovephylogeneticanalysesacasestudyonmajorarthropodrelationships
AT hadrysheike cancomprehensivebackgroundknowledgebeincorporatedintosubstitutionmodelstoimprovephylogeneticanalysesacasestudyonmajorarthropodrelationships
AT misofbernhard cancomprehensivebackgroundknowledgebeincorporatedintosubstitutionmodelstoimprovephylogeneticanalysesacasestudyonmajorarthropodrelationships