Cargando…

Confirming the Phylogeny of Mammals by Use of Large Comparative Sequence Data Sets

The ongoing generation of prodigious amounts of genomic sequence data from myriad vertebrates is providing unparalleled opportunities for establishing definitive phylogenetic relationships among species. The size and complexities of such comparative sequence data sets not only allow smaller and more...

Descripción completa

Detalles Bibliográficos
Autores principales: Prasad, Arjun B., Allard, Marc W., Green, Eric D.
Formato: Texto
Lenguaje:English
Publicado: Oxford University Press 2008
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2515873/
https://www.ncbi.nlm.nih.gov/pubmed/18453548
http://dx.doi.org/10.1093/molbev/msn104
_version_ 1782158446362099712
author Prasad, Arjun B.
Allard, Marc W.
Green, Eric D.
author_facet Prasad, Arjun B.
Allard, Marc W.
Green, Eric D.
author_sort Prasad, Arjun B.
collection PubMed
description The ongoing generation of prodigious amounts of genomic sequence data from myriad vertebrates is providing unparalleled opportunities for establishing definitive phylogenetic relationships among species. The size and complexities of such comparative sequence data sets not only allow smaller and more difficult branches to be resolved but also present unique challenges, including large computational requirements and the negative consequences of systematic biases. To explore these issues and to clarify the phylogenetic relationships among mammals, we have analyzed a large data set of over 60 megabase pairs (Mb) of high-quality genomic sequence, which we generated from 41 mammals and 3 other vertebrates. All sequences are orthologous to a 1.9-Mb region of the human genome that encompasses the cystic fibrosis transmembrane conductance regulator gene (CFTR). To understand the characteristics and challenges associated with phylogenetic analyses of such a large data set, we partitioned the sequence data in several ways and utilized maximum likelihood, maximum parsimony, and Neighbor-Joining algorithms, implemented in parallel on Linux clusters. These studies yielded well-supported phylogenetic trees, largely confirming other recent molecular phylogenetic analyses. Our results provide support for rooting the placental mammal tree between Atlantogenata (Xenarthra and Afrotheria) and Boreoeutheria (Euarchontoglires and Laurasiatheria), illustrate the difficulty in resolving some branches even with large amounts of data (e.g., in the case of Laurasiatheria), and demonstrate the valuable role that very large comparative sequence data sets can play in refining our understanding of the evolutionary relationships of vertebrates.
format Text
id pubmed-2515873
institution National Center for Biotechnology Information
language English
publishDate 2008
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-25158732009-02-25 Confirming the Phylogeny of Mammals by Use of Large Comparative Sequence Data Sets Prasad, Arjun B. Allard, Marc W. Green, Eric D. Mol Biol Evol Research Articles The ongoing generation of prodigious amounts of genomic sequence data from myriad vertebrates is providing unparalleled opportunities for establishing definitive phylogenetic relationships among species. The size and complexities of such comparative sequence data sets not only allow smaller and more difficult branches to be resolved but also present unique challenges, including large computational requirements and the negative consequences of systematic biases. To explore these issues and to clarify the phylogenetic relationships among mammals, we have analyzed a large data set of over 60 megabase pairs (Mb) of high-quality genomic sequence, which we generated from 41 mammals and 3 other vertebrates. All sequences are orthologous to a 1.9-Mb region of the human genome that encompasses the cystic fibrosis transmembrane conductance regulator gene (CFTR). To understand the characteristics and challenges associated with phylogenetic analyses of such a large data set, we partitioned the sequence data in several ways and utilized maximum likelihood, maximum parsimony, and Neighbor-Joining algorithms, implemented in parallel on Linux clusters. These studies yielded well-supported phylogenetic trees, largely confirming other recent molecular phylogenetic analyses. Our results provide support for rooting the placental mammal tree between Atlantogenata (Xenarthra and Afrotheria) and Boreoeutheria (Euarchontoglires and Laurasiatheria), illustrate the difficulty in resolving some branches even with large amounts of data (e.g., in the case of Laurasiatheria), and demonstrate the valuable role that very large comparative sequence data sets can play in refining our understanding of the evolutionary relationships of vertebrates. Oxford University Press 2008-09 2008-05-02 /pmc/articles/PMC2515873/ /pubmed/18453548 http://dx.doi.org/10.1093/molbev/msn104 Text en Published by Oxford University Press 2008. This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.0/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Articles
Prasad, Arjun B.
Allard, Marc W.
Green, Eric D.
Confirming the Phylogeny of Mammals by Use of Large Comparative Sequence Data Sets
title Confirming the Phylogeny of Mammals by Use of Large Comparative Sequence Data Sets
title_full Confirming the Phylogeny of Mammals by Use of Large Comparative Sequence Data Sets
title_fullStr Confirming the Phylogeny of Mammals by Use of Large Comparative Sequence Data Sets
title_full_unstemmed Confirming the Phylogeny of Mammals by Use of Large Comparative Sequence Data Sets
title_short Confirming the Phylogeny of Mammals by Use of Large Comparative Sequence Data Sets
title_sort confirming the phylogeny of mammals by use of large comparative sequence data sets
topic Research Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2515873/
https://www.ncbi.nlm.nih.gov/pubmed/18453548
http://dx.doi.org/10.1093/molbev/msn104
work_keys_str_mv AT prasadarjunb confirmingthephylogenyofmammalsbyuseoflargecomparativesequencedatasets
AT allardmarcw confirmingthephylogenyofmammalsbyuseoflargecomparativesequencedatasets
AT greenericd confirmingthephylogenyofmammalsbyuseoflargecomparativesequencedatasets
AT confirmingthephylogenyofmammalsbyuseoflargecomparativesequencedatasets