Cargando…
Phylogenomics: Gene Duplication, Unrecognized Paralogy and Outgroup Choice
Comparative genomics has revealed the ubiquity of gene and genome duplication and subsequent gene loss. In the case of gene duplication and subsequent loss, gene trees can differ from species trees, thus frequent gene duplication poses a challenge for reconstruction of species relationships. Here I...
Autor principal: | |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2009
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2640465/ https://www.ncbi.nlm.nih.gov/pubmed/19234600 http://dx.doi.org/10.1371/journal.pone.0004568 |
_version_ | 1782164579918282752 |
---|---|
author | Roy, Scott William |
author_facet | Roy, Scott William |
author_sort | Roy, Scott William |
collection | PubMed |
description | Comparative genomics has revealed the ubiquity of gene and genome duplication and subsequent gene loss. In the case of gene duplication and subsequent loss, gene trees can differ from species trees, thus frequent gene duplication poses a challenge for reconstruction of species relationships. Here I address the case of multi-gene sets of putative orthologs that include some unrecognized paralogs due to ancestral gene duplication, and ask how outgroups should best be chosen to reduce the degree of non-species tree (NST) signal. Consideration of expected internal branch lengths supports several conclusions: (i) when a single outgroup is used, the degree of NST signal arising from gene duplication is either independent of outgroup choice, or is minimized by use of a maximally closely related post-duplication (MCRPD) outgroup; (ii) when two outgroups are used, NST signal is minimized by using one MCRPD outgroup, while the position of the second outgroup is of lesser importance; and (iii) when two outgroups are used, the ability to detect gene trees that are inconsistent with known aspects of the species tree is maximized by use of one MCRPD, and is either independent of the position of the second outgroup, or is maximized for a more distantly related second outgroup. Overall, these results generalize the utility of closely-related outgroups for phylogenetic analysis. |
format | Text |
id | pubmed-2640465 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2009 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-26404652009-02-23 Phylogenomics: Gene Duplication, Unrecognized Paralogy and Outgroup Choice Roy, Scott William PLoS One Research Article Comparative genomics has revealed the ubiquity of gene and genome duplication and subsequent gene loss. In the case of gene duplication and subsequent loss, gene trees can differ from species trees, thus frequent gene duplication poses a challenge for reconstruction of species relationships. Here I address the case of multi-gene sets of putative orthologs that include some unrecognized paralogs due to ancestral gene duplication, and ask how outgroups should best be chosen to reduce the degree of non-species tree (NST) signal. Consideration of expected internal branch lengths supports several conclusions: (i) when a single outgroup is used, the degree of NST signal arising from gene duplication is either independent of outgroup choice, or is minimized by use of a maximally closely related post-duplication (MCRPD) outgroup; (ii) when two outgroups are used, NST signal is minimized by using one MCRPD outgroup, while the position of the second outgroup is of lesser importance; and (iii) when two outgroups are used, the ability to detect gene trees that are inconsistent with known aspects of the species tree is maximized by use of one MCRPD, and is either independent of the position of the second outgroup, or is maximized for a more distantly related second outgroup. Overall, these results generalize the utility of closely-related outgroups for phylogenetic analysis. Public Library of Science 2009-02-23 /pmc/articles/PMC2640465/ /pubmed/19234600 http://dx.doi.org/10.1371/journal.pone.0004568 Text en This is an open-access article distributed under the terms of the Creative Commons Public Domain declaration which stipulates that, once placed in the public domain, this work may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose. https://creativecommons.org/publicdomain/zero/1.0/ This is an open-access article distributed under the terms of the Creative Commons Public Domain declaration, which stipulates that, once placed in the public domain, this work may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose. |
spellingShingle | Research Article Roy, Scott William Phylogenomics: Gene Duplication, Unrecognized Paralogy and Outgroup Choice |
title | Phylogenomics: Gene Duplication, Unrecognized Paralogy and Outgroup Choice |
title_full | Phylogenomics: Gene Duplication, Unrecognized Paralogy and Outgroup Choice |
title_fullStr | Phylogenomics: Gene Duplication, Unrecognized Paralogy and Outgroup Choice |
title_full_unstemmed | Phylogenomics: Gene Duplication, Unrecognized Paralogy and Outgroup Choice |
title_short | Phylogenomics: Gene Duplication, Unrecognized Paralogy and Outgroup Choice |
title_sort | phylogenomics: gene duplication, unrecognized paralogy and outgroup choice |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2640465/ https://www.ncbi.nlm.nih.gov/pubmed/19234600 http://dx.doi.org/10.1371/journal.pone.0004568 |
work_keys_str_mv | AT royscottwilliam phylogenomicsgeneduplicationunrecognizedparalogyandoutgroupchoice |