Cargando…

Phylogenomics: Gene Duplication, Unrecognized Paralogy and Outgroup Choice

Comparative genomics has revealed the ubiquity of gene and genome duplication and subsequent gene loss. In the case of gene duplication and subsequent loss, gene trees can differ from species trees, thus frequent gene duplication poses a challenge for reconstruction of species relationships. Here I...

Descripción completa

Detalles Bibliográficos
Autor principal: Roy, Scott William
Formato: Texto
Lenguaje:English
Publicado: Public Library of Science 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2640465/
https://www.ncbi.nlm.nih.gov/pubmed/19234600
http://dx.doi.org/10.1371/journal.pone.0004568
_version_ 1782164579918282752
author Roy, Scott William
author_facet Roy, Scott William
author_sort Roy, Scott William
collection PubMed
description Comparative genomics has revealed the ubiquity of gene and genome duplication and subsequent gene loss. In the case of gene duplication and subsequent loss, gene trees can differ from species trees, thus frequent gene duplication poses a challenge for reconstruction of species relationships. Here I address the case of multi-gene sets of putative orthologs that include some unrecognized paralogs due to ancestral gene duplication, and ask how outgroups should best be chosen to reduce the degree of non-species tree (NST) signal. Consideration of expected internal branch lengths supports several conclusions: (i) when a single outgroup is used, the degree of NST signal arising from gene duplication is either independent of outgroup choice, or is minimized by use of a maximally closely related post-duplication (MCRPD) outgroup; (ii) when two outgroups are used, NST signal is minimized by using one MCRPD outgroup, while the position of the second outgroup is of lesser importance; and (iii) when two outgroups are used, the ability to detect gene trees that are inconsistent with known aspects of the species tree is maximized by use of one MCRPD, and is either independent of the position of the second outgroup, or is maximized for a more distantly related second outgroup. Overall, these results generalize the utility of closely-related outgroups for phylogenetic analysis.
format Text
id pubmed-2640465
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-26404652009-02-23 Phylogenomics: Gene Duplication, Unrecognized Paralogy and Outgroup Choice Roy, Scott William PLoS One Research Article Comparative genomics has revealed the ubiquity of gene and genome duplication and subsequent gene loss. In the case of gene duplication and subsequent loss, gene trees can differ from species trees, thus frequent gene duplication poses a challenge for reconstruction of species relationships. Here I address the case of multi-gene sets of putative orthologs that include some unrecognized paralogs due to ancestral gene duplication, and ask how outgroups should best be chosen to reduce the degree of non-species tree (NST) signal. Consideration of expected internal branch lengths supports several conclusions: (i) when a single outgroup is used, the degree of NST signal arising from gene duplication is either independent of outgroup choice, or is minimized by use of a maximally closely related post-duplication (MCRPD) outgroup; (ii) when two outgroups are used, NST signal is minimized by using one MCRPD outgroup, while the position of the second outgroup is of lesser importance; and (iii) when two outgroups are used, the ability to detect gene trees that are inconsistent with known aspects of the species tree is maximized by use of one MCRPD, and is either independent of the position of the second outgroup, or is maximized for a more distantly related second outgroup. Overall, these results generalize the utility of closely-related outgroups for phylogenetic analysis. Public Library of Science 2009-02-23 /pmc/articles/PMC2640465/ /pubmed/19234600 http://dx.doi.org/10.1371/journal.pone.0004568 Text en This is an open-access article distributed under the terms of the Creative Commons Public Domain declaration which stipulates that, once placed in the public domain, this work may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose. https://creativecommons.org/publicdomain/zero/1.0/ This is an open-access article distributed under the terms of the Creative Commons Public Domain declaration, which stipulates that, once placed in the public domain, this work may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose.
spellingShingle Research Article
Roy, Scott William
Phylogenomics: Gene Duplication, Unrecognized Paralogy and Outgroup Choice
title Phylogenomics: Gene Duplication, Unrecognized Paralogy and Outgroup Choice
title_full Phylogenomics: Gene Duplication, Unrecognized Paralogy and Outgroup Choice
title_fullStr Phylogenomics: Gene Duplication, Unrecognized Paralogy and Outgroup Choice
title_full_unstemmed Phylogenomics: Gene Duplication, Unrecognized Paralogy and Outgroup Choice
title_short Phylogenomics: Gene Duplication, Unrecognized Paralogy and Outgroup Choice
title_sort phylogenomics: gene duplication, unrecognized paralogy and outgroup choice
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2640465/
https://www.ncbi.nlm.nih.gov/pubmed/19234600
http://dx.doi.org/10.1371/journal.pone.0004568
work_keys_str_mv AT royscottwilliam phylogenomicsgeneduplicationunrecognizedparalogyandoutgroupchoice