Cargando…

2× genomes - depth does matter

BACKGROUND: Given the availability of full genome sequences, mapping gene gains, duplications, and losses during evolution should theoretically be straightforward. However, this endeavor suffers from overemphasis on detecting conserved genome features, which in turn has led to sequencing multiple eu...

Descripción completa

Detalles Bibliográficos
Autores principales: Milinkovitch, Michel C, Helaers, Raphaël, Depiereux, Eric, Tzika, Athanasia C, Gabaldón, Toni
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2872876/
https://www.ncbi.nlm.nih.gov/pubmed/20144222
http://dx.doi.org/10.1186/gb-2010-11-2-r16
_version_ 1782181278237327360
author Milinkovitch, Michel C
Helaers, Raphaël
Depiereux, Eric
Tzika, Athanasia C
Gabaldón, Toni
author_facet Milinkovitch, Michel C
Helaers, Raphaël
Depiereux, Eric
Tzika, Athanasia C
Gabaldón, Toni
author_sort Milinkovitch, Michel C
collection PubMed
description BACKGROUND: Given the availability of full genome sequences, mapping gene gains, duplications, and losses during evolution should theoretically be straightforward. However, this endeavor suffers from overemphasis on detecting conserved genome features, which in turn has led to sequencing multiple eutherian genomes with low coverage rather than fewer genomes with high-coverage and more even distribution in the phylogeny. Although limitations associated with analysis of low coverage genomes are recognized, they have not been quantified. RESULTS: Here, using recently developed comparative genomic application systems, we evaluate the impact of low-coverage genomes on inferences pertaining to gene gains and losses when analyzing eukaryote genome evolution through gene duplication. We demonstrate that, when performing inference of genome content evolution, low-coverage genomes generate not only a massive number of false gene losses, but also striking artifacts in gene duplication inference, especially at the most recent common ancestor of low-coverage genomes. We show that the artifactual gains are caused by the low coverage of genome sequence per se rather than by the increased taxon sampling in a biased portion of the species tree. CONCLUSIONS: We argue that it will remain difficult to differentiate artifacts from true changes in modes and tempo of genome evolution until there is better homogeneity in both taxon sampling and high-coverage sequencing. This is important for broadening the utility of full genome data to the community of evolutionary biologists, whose interests go well beyond widely conserved physiologies and developmental patterns as they seek to understand the generative mechanisms underlying biological diversity.
format Text
id pubmed-2872876
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-28728762010-05-20 2× genomes - depth does matter Milinkovitch, Michel C Helaers, Raphaël Depiereux, Eric Tzika, Athanasia C Gabaldón, Toni Genome Biol Research BACKGROUND: Given the availability of full genome sequences, mapping gene gains, duplications, and losses during evolution should theoretically be straightforward. However, this endeavor suffers from overemphasis on detecting conserved genome features, which in turn has led to sequencing multiple eutherian genomes with low coverage rather than fewer genomes with high-coverage and more even distribution in the phylogeny. Although limitations associated with analysis of low coverage genomes are recognized, they have not been quantified. RESULTS: Here, using recently developed comparative genomic application systems, we evaluate the impact of low-coverage genomes on inferences pertaining to gene gains and losses when analyzing eukaryote genome evolution through gene duplication. We demonstrate that, when performing inference of genome content evolution, low-coverage genomes generate not only a massive number of false gene losses, but also striking artifacts in gene duplication inference, especially at the most recent common ancestor of low-coverage genomes. We show that the artifactual gains are caused by the low coverage of genome sequence per se rather than by the increased taxon sampling in a biased portion of the species tree. CONCLUSIONS: We argue that it will remain difficult to differentiate artifacts from true changes in modes and tempo of genome evolution until there is better homogeneity in both taxon sampling and high-coverage sequencing. This is important for broadening the utility of full genome data to the community of evolutionary biologists, whose interests go well beyond widely conserved physiologies and developmental patterns as they seek to understand the generative mechanisms underlying biological diversity. BioMed Central 2010 2010-02-09 /pmc/articles/PMC2872876/ /pubmed/20144222 http://dx.doi.org/10.1186/gb-2010-11-2-r16 Text en Copyright ©2010 Milinkovitch et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research
Milinkovitch, Michel C
Helaers, Raphaël
Depiereux, Eric
Tzika, Athanasia C
Gabaldón, Toni
2× genomes - depth does matter
title 2× genomes - depth does matter
title_full 2× genomes - depth does matter
title_fullStr 2× genomes - depth does matter
title_full_unstemmed 2× genomes - depth does matter
title_short 2× genomes - depth does matter
title_sort 2× genomes - depth does matter
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2872876/
https://www.ncbi.nlm.nih.gov/pubmed/20144222
http://dx.doi.org/10.1186/gb-2010-11-2-r16
work_keys_str_mv AT milinkovitchmichelc 2genomesdepthdoesmatter
AT helaersraphael 2genomesdepthdoesmatter
AT depiereuxeric 2genomesdepthdoesmatter
AT tzikaathanasiac 2genomesdepthdoesmatter
AT gabaldontoni 2genomesdepthdoesmatter