Cargando…

A human genome-wide library of local phylogeny predictions for whole-genome inference problems

BACKGROUND: Many common inference problems in computational genetics depend on inferring aspects of the evolutionary history of a data set given a set of observed modern sequences. Detailed predictions of the full phylogenies are therefore of value in improving our ability to make further inferences...

Descripción completa

Detalles Bibliográficos
Autores principales: Sridhar, Srinath, Schwartz, Russell
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2008
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2556685/
https://www.ncbi.nlm.nih.gov/pubmed/18710563
http://dx.doi.org/10.1186/1471-2164-9-389
_version_ 1782159585874804736
author Sridhar, Srinath
Schwartz, Russell
author_facet Sridhar, Srinath
Schwartz, Russell
author_sort Sridhar, Srinath
collection PubMed
description BACKGROUND: Many common inference problems in computational genetics depend on inferring aspects of the evolutionary history of a data set given a set of observed modern sequences. Detailed predictions of the full phylogenies are therefore of value in improving our ability to make further inferences about population history and sources of genetic variation. Making phylogenetic predictions on the scale needed for whole-genome analysis is, however, extremely computationally demanding. RESULTS: In order to facilitate phylogeny-based predictions on a genomic scale, we develop a library of maximum parsimony phylogenies within local regions spanning all autosomal human chromosomes based on Haplotype Map variation data. We demonstrate the utility of this library for population genetic inferences by examining a tree statistic we call 'imperfection,' which measures the reuse of variant sites within a phylogeny. This statistic is significantly predictive of recombination rate, shows additional regional and population-specific conservation, and allows us to identify outlier genes likely to have experienced unusual amounts of variation in recent human history. CONCLUSION: Recent theoretical advances in algorithms for phylogenetic tree reconstruction have made it possible to perform large-scale inferences of local maximum parsimony phylogenies from single nucleotide polymorphism (SNP) data. As results from the imperfection statistic demonstrate, phylogeny predictions encode substantial information useful for detecting genomic features and population history. This data set should serve as a platform for many kinds of inferences one may wish to make about human population history and genetic variation.
format Text
id pubmed-2556685
institution National Center for Biotechnology Information
language English
publishDate 2008
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-25566852008-10-01 A human genome-wide library of local phylogeny predictions for whole-genome inference problems Sridhar, Srinath Schwartz, Russell BMC Genomics Research Article BACKGROUND: Many common inference problems in computational genetics depend on inferring aspects of the evolutionary history of a data set given a set of observed modern sequences. Detailed predictions of the full phylogenies are therefore of value in improving our ability to make further inferences about population history and sources of genetic variation. Making phylogenetic predictions on the scale needed for whole-genome analysis is, however, extremely computationally demanding. RESULTS: In order to facilitate phylogeny-based predictions on a genomic scale, we develop a library of maximum parsimony phylogenies within local regions spanning all autosomal human chromosomes based on Haplotype Map variation data. We demonstrate the utility of this library for population genetic inferences by examining a tree statistic we call 'imperfection,' which measures the reuse of variant sites within a phylogeny. This statistic is significantly predictive of recombination rate, shows additional regional and population-specific conservation, and allows us to identify outlier genes likely to have experienced unusual amounts of variation in recent human history. CONCLUSION: Recent theoretical advances in algorithms for phylogenetic tree reconstruction have made it possible to perform large-scale inferences of local maximum parsimony phylogenies from single nucleotide polymorphism (SNP) data. As results from the imperfection statistic demonstrate, phylogeny predictions encode substantial information useful for detecting genomic features and population history. This data set should serve as a platform for many kinds of inferences one may wish to make about human population history and genetic variation. BioMed Central 2008-08-18 /pmc/articles/PMC2556685/ /pubmed/18710563 http://dx.doi.org/10.1186/1471-2164-9-389 Text en Copyright © 2008 Sridhar and Schwartz; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Sridhar, Srinath
Schwartz, Russell
A human genome-wide library of local phylogeny predictions for whole-genome inference problems
title A human genome-wide library of local phylogeny predictions for whole-genome inference problems
title_full A human genome-wide library of local phylogeny predictions for whole-genome inference problems
title_fullStr A human genome-wide library of local phylogeny predictions for whole-genome inference problems
title_full_unstemmed A human genome-wide library of local phylogeny predictions for whole-genome inference problems
title_short A human genome-wide library of local phylogeny predictions for whole-genome inference problems
title_sort human genome-wide library of local phylogeny predictions for whole-genome inference problems
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2556685/
https://www.ncbi.nlm.nih.gov/pubmed/18710563
http://dx.doi.org/10.1186/1471-2164-9-389
work_keys_str_mv AT sridharsrinath ahumangenomewidelibraryoflocalphylogenypredictionsforwholegenomeinferenceproblems
AT schwartzrussell ahumangenomewidelibraryoflocalphylogenypredictionsforwholegenomeinferenceproblems
AT sridharsrinath humangenomewidelibraryoflocalphylogenypredictionsforwholegenomeinferenceproblems
AT schwartzrussell humangenomewidelibraryoflocalphylogenypredictionsforwholegenomeinferenceproblems