Cargando…

De novo inference of stratification and local admixture in sequencing studies

Analysis of population structures and genome local ancestry has become increasingly important in population and disease genetics. With the advance of next generation sequencing technologies, complete genetic variants in individuals' genomes are quickly generated, providing unprecedented opportu...

Descripción completa

Detalles Bibliográficos
Autor principal: Zhang, Yu
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3622634/
https://www.ncbi.nlm.nih.gov/pubmed/23734678
http://dx.doi.org/10.1186/1471-2105-14-S5-S17
_version_ 1782265858040528896
author Zhang, Yu
author_facet Zhang, Yu
author_sort Zhang, Yu
collection PubMed
description Analysis of population structures and genome local ancestry has become increasingly important in population and disease genetics. With the advance of next generation sequencing technologies, complete genetic variants in individuals' genomes are quickly generated, providing unprecedented opportunities for learning population evolution histories and identifying local genetic signatures at the SNP resolution. The successes of those studies critically rely on accurate and powerful computational tools that can fully utilize the sequencing information. Although many algorithms have been developed for population structure inference and admixture mapping, many of them only work for independent SNPs in genotype or haplotype format, and require a large panel of reference individuals. In this paper, we propose a novel probabilistic method for detecting population structure and local admixture. The method takes input of sequencing data, genotype data and haplotype data. The method characterizes the dependence of genetic variants via haplotype segmentation, such that all variants detected in a sequencing study can be fully utilized for inference. The method further utilizes a infinite-state Bayesian Markov model to perform de novo stratification and admixture inference. Using simulated datasets from HapMapII and 1000Genomes, we show that our method performs superior than several existing algorithms, particularly when limited or no reference individuals are available. Our method is applicable to not only human studies but also studies of other species of interests, for which little reference information is available. Software Availability: http://stat.psu.edu/~yuzhang/software/dbm.tar
format Online
Article
Text
id pubmed-3622634
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-36226342013-04-15 De novo inference of stratification and local admixture in sequencing studies Zhang, Yu BMC Bioinformatics Proceedings Analysis of population structures and genome local ancestry has become increasingly important in population and disease genetics. With the advance of next generation sequencing technologies, complete genetic variants in individuals' genomes are quickly generated, providing unprecedented opportunities for learning population evolution histories and identifying local genetic signatures at the SNP resolution. The successes of those studies critically rely on accurate and powerful computational tools that can fully utilize the sequencing information. Although many algorithms have been developed for population structure inference and admixture mapping, many of them only work for independent SNPs in genotype or haplotype format, and require a large panel of reference individuals. In this paper, we propose a novel probabilistic method for detecting population structure and local admixture. The method takes input of sequencing data, genotype data and haplotype data. The method characterizes the dependence of genetic variants via haplotype segmentation, such that all variants detected in a sequencing study can be fully utilized for inference. The method further utilizes a infinite-state Bayesian Markov model to perform de novo stratification and admixture inference. Using simulated datasets from HapMapII and 1000Genomes, we show that our method performs superior than several existing algorithms, particularly when limited or no reference individuals are available. Our method is applicable to not only human studies but also studies of other species of interests, for which little reference information is available. Software Availability: http://stat.psu.edu/~yuzhang/software/dbm.tar BioMed Central 2013-04-10 /pmc/articles/PMC3622634/ /pubmed/23734678 http://dx.doi.org/10.1186/1471-2105-14-S5-S17 Text en Copyright © 2013 Zhang.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Proceedings
Zhang, Yu
De novo inference of stratification and local admixture in sequencing studies
title De novo inference of stratification and local admixture in sequencing studies
title_full De novo inference of stratification and local admixture in sequencing studies
title_fullStr De novo inference of stratification and local admixture in sequencing studies
title_full_unstemmed De novo inference of stratification and local admixture in sequencing studies
title_short De novo inference of stratification and local admixture in sequencing studies
title_sort de novo inference of stratification and local admixture in sequencing studies
topic Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3622634/
https://www.ncbi.nlm.nih.gov/pubmed/23734678
http://dx.doi.org/10.1186/1471-2105-14-S5-S17
work_keys_str_mv AT zhangyu denovoinferenceofstratificationandlocaladmixtureinsequencingstudies