Cargando…

Enhanced Methods for Local Ancestry Assignment in Sequenced Admixed Individuals

Inferring the ancestry at each locus in the genome of recently admixed individuals (e.g., Latino Americans) plays a major role in medical and population genetic inferences, ranging from finding disease-risk loci, to inferring recombination rates, to mapping missing contigs in the human genome. Altho...

Descripción completa

Detalles Bibliográficos
Autores principales: Brown, Robert, Pasaniuc, Bogdan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3990492/
https://www.ncbi.nlm.nih.gov/pubmed/24743331
http://dx.doi.org/10.1371/journal.pcbi.1003555
_version_ 1782312288569524224
author Brown, Robert
Pasaniuc, Bogdan
author_facet Brown, Robert
Pasaniuc, Bogdan
author_sort Brown, Robert
collection PubMed
description Inferring the ancestry at each locus in the genome of recently admixed individuals (e.g., Latino Americans) plays a major role in medical and population genetic inferences, ranging from finding disease-risk loci, to inferring recombination rates, to mapping missing contigs in the human genome. Although many methods for local ancestry inference have been proposed, most are designed for use with genotyping arrays and fail to make use of the full spectrum of data available from sequencing. In addition, current haplotype-based approaches are very computationally demanding, requiring large computational time for moderately large sample sizes. Here we present new methods for local ancestry inference that leverage continent-specific variants (CSVs) to attain increased performance over existing approaches in sequenced admixed genomes. A key feature of our approach is that it incorporates the admixed genomes themselves jointly with public datasets, such as 1000 Genomes, to improve the accuracy of CSV calling. We use simulations to show that our approach attains accuracy similar to widely used computationally intensive haplotype-based approaches with large decreases in runtime. Most importantly, we show that our method recovers comparable local ancestries, as the 1000 Genomes consensus local ancestry calls in the real admixed individuals from the 1000 Genomes Project. We extend our approach to account for low-coverage sequencing and show that accurate local ancestry inference can be attained at low sequencing coverage. Finally, we generalize CSVs to sub-continental population-specific variants (sCSVs) and show that in some cases it is possible to determine the sub-continental ancestry for short chromosomal segments on the basis of sCSVs.
format Online
Article
Text
id pubmed-3990492
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-39904922014-04-21 Enhanced Methods for Local Ancestry Assignment in Sequenced Admixed Individuals Brown, Robert Pasaniuc, Bogdan PLoS Comput Biol Research Article Inferring the ancestry at each locus in the genome of recently admixed individuals (e.g., Latino Americans) plays a major role in medical and population genetic inferences, ranging from finding disease-risk loci, to inferring recombination rates, to mapping missing contigs in the human genome. Although many methods for local ancestry inference have been proposed, most are designed for use with genotyping arrays and fail to make use of the full spectrum of data available from sequencing. In addition, current haplotype-based approaches are very computationally demanding, requiring large computational time for moderately large sample sizes. Here we present new methods for local ancestry inference that leverage continent-specific variants (CSVs) to attain increased performance over existing approaches in sequenced admixed genomes. A key feature of our approach is that it incorporates the admixed genomes themselves jointly with public datasets, such as 1000 Genomes, to improve the accuracy of CSV calling. We use simulations to show that our approach attains accuracy similar to widely used computationally intensive haplotype-based approaches with large decreases in runtime. Most importantly, we show that our method recovers comparable local ancestries, as the 1000 Genomes consensus local ancestry calls in the real admixed individuals from the 1000 Genomes Project. We extend our approach to account for low-coverage sequencing and show that accurate local ancestry inference can be attained at low sequencing coverage. Finally, we generalize CSVs to sub-continental population-specific variants (sCSVs) and show that in some cases it is possible to determine the sub-continental ancestry for short chromosomal segments on the basis of sCSVs. Public Library of Science 2014-04-17 /pmc/articles/PMC3990492/ /pubmed/24743331 http://dx.doi.org/10.1371/journal.pcbi.1003555 Text en © 2014 Brown, Pasaniuc http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Brown, Robert
Pasaniuc, Bogdan
Enhanced Methods for Local Ancestry Assignment in Sequenced Admixed Individuals
title Enhanced Methods for Local Ancestry Assignment in Sequenced Admixed Individuals
title_full Enhanced Methods for Local Ancestry Assignment in Sequenced Admixed Individuals
title_fullStr Enhanced Methods for Local Ancestry Assignment in Sequenced Admixed Individuals
title_full_unstemmed Enhanced Methods for Local Ancestry Assignment in Sequenced Admixed Individuals
title_short Enhanced Methods for Local Ancestry Assignment in Sequenced Admixed Individuals
title_sort enhanced methods for local ancestry assignment in sequenced admixed individuals
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3990492/
https://www.ncbi.nlm.nih.gov/pubmed/24743331
http://dx.doi.org/10.1371/journal.pcbi.1003555
work_keys_str_mv AT brownrobert enhancedmethodsforlocalancestryassignmentinsequencedadmixedindividuals
AT pasaniucbogdan enhancedmethodsforlocalancestryassignmentinsequencedadmixedindividuals