
Improving the Accuracy and Efficiency of Identity-by-Descent Detection in Population Data

Segments of identity-by-descent (IBD) detected from high-density genetic data are useful for many applications, including long-range phase determination, phasing family data, imputation, IBD mapping, and heritability analysis in founder populations. We present Refined IBD, a new method for IBD segm...


Bibliographic Details
Main Authors: Browning, Brian L., Browning, Sharon R.
Format: Online Article Text
Language: English
Published: Genetics Society of America 2013
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3664855/
https://www.ncbi.nlm.nih.gov/pubmed/23535385
http://dx.doi.org/10.1534/genetics.113.150029
_version_ 1782271172932534272
author Browning, Brian L.
Browning, Sharon R.
author_facet Browning, Brian L.
Browning, Sharon R.
author_sort Browning, Brian L.
collection PubMed
description Segments of identity-by-descent (IBD) detected from high-density genetic data are useful for many applications, including long-range phase determination, phasing family data, imputation, IBD mapping, and heritability analysis in founder populations. We present Refined IBD, a new method for IBD segment detection. Refined IBD achieves both computational efficiency and highly accurate IBD segment reporting by searching for IBD in two steps. The first step (identification) uses the GERMLINE algorithm to find shared haplotypes exceeding a length threshold. The second step (refinement) evaluates candidate segments with a probabilistic approach to assess the evidence for IBD. Like GERMLINE, Refined IBD allows for IBD reporting on a haplotype level, which facilitates determination of multi-individual IBD and allows for haplotype-based downstream analyses. To investigate the properties of Refined IBD, we simulate SNP data from a model with recent superexponential population growth that is designed to match United Kingdom data. The simulation results show that Refined IBD achieves a better power/accuracy profile than fastIBD or GERMLINE. We find that a single run of Refined IBD achieves greater power than 10 runs of fastIBD. We also apply Refined IBD to SNP data for samples from the United Kingdom and from Northern Finland and describe the IBD sharing in these data sets. Refined IBD is powerful, highly accurate, and easy to use and is implemented in Beagle version 4.
format Online
Article
Text
id pubmed-3664855
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Genetics Society of America
record_format MEDLINE/PubMed
spelling pubmed-36648552013-06-01 Improving the Accuracy and Efficiency of Identity-by-Descent Detection in Population Data Browning, Brian L. Browning, Sharon R. Genetics Investigations Segments of identity-by-descent (IBD) detected from high-density genetic data are useful for many applications, including long-range phase determination, phasing family data, imputation, IBD mapping, and heritability analysis in founder populations. We present Refined IBD, a new method for IBD segment detection. Refined IBD achieves both computational efficiency and highly accurate IBD segment reporting by searching for IBD in two steps. The first step (identification) uses the GERMLINE algorithm to find shared haplotypes exceeding a length threshold. The second step (refinement) evaluates candidate segments with a probabilistic approach to assess the evidence for IBD. Like GERMLINE, Refined IBD allows for IBD reporting on a haplotype level, which facilitates determination of multi-individual IBD and allows for haplotype-based downstream analyses. To investigate the properties of Refined IBD, we simulate SNP data from a model with recent superexponential population growth that is designed to match United Kingdom data. The simulation results show that Refined IBD achieves a better power/accuracy profile than fastIBD or GERMLINE. We find that a single run of Refined IBD achieves greater power than 10 runs of fastIBD. We also apply Refined IBD to SNP data for samples from the United Kingdom and from Northern Finland and describe the IBD sharing in these data sets. Refined IBD is powerful, highly accurate, and easy to use and is implemented in Beagle version 4. Genetics Society of America 2013-06 /pmc/articles/PMC3664855/ /pubmed/23535385 http://dx.doi.org/10.1534/genetics.113.150029 Text en Copyright © 2013 by the Genetics Society of America Available freely online through the author-supported open access option.
spellingShingle Investigations
Browning, Brian L.
Browning, Sharon R.
Improving the Accuracy and Efficiency of Identity-by-Descent Detection in Population Data
title Improving the Accuracy and Efficiency of Identity-by-Descent Detection in Population Data
title_full Improving the Accuracy and Efficiency of Identity-by-Descent Detection in Population Data
title_fullStr Improving the Accuracy and Efficiency of Identity-by-Descent Detection in Population Data
title_full_unstemmed Improving the Accuracy and Efficiency of Identity-by-Descent Detection in Population Data
title_short Improving the Accuracy and Efficiency of Identity-by-Descent Detection in Population Data
title_sort improving the accuracy and efficiency of identity-by-descent detection in population data
topic Investigations
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3664855/
https://www.ncbi.nlm.nih.gov/pubmed/23535385
http://dx.doi.org/10.1534/genetics.113.150029
work_keys_str_mv AT browningbrianl improvingtheaccuracyandefficiencyofidentitybydescentdetectioninpopulationdata
AT browningsharonr improvingtheaccuracyandefficiencyofidentitybydescentdetectioninpopulationdata