Cargando…

Parente2: a fast and accurate method for detecting identity by descent

Identity-by-descent (IBD) inference is the problem of establishing a genetic connection between two individuals through a genomic segment that is inherited by both individuals from a recent common ancestor. IBD inference is an important preceding step in a variety of population genomic studies, rang...

Descripción completa

Detalles Bibliográficos
Autores principales: Rodriguez, Jesse M., Bercovici, Sivan, Huang, Lin, Frostig, Roy, Batzoglou, Serafim
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cold Spring Harbor Laboratory Press 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4315301/
https://www.ncbi.nlm.nih.gov/pubmed/25273070
http://dx.doi.org/10.1101/gr.173641.114
_version_ 1782355456067371008
author Rodriguez, Jesse M.
Bercovici, Sivan
Huang, Lin
Frostig, Roy
Batzoglou, Serafim
author_facet Rodriguez, Jesse M.
Bercovici, Sivan
Huang, Lin
Frostig, Roy
Batzoglou, Serafim
author_sort Rodriguez, Jesse M.
collection PubMed
description Identity-by-descent (IBD) inference is the problem of establishing a genetic connection between two individuals through a genomic segment that is inherited by both individuals from a recent common ancestor. IBD inference is an important preceding step in a variety of population genomic studies, ranging from demographic studies to linking genomic variation with phenotype and disease. The problem of accurate IBD detection has become increasingly challenging with the availability of large collections of human genotypes and genomes: Given a cohort’s size, a quadratic number of pairwise genome comparisons must be performed. Therefore, computation time and the false discovery rate can also scale quadratically. To enable accurate and efficient large-scale IBD detection, we present Parente2, a novel method for detecting IBD segments. Parente2 is based on an embedded log-likelihood ratio and uses a model that accounts for linkage disequilibrium by explicitly modeling haplotype frequencies. Parente2 operates directly on genotype data without the need to phase data prior to IBD inference. We evaluate Parente2’s performance through extensive simulations using real data, and we show that it provides substantially higher accuracy compared to previous state-of-the-art methods while maintaining high computational efficiency.
format Online
Article
Text
id pubmed-4315301
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Cold Spring Harbor Laboratory Press
record_format MEDLINE/PubMed
spelling pubmed-43153012015-02-05 Parente2: a fast and accurate method for detecting identity by descent Rodriguez, Jesse M. Bercovici, Sivan Huang, Lin Frostig, Roy Batzoglou, Serafim Genome Res Method Identity-by-descent (IBD) inference is the problem of establishing a genetic connection between two individuals through a genomic segment that is inherited by both individuals from a recent common ancestor. IBD inference is an important preceding step in a variety of population genomic studies, ranging from demographic studies to linking genomic variation with phenotype and disease. The problem of accurate IBD detection has become increasingly challenging with the availability of large collections of human genotypes and genomes: Given a cohort’s size, a quadratic number of pairwise genome comparisons must be performed. Therefore, computation time and the false discovery rate can also scale quadratically. To enable accurate and efficient large-scale IBD detection, we present Parente2, a novel method for detecting IBD segments. Parente2 is based on an embedded log-likelihood ratio and uses a model that accounts for linkage disequilibrium by explicitly modeling haplotype frequencies. Parente2 operates directly on genotype data without the need to phase data prior to IBD inference. We evaluate Parente2’s performance through extensive simulations using real data, and we show that it provides substantially higher accuracy compared to previous state-of-the-art methods while maintaining high computational efficiency. Cold Spring Harbor Laboratory Press 2015-02 /pmc/articles/PMC4315301/ /pubmed/25273070 http://dx.doi.org/10.1101/gr.173641.114 Text en © 2015 Rodriguez et al.; Published by Cold Spring Harbor Laboratory Press http://creativecommons.org/licenses/by-nc/4.0/ This article, published in Genome Research, is available under a Creative Commons License (Attribution-NonCommercial 4.0 International), as described at http://creativecommons.org/licenses/by-nc/4.0/.
spellingShingle Method
Rodriguez, Jesse M.
Bercovici, Sivan
Huang, Lin
Frostig, Roy
Batzoglou, Serafim
Parente2: a fast and accurate method for detecting identity by descent
title Parente2: a fast and accurate method for detecting identity by descent
title_full Parente2: a fast and accurate method for detecting identity by descent
title_fullStr Parente2: a fast and accurate method for detecting identity by descent
title_full_unstemmed Parente2: a fast and accurate method for detecting identity by descent
title_short Parente2: a fast and accurate method for detecting identity by descent
title_sort parente2: a fast and accurate method for detecting identity by descent
topic Method
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4315301/
https://www.ncbi.nlm.nih.gov/pubmed/25273070
http://dx.doi.org/10.1101/gr.173641.114
work_keys_str_mv AT rodriguezjessem parente2afastandaccuratemethodfordetectingidentitybydescent
AT bercovicisivan parente2afastandaccuratemethodfordetectingidentitybydescent
AT huanglin parente2afastandaccuratemethodfordetectingidentitybydescent
AT frostigroy parente2afastandaccuratemethodfordetectingidentitybydescent
AT batzoglouserafim parente2afastandaccuratemethodfordetectingidentitybydescent