Cargando…

Probabilistic Estimation of Identity by Descent Segment Endpoints and Detection of Recent Selection

Most methods for fast detection of identity by descent (IBD) segments report identity by state segments without any quantification of the uncertainty in the endpoints and lengths of the IBD segments. We present a method for determining the posterior probability distribution of IBD segment endpoints....

Descripción completa

Detalles Bibliográficos
Autores principales: Browning, Sharon R., Browning, Brian L.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7553009/
https://www.ncbi.nlm.nih.gov/pubmed/33053335
http://dx.doi.org/10.1016/j.ajhg.2020.09.010
_version_ 1783593519472967680
author Browning, Sharon R.
Browning, Brian L.
author_facet Browning, Sharon R.
Browning, Brian L.
author_sort Browning, Sharon R.
collection PubMed
description Most methods for fast detection of identity by descent (IBD) segments report identity by state segments without any quantification of the uncertainty in the endpoints and lengths of the IBD segments. We present a method for determining the posterior probability distribution of IBD segment endpoints. Our approach accounts for genotype errors, recent mutations, and gene conversions which disrupt DNA sequence identity within IBD segments, and it can be applied to large cohorts with whole-genome sequence or SNP array data. We find that our method’s estimates of uncertainty are well calibrated for homogeneous samples. We quantify endpoint uncertainty for 77.7 billion IBD segments from 408,883 individuals of white British ancestry in the UK Biobank, and we use these IBD segments to find regions showing evidence of recent natural selection. We show that many spurious selection signals are eliminated by the use of unbiased estimates of IBD segment endpoints and a pedigree-based genetic map. Eleven of the twelve regions with the greatest evidence for recent selection in our scan have been identified as selected in previous analyses using different approaches. Our computationally efficient method for quantifying IBD segment endpoint uncertainty is implemented in the open source ibd-ends software package.
format Online
Article
Text
id pubmed-7553009
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-75530092020-10-13 Probabilistic Estimation of Identity by Descent Segment Endpoints and Detection of Recent Selection Browning, Sharon R. Browning, Brian L. Am J Hum Genet Article Most methods for fast detection of identity by descent (IBD) segments report identity by state segments without any quantification of the uncertainty in the endpoints and lengths of the IBD segments. We present a method for determining the posterior probability distribution of IBD segment endpoints. Our approach accounts for genotype errors, recent mutations, and gene conversions which disrupt DNA sequence identity within IBD segments, and it can be applied to large cohorts with whole-genome sequence or SNP array data. We find that our method’s estimates of uncertainty are well calibrated for homogeneous samples. We quantify endpoint uncertainty for 77.7 billion IBD segments from 408,883 individuals of white British ancestry in the UK Biobank, and we use these IBD segments to find regions showing evidence of recent natural selection. We show that many spurious selection signals are eliminated by the use of unbiased estimates of IBD segment endpoints and a pedigree-based genetic map. Eleven of the twelve regions with the greatest evidence for recent selection in our scan have been identified as selected in previous analyses using different approaches. Our computationally efficient method for quantifying IBD segment endpoint uncertainty is implemented in the open source ibd-ends software package. Elsevier 2020-11-05 2020-10-13 /pmc/articles/PMC7553009/ /pubmed/33053335 http://dx.doi.org/10.1016/j.ajhg.2020.09.010 Text en © 2020 American Society of Human Genetics.
spellingShingle Article
Browning, Sharon R.
Browning, Brian L.
Probabilistic Estimation of Identity by Descent Segment Endpoints and Detection of Recent Selection
title Probabilistic Estimation of Identity by Descent Segment Endpoints and Detection of Recent Selection
title_full Probabilistic Estimation of Identity by Descent Segment Endpoints and Detection of Recent Selection
title_fullStr Probabilistic Estimation of Identity by Descent Segment Endpoints and Detection of Recent Selection
title_full_unstemmed Probabilistic Estimation of Identity by Descent Segment Endpoints and Detection of Recent Selection
title_short Probabilistic Estimation of Identity by Descent Segment Endpoints and Detection of Recent Selection
title_sort probabilistic estimation of identity by descent segment endpoints and detection of recent selection
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7553009/
https://www.ncbi.nlm.nih.gov/pubmed/33053335
http://dx.doi.org/10.1016/j.ajhg.2020.09.010
work_keys_str_mv AT browningsharonr probabilisticestimationofidentitybydescentsegmentendpointsanddetectionofrecentselection
AT browningbrianl probabilisticestimationofidentitybydescentsegmentendpointsanddetectionofrecentselection