Cargando…

Ancestral haplotype reconstruction in endogamous populations using identity-by-descent

In this work we develop a novel algorithm for reconstructing the genomes of ancestral individuals, given genotype or sequence data from contemporary individuals and an extended pedigree of family relationships. A pedigree with complete genomes for every individual enables the study of allele frequen...

Descripción completa

Detalles Bibliográficos
Autores principales: Finke, Kelly, Kourakos, Michael, Brown, Gabriela, Dang, Huyen Trang, Tan, Shi Jie Samuel, Simons, Yuval B., Ramdas, Shweta, Schäffer, Alejandro A., Kember, Rachel L., Bućan, Maja, Mathieson, Sara
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7946327/
https://www.ncbi.nlm.nih.gov/pubmed/33635861
http://dx.doi.org/10.1371/journal.pcbi.1008638
_version_ 1783663031737122816
author Finke, Kelly
Kourakos, Michael
Brown, Gabriela
Dang, Huyen Trang
Tan, Shi Jie Samuel
Simons, Yuval B.
Ramdas, Shweta
Schäffer, Alejandro A.
Kember, Rachel L.
Bućan, Maja
Mathieson, Sara
author_facet Finke, Kelly
Kourakos, Michael
Brown, Gabriela
Dang, Huyen Trang
Tan, Shi Jie Samuel
Simons, Yuval B.
Ramdas, Shweta
Schäffer, Alejandro A.
Kember, Rachel L.
Bućan, Maja
Mathieson, Sara
author_sort Finke, Kelly
collection PubMed
description In this work we develop a novel algorithm for reconstructing the genomes of ancestral individuals, given genotype or sequence data from contemporary individuals and an extended pedigree of family relationships. A pedigree with complete genomes for every individual enables the study of allele frequency dynamics and haplotype diversity across generations, including deviations from neutrality such as transmission distortion. When studying heritable diseases, ancestral haplotypes can be used to augment genome-wide association studies and track disease inheritance patterns. The building blocks of our reconstruction algorithm are segments of Identity-By-Descent (IBD) shared between two or more genotyped individuals. The method alternates between identifying a source for each IBD segment and assembling IBD segments placed within each ancestral individual. Unlike previous approaches, our method is able to accommodate complex pedigree structures with hundreds of individuals genotyped at millions of SNPs. We apply our method to an Old Order Amish pedigree from Lancaster, Pennsylvania, whose founders came to North America from Europe during the early 18th century. The pedigree includes 1338 individuals from the past 12 generations, 394 with genotype data. The motivation for reconstruction is to understand the genetic basis of diseases segregating in the family through tracking haplotype transmission over time. Using our algorithm thread, we are able to reconstruct an average of 224 ancestral individuals per chromosome. For these ancestral individuals, on average we reconstruct 79% of their haplotypes. We also identify a region on chromosome 16 that is difficult to reconstruct—we find that this region harbors a short Amish-specific copy number variation and the gene HYDIN. thread was developed for endogamous populations, but can be applied to any extensive pedigree with the recent generations genotyped. We anticipate that this type of practical ancestral reconstruction will become more common and necessary to understand rare and complex heritable diseases in extended families.
format Online
Article
Text
id pubmed-7946327
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-79463272021-03-19 Ancestral haplotype reconstruction in endogamous populations using identity-by-descent Finke, Kelly Kourakos, Michael Brown, Gabriela Dang, Huyen Trang Tan, Shi Jie Samuel Simons, Yuval B. Ramdas, Shweta Schäffer, Alejandro A. Kember, Rachel L. Bućan, Maja Mathieson, Sara PLoS Comput Biol Research Article In this work we develop a novel algorithm for reconstructing the genomes of ancestral individuals, given genotype or sequence data from contemporary individuals and an extended pedigree of family relationships. A pedigree with complete genomes for every individual enables the study of allele frequency dynamics and haplotype diversity across generations, including deviations from neutrality such as transmission distortion. When studying heritable diseases, ancestral haplotypes can be used to augment genome-wide association studies and track disease inheritance patterns. The building blocks of our reconstruction algorithm are segments of Identity-By-Descent (IBD) shared between two or more genotyped individuals. The method alternates between identifying a source for each IBD segment and assembling IBD segments placed within each ancestral individual. Unlike previous approaches, our method is able to accommodate complex pedigree structures with hundreds of individuals genotyped at millions of SNPs. We apply our method to an Old Order Amish pedigree from Lancaster, Pennsylvania, whose founders came to North America from Europe during the early 18th century. The pedigree includes 1338 individuals from the past 12 generations, 394 with genotype data. The motivation for reconstruction is to understand the genetic basis of diseases segregating in the family through tracking haplotype transmission over time. Using our algorithm thread, we are able to reconstruct an average of 224 ancestral individuals per chromosome. For these ancestral individuals, on average we reconstruct 79% of their haplotypes. We also identify a region on chromosome 16 that is difficult to reconstruct—we find that this region harbors a short Amish-specific copy number variation and the gene HYDIN. thread was developed for endogamous populations, but can be applied to any extensive pedigree with the recent generations genotyped. We anticipate that this type of practical ancestral reconstruction will become more common and necessary to understand rare and complex heritable diseases in extended families. Public Library of Science 2021-02-26 /pmc/articles/PMC7946327/ /pubmed/33635861 http://dx.doi.org/10.1371/journal.pcbi.1008638 Text en https://creativecommons.org/publicdomain/zero/1.0/ This is an open access article, free of all copyright, and may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose. The work is made available under the Creative Commons CC0 (https://creativecommons.org/publicdomain/zero/1.0/) public domain dedication.
spellingShingle Research Article
Finke, Kelly
Kourakos, Michael
Brown, Gabriela
Dang, Huyen Trang
Tan, Shi Jie Samuel
Simons, Yuval B.
Ramdas, Shweta
Schäffer, Alejandro A.
Kember, Rachel L.
Bućan, Maja
Mathieson, Sara
Ancestral haplotype reconstruction in endogamous populations using identity-by-descent
title Ancestral haplotype reconstruction in endogamous populations using identity-by-descent
title_full Ancestral haplotype reconstruction in endogamous populations using identity-by-descent
title_fullStr Ancestral haplotype reconstruction in endogamous populations using identity-by-descent
title_full_unstemmed Ancestral haplotype reconstruction in endogamous populations using identity-by-descent
title_short Ancestral haplotype reconstruction in endogamous populations using identity-by-descent
title_sort ancestral haplotype reconstruction in endogamous populations using identity-by-descent
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7946327/
https://www.ncbi.nlm.nih.gov/pubmed/33635861
http://dx.doi.org/10.1371/journal.pcbi.1008638
work_keys_str_mv AT finkekelly ancestralhaplotypereconstructioninendogamouspopulationsusingidentitybydescent
AT kourakosmichael ancestralhaplotypereconstructioninendogamouspopulationsusingidentitybydescent
AT browngabriela ancestralhaplotypereconstructioninendogamouspopulationsusingidentitybydescent
AT danghuyentrang ancestralhaplotypereconstructioninendogamouspopulationsusingidentitybydescent
AT tanshijiesamuel ancestralhaplotypereconstructioninendogamouspopulationsusingidentitybydescent
AT simonsyuvalb ancestralhaplotypereconstructioninendogamouspopulationsusingidentitybydescent
AT ramdasshweta ancestralhaplotypereconstructioninendogamouspopulationsusingidentitybydescent
AT schafferalejandroa ancestralhaplotypereconstructioninendogamouspopulationsusingidentitybydescent
AT kemberrachell ancestralhaplotypereconstructioninendogamouspopulationsusingidentitybydescent
AT bucanmaja ancestralhaplotypereconstructioninendogamouspopulationsusingidentitybydescent
AT mathiesonsara ancestralhaplotypereconstructioninendogamouspopulationsusingidentitybydescent