Cargando…

Efficient genome ancestry inference in complex pedigrees with inbreeding

Motivation: High-density SNP data of model animal resources provides opportunities for fine-resolution genetic variation studies. These genetic resources are generated through a variety of breeding schemes that involve multiple generations of matings derived from a set of founder animals. In this ar...

Descripción completa

Detalles Bibliográficos
Autores principales: Liu, Eric Yi, Zhang, Qi, McMillan, Leonard, de Villena, Fernando Pardo-Manuel, Wang, Wei
Formato: Texto
Lenguaje:English
Publicado: Oxford University Press 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2881372/
https://www.ncbi.nlm.nih.gov/pubmed/20529906
http://dx.doi.org/10.1093/bioinformatics/btq187
_version_ 1782182108937060352
author Liu, Eric Yi
Zhang, Qi
McMillan, Leonard
de Villena, Fernando Pardo-Manuel
Wang, Wei
author_facet Liu, Eric Yi
Zhang, Qi
McMillan, Leonard
de Villena, Fernando Pardo-Manuel
Wang, Wei
author_sort Liu, Eric Yi
collection PubMed
description Motivation: High-density SNP data of model animal resources provides opportunities for fine-resolution genetic variation studies. These genetic resources are generated through a variety of breeding schemes that involve multiple generations of matings derived from a set of founder animals. In this article, we investigate the problem of inferring the most probable ancestry of resulting genotypes, given a set of founder genotypes. Due to computational difficulty, existing methods either handle only small pedigree data or disregard the pedigree structure. However, large pedigrees of model animal resources often contain repetitive substructures that can be utilized in accelerating computation. Results: We present an accurate and efficient method that can accept complex pedigrees with inbreeding in inferring genome ancestry. Inbreeding is a commonly used process in generating genetically diverse and reproducible animals. It is often carried out for many generations and can account for most of the computational complexity in real-world model animal pedigrees. Our method builds a hidden Markov model that derives the ancestry probabilities through inbreeding process without explicit modeling in every generation. The ancestry inference is accurate and fast, independent of the number of generations, for model animal resources such as the Collaborative Cross (CC). Experiments on both simulated and real CC data demonstrate that our method offers comparable accuracy to those methods that build an explicit model of the entire pedigree, but much better scalability with respect to the pedigree size. Contact: weiwang@cs.unc.edu
format Text
id pubmed-2881372
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-28813722010-06-08 Efficient genome ancestry inference in complex pedigrees with inbreeding Liu, Eric Yi Zhang, Qi McMillan, Leonard de Villena, Fernando Pardo-Manuel Wang, Wei Bioinformatics Ismb 2010 Conference Proceedings July 11 to July 13, 2010, Boston, Ma, Usa Motivation: High-density SNP data of model animal resources provides opportunities for fine-resolution genetic variation studies. These genetic resources are generated through a variety of breeding schemes that involve multiple generations of matings derived from a set of founder animals. In this article, we investigate the problem of inferring the most probable ancestry of resulting genotypes, given a set of founder genotypes. Due to computational difficulty, existing methods either handle only small pedigree data or disregard the pedigree structure. However, large pedigrees of model animal resources often contain repetitive substructures that can be utilized in accelerating computation. Results: We present an accurate and efficient method that can accept complex pedigrees with inbreeding in inferring genome ancestry. Inbreeding is a commonly used process in generating genetically diverse and reproducible animals. It is often carried out for many generations and can account for most of the computational complexity in real-world model animal pedigrees. Our method builds a hidden Markov model that derives the ancestry probabilities through inbreeding process without explicit modeling in every generation. The ancestry inference is accurate and fast, independent of the number of generations, for model animal resources such as the Collaborative Cross (CC). Experiments on both simulated and real CC data demonstrate that our method offers comparable accuracy to those methods that build an explicit model of the entire pedigree, but much better scalability with respect to the pedigree size. Contact: weiwang@cs.unc.edu Oxford University Press 2010-06-15 2010-06-01 /pmc/articles/PMC2881372/ /pubmed/20529906 http://dx.doi.org/10.1093/bioinformatics/btq187 Text en © The Author(s) 2010. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/2.5 This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.5), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Ismb 2010 Conference Proceedings July 11 to July 13, 2010, Boston, Ma, Usa
Liu, Eric Yi
Zhang, Qi
McMillan, Leonard
de Villena, Fernando Pardo-Manuel
Wang, Wei
Efficient genome ancestry inference in complex pedigrees with inbreeding
title Efficient genome ancestry inference in complex pedigrees with inbreeding
title_full Efficient genome ancestry inference in complex pedigrees with inbreeding
title_fullStr Efficient genome ancestry inference in complex pedigrees with inbreeding
title_full_unstemmed Efficient genome ancestry inference in complex pedigrees with inbreeding
title_short Efficient genome ancestry inference in complex pedigrees with inbreeding
title_sort efficient genome ancestry inference in complex pedigrees with inbreeding
topic Ismb 2010 Conference Proceedings July 11 to July 13, 2010, Boston, Ma, Usa
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2881372/
https://www.ncbi.nlm.nih.gov/pubmed/20529906
http://dx.doi.org/10.1093/bioinformatics/btq187
work_keys_str_mv AT liuericyi efficientgenomeancestryinferenceincomplexpedigreeswithinbreeding
AT zhangqi efficientgenomeancestryinferenceincomplexpedigreeswithinbreeding
AT mcmillanleonard efficientgenomeancestryinferenceincomplexpedigreeswithinbreeding
AT devillenafernandopardomanuel efficientgenomeancestryinferenceincomplexpedigreeswithinbreeding
AT wangwei efficientgenomeancestryinferenceincomplexpedigreeswithinbreeding