Cargando…

A linear-time algorithm for reconstructing zero-recombinant haplotype configuration on a pedigree

BACKGROUND: When studying genetic diseases in which genetic variations are passed on to offspring, the ability to distinguish between paternal and maternal alleles is essential. Determining haplotypes from genotype data is called haplotype inference. Most existing computational algorithms for haplot...

Descripción completa

Detalles Bibliográficos
Autores principales: Lai, En-Yu, Wang, Wei-Bung, Jiang, Tao, Wu, Kun-Pin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3521470/
https://www.ncbi.nlm.nih.gov/pubmed/23281626
http://dx.doi.org/10.1186/1471-2105-13-S17-S19
_version_ 1782252961445969920
author Lai, En-Yu
Wang, Wei-Bung
Jiang, Tao
Wu, Kun-Pin
author_facet Lai, En-Yu
Wang, Wei-Bung
Jiang, Tao
Wu, Kun-Pin
author_sort Lai, En-Yu
collection PubMed
description BACKGROUND: When studying genetic diseases in which genetic variations are passed on to offspring, the ability to distinguish between paternal and maternal alleles is essential. Determining haplotypes from genotype data is called haplotype inference. Most existing computational algorithms for haplotype inference have been designed to use genotype data collected from individuals in the form of a pedigree. A haplotype is regarded as a hereditary unit and therefore input pedigrees are preferred that are free of mutational events and have a minimum number of genetic recombinational events. These ideas motivated the zero-recombinant haplotype configuration (ZRHC) problem, which strictly follows the Mendelian law of inheritance, namely that one haplotype of each child is inherited from the father and the other haplotype is inherited from the mother, both without any mutation. So far no linear-time algorithm for ZRHC has been proposed for general pedigrees, even though the number of mating loops in a human pedigree is usually very small and can be regarded as constant. RESULTS: Given a pedigree with n individuals, m marker loci, and k mating loops, we proposed an algorithm that can provide a general solution to the zero-recombinant haplotype configuration problem in O(kmn + k(2)m) time. In addition, this algorithm can be modified to detect inconsistencies within the genotype data without loss of efficiency. The proposed algorithm was subject to 12000 experiments to verify its performance using different (n, m) combinations. The value of k was uniformly distributed between zero and six throughout all experiments. The experimental results show a great linearity in terms of execution time in relation to input size when both n and m are larger than 100. For those experiments where n or m are less than 100, the proposed algorithm runs very fast, in thousandth to hundredth of a second, on a personal desktop computer. CONCLUSIONS: We have developed the first deterministic linear-time algorithm for the zero-recombinant haplotype configuration problem. Our experimental results demonstrated the linearity of its execution time in relation to the input size. The proposed algorithm can be modified to detect inconsistency within the genotype data without loss of efficiency and is expected to be able to handle recombinant and missing data with further extension.
format Online
Article
Text
id pubmed-3521470
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-35214702012-12-14 A linear-time algorithm for reconstructing zero-recombinant haplotype configuration on a pedigree Lai, En-Yu Wang, Wei-Bung Jiang, Tao Wu, Kun-Pin BMC Bioinformatics Proceedings BACKGROUND: When studying genetic diseases in which genetic variations are passed on to offspring, the ability to distinguish between paternal and maternal alleles is essential. Determining haplotypes from genotype data is called haplotype inference. Most existing computational algorithms for haplotype inference have been designed to use genotype data collected from individuals in the form of a pedigree. A haplotype is regarded as a hereditary unit and therefore input pedigrees are preferred that are free of mutational events and have a minimum number of genetic recombinational events. These ideas motivated the zero-recombinant haplotype configuration (ZRHC) problem, which strictly follows the Mendelian law of inheritance, namely that one haplotype of each child is inherited from the father and the other haplotype is inherited from the mother, both without any mutation. So far no linear-time algorithm for ZRHC has been proposed for general pedigrees, even though the number of mating loops in a human pedigree is usually very small and can be regarded as constant. RESULTS: Given a pedigree with n individuals, m marker loci, and k mating loops, we proposed an algorithm that can provide a general solution to the zero-recombinant haplotype configuration problem in O(kmn + k(2)m) time. In addition, this algorithm can be modified to detect inconsistencies within the genotype data without loss of efficiency. The proposed algorithm was subject to 12000 experiments to verify its performance using different (n, m) combinations. The value of k was uniformly distributed between zero and six throughout all experiments. The experimental results show a great linearity in terms of execution time in relation to input size when both n and m are larger than 100. For those experiments where n or m are less than 100, the proposed algorithm runs very fast, in thousandth to hundredth of a second, on a personal desktop computer. CONCLUSIONS: We have developed the first deterministic linear-time algorithm for the zero-recombinant haplotype configuration problem. Our experimental results demonstrated the linearity of its execution time in relation to the input size. The proposed algorithm can be modified to detect inconsistency within the genotype data without loss of efficiency and is expected to be able to handle recombinant and missing data with further extension. BioMed Central 2012-12-07 /pmc/articles/PMC3521470/ /pubmed/23281626 http://dx.doi.org/10.1186/1471-2105-13-S17-S19 Text en Copyright ©2012 Lai et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Proceedings
Lai, En-Yu
Wang, Wei-Bung
Jiang, Tao
Wu, Kun-Pin
A linear-time algorithm for reconstructing zero-recombinant haplotype configuration on a pedigree
title A linear-time algorithm for reconstructing zero-recombinant haplotype configuration on a pedigree
title_full A linear-time algorithm for reconstructing zero-recombinant haplotype configuration on a pedigree
title_fullStr A linear-time algorithm for reconstructing zero-recombinant haplotype configuration on a pedigree
title_full_unstemmed A linear-time algorithm for reconstructing zero-recombinant haplotype configuration on a pedigree
title_short A linear-time algorithm for reconstructing zero-recombinant haplotype configuration on a pedigree
title_sort linear-time algorithm for reconstructing zero-recombinant haplotype configuration on a pedigree
topic Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3521470/
https://www.ncbi.nlm.nih.gov/pubmed/23281626
http://dx.doi.org/10.1186/1471-2105-13-S17-S19
work_keys_str_mv AT laienyu alineartimealgorithmforreconstructingzerorecombinanthaplotypeconfigurationonapedigree
AT wangweibung alineartimealgorithmforreconstructingzerorecombinanthaplotypeconfigurationonapedigree
AT jiangtao alineartimealgorithmforreconstructingzerorecombinanthaplotypeconfigurationonapedigree
AT wukunpin alineartimealgorithmforreconstructingzerorecombinanthaplotypeconfigurationonapedigree
AT laienyu lineartimealgorithmforreconstructingzerorecombinanthaplotypeconfigurationonapedigree
AT wangweibung lineartimealgorithmforreconstructingzerorecombinanthaplotypeconfigurationonapedigree
AT jiangtao lineartimealgorithmforreconstructingzerorecombinanthaplotypeconfigurationonapedigree
AT wukunpin lineartimealgorithmforreconstructingzerorecombinanthaplotypeconfigurationonapedigree