Cargando…

A Class Representative Model for Pure Parsimony Haplotyping under Uncertain Data

The Pure Parsimony Haplotyping (PPH) problem is a NP-hard combinatorial optimization problem that consists of finding the minimum number of haplotypes necessary to explain a given set of genotypes. PPH has attracted more and more attention in recent years due to its importance in analysis of many fi...

Descripción completa

Detalles Bibliográficos
Autores principales: Catanzaro, Daniele, Labbé, Martine, Porretta, Luciano
Formato: Texto
Lenguaje:English
Publicado: Public Library of Science 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3064666/
https://www.ncbi.nlm.nih.gov/pubmed/21464966
http://dx.doi.org/10.1371/journal.pone.0017937
_version_ 1782200910822244352
author Catanzaro, Daniele
Labbé, Martine
Porretta, Luciano
author_facet Catanzaro, Daniele
Labbé, Martine
Porretta, Luciano
author_sort Catanzaro, Daniele
collection PubMed
description The Pure Parsimony Haplotyping (PPH) problem is a NP-hard combinatorial optimization problem that consists of finding the minimum number of haplotypes necessary to explain a given set of genotypes. PPH has attracted more and more attention in recent years due to its importance in analysis of many fine-scale genetic data. Its application fields range from mapping complex disease genes to inferring population histories, passing through designing drugs, functional genomics and pharmacogenetics. In this article we investigate, for the first time, a recent version of PPH called the Pure Parsimony Haplotype problem under Uncertain Data (PPH-UD). This version mainly arises when the input genotypes are not accurate, i.e., when some single nucleotide polymorphisms are missing or affected by errors. We propose an exact approach to solution of PPH-UD based on an extended version of Catanzaro et al. [1] class representative model for PPH, currently the state-of-the-art integer programming model for PPH. The model is efficient, accurate, compact, polynomial-sized, easy to implement, solvable with any solver for mixed integer programming, and usable in all those cases for which the parsimony criterion is well suited for haplotype estimation.
format Text
id pubmed-3064666
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-30646662011-04-04 A Class Representative Model for Pure Parsimony Haplotyping under Uncertain Data Catanzaro, Daniele Labbé, Martine Porretta, Luciano PLoS One Research Article The Pure Parsimony Haplotyping (PPH) problem is a NP-hard combinatorial optimization problem that consists of finding the minimum number of haplotypes necessary to explain a given set of genotypes. PPH has attracted more and more attention in recent years due to its importance in analysis of many fine-scale genetic data. Its application fields range from mapping complex disease genes to inferring population histories, passing through designing drugs, functional genomics and pharmacogenetics. In this article we investigate, for the first time, a recent version of PPH called the Pure Parsimony Haplotype problem under Uncertain Data (PPH-UD). This version mainly arises when the input genotypes are not accurate, i.e., when some single nucleotide polymorphisms are missing or affected by errors. We propose an exact approach to solution of PPH-UD based on an extended version of Catanzaro et al. [1] class representative model for PPH, currently the state-of-the-art integer programming model for PPH. The model is efficient, accurate, compact, polynomial-sized, easy to implement, solvable with any solver for mixed integer programming, and usable in all those cases for which the parsimony criterion is well suited for haplotype estimation. Public Library of Science 2011-03-25 /pmc/articles/PMC3064666/ /pubmed/21464966 http://dx.doi.org/10.1371/journal.pone.0017937 Text en Catanzaro et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Catanzaro, Daniele
Labbé, Martine
Porretta, Luciano
A Class Representative Model for Pure Parsimony Haplotyping under Uncertain Data
title A Class Representative Model for Pure Parsimony Haplotyping under Uncertain Data
title_full A Class Representative Model for Pure Parsimony Haplotyping under Uncertain Data
title_fullStr A Class Representative Model for Pure Parsimony Haplotyping under Uncertain Data
title_full_unstemmed A Class Representative Model for Pure Parsimony Haplotyping under Uncertain Data
title_short A Class Representative Model for Pure Parsimony Haplotyping under Uncertain Data
title_sort class representative model for pure parsimony haplotyping under uncertain data
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3064666/
https://www.ncbi.nlm.nih.gov/pubmed/21464966
http://dx.doi.org/10.1371/journal.pone.0017937
work_keys_str_mv AT catanzarodaniele aclassrepresentativemodelforpureparsimonyhaplotypingunderuncertaindata
AT labbemartine aclassrepresentativemodelforpureparsimonyhaplotypingunderuncertaindata
AT porrettaluciano aclassrepresentativemodelforpureparsimonyhaplotypingunderuncertaindata
AT catanzarodaniele classrepresentativemodelforpureparsimonyhaplotypingunderuncertaindata
AT labbemartine classrepresentativemodelforpureparsimonyhaplotypingunderuncertaindata
AT porrettaluciano classrepresentativemodelforpureparsimonyhaplotypingunderuncertaindata