Cargando…

Estimating population size via line graph reconstruction

BACKGROUND: We propose a novel graph theoretic method to estimate haplotype population size from genotype data. The method considers only the potential sharing of haplotypes between individuals and is based on transforming the graph of potential haplotype sharing into a line graph using a minimum nu...

Descripción completa

Detalles Bibliográficos
Autores principales: Halldórsson, Bjarni V, Blokh, Dima, Sharan, Roded
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3716786/
https://www.ncbi.nlm.nih.gov/pubmed/23829669
http://dx.doi.org/10.1186/1748-7188-8-17
_version_ 1782277598465753088
author Halldórsson, Bjarni V
Blokh, Dima
Sharan, Roded
author_facet Halldórsson, Bjarni V
Blokh, Dima
Sharan, Roded
author_sort Halldórsson, Bjarni V
collection PubMed
description BACKGROUND: We propose a novel graph theoretic method to estimate haplotype population size from genotype data. The method considers only the potential sharing of haplotypes between individuals and is based on transforming the graph of potential haplotype sharing into a line graph using a minimum number of edge and vertex deletions. RESULTS: We show that the resulting line graph deletion problems are NP complete and provide exact integer programming solutions for them. We test our approach using extensive simulations of multiple population evolution and genotypes sampling scenarios. Our results also indicate that the method may be useful in comparing populations and it may be used as a first step in a method for haplotype phasing. CONCLUSIONS: Our computational experiments show that when most of the sharings are true sharings the problem can be solved very fast and the estimated size is very close to the true size; when many of the potential sharings do not stem from true haplotype sharing, our method gives reasonable lower bounds on the underlying number of haplotypes. In comparison, a naive approach of phasing the input genotypes provides trivial upper bounds of twice the number of genotypes.
format Online
Article
Text
id pubmed-3716786
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-37167862013-07-23 Estimating population size via line graph reconstruction Halldórsson, Bjarni V Blokh, Dima Sharan, Roded Algorithms Mol Biol Research BACKGROUND: We propose a novel graph theoretic method to estimate haplotype population size from genotype data. The method considers only the potential sharing of haplotypes between individuals and is based on transforming the graph of potential haplotype sharing into a line graph using a minimum number of edge and vertex deletions. RESULTS: We show that the resulting line graph deletion problems are NP complete and provide exact integer programming solutions for them. We test our approach using extensive simulations of multiple population evolution and genotypes sampling scenarios. Our results also indicate that the method may be useful in comparing populations and it may be used as a first step in a method for haplotype phasing. CONCLUSIONS: Our computational experiments show that when most of the sharings are true sharings the problem can be solved very fast and the estimated size is very close to the true size; when many of the potential sharings do not stem from true haplotype sharing, our method gives reasonable lower bounds on the underlying number of haplotypes. In comparison, a naive approach of phasing the input genotypes provides trivial upper bounds of twice the number of genotypes. BioMed Central 2013-07-05 /pmc/articles/PMC3716786/ /pubmed/23829669 http://dx.doi.org/10.1186/1748-7188-8-17 Text en Copyright © 2013 Halldórsson et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research
Halldórsson, Bjarni V
Blokh, Dima
Sharan, Roded
Estimating population size via line graph reconstruction
title Estimating population size via line graph reconstruction
title_full Estimating population size via line graph reconstruction
title_fullStr Estimating population size via line graph reconstruction
title_full_unstemmed Estimating population size via line graph reconstruction
title_short Estimating population size via line graph reconstruction
title_sort estimating population size via line graph reconstruction
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3716786/
https://www.ncbi.nlm.nih.gov/pubmed/23829669
http://dx.doi.org/10.1186/1748-7188-8-17
work_keys_str_mv AT halldorssonbjarniv estimatingpopulationsizevialinegraphreconstruction
AT blokhdima estimatingpopulationsizevialinegraphreconstruction
AT sharanroded estimatingpopulationsizevialinegraphreconstruction