Cargando…
Estimating population size via line graph reconstruction
BACKGROUND: We propose a novel graph theoretic method to estimate haplotype population size from genotype data. The method considers only the potential sharing of haplotypes between individuals and is based on transforming the graph of potential haplotype sharing into a line graph using a minimum nu...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2013
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3716786/ https://www.ncbi.nlm.nih.gov/pubmed/23829669 http://dx.doi.org/10.1186/1748-7188-8-17 |
_version_ | 1782277598465753088 |
---|---|
author | Halldórsson, Bjarni V Blokh, Dima Sharan, Roded |
author_facet | Halldórsson, Bjarni V Blokh, Dima Sharan, Roded |
author_sort | Halldórsson, Bjarni V |
collection | PubMed |
description | BACKGROUND: We propose a novel graph theoretic method to estimate haplotype population size from genotype data. The method considers only the potential sharing of haplotypes between individuals and is based on transforming the graph of potential haplotype sharing into a line graph using a minimum number of edge and vertex deletions. RESULTS: We show that the resulting line graph deletion problems are NP complete and provide exact integer programming solutions for them. We test our approach using extensive simulations of multiple population evolution and genotypes sampling scenarios. Our results also indicate that the method may be useful in comparing populations and it may be used as a first step in a method for haplotype phasing. CONCLUSIONS: Our computational experiments show that when most of the sharings are true sharings the problem can be solved very fast and the estimated size is very close to the true size; when many of the potential sharings do not stem from true haplotype sharing, our method gives reasonable lower bounds on the underlying number of haplotypes. In comparison, a naive approach of phasing the input genotypes provides trivial upper bounds of twice the number of genotypes. |
format | Online Article Text |
id | pubmed-3716786 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2013 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-37167862013-07-23 Estimating population size via line graph reconstruction Halldórsson, Bjarni V Blokh, Dima Sharan, Roded Algorithms Mol Biol Research BACKGROUND: We propose a novel graph theoretic method to estimate haplotype population size from genotype data. The method considers only the potential sharing of haplotypes between individuals and is based on transforming the graph of potential haplotype sharing into a line graph using a minimum number of edge and vertex deletions. RESULTS: We show that the resulting line graph deletion problems are NP complete and provide exact integer programming solutions for them. We test our approach using extensive simulations of multiple population evolution and genotypes sampling scenarios. Our results also indicate that the method may be useful in comparing populations and it may be used as a first step in a method for haplotype phasing. CONCLUSIONS: Our computational experiments show that when most of the sharings are true sharings the problem can be solved very fast and the estimated size is very close to the true size; when many of the potential sharings do not stem from true haplotype sharing, our method gives reasonable lower bounds on the underlying number of haplotypes. In comparison, a naive approach of phasing the input genotypes provides trivial upper bounds of twice the number of genotypes. BioMed Central 2013-07-05 /pmc/articles/PMC3716786/ /pubmed/23829669 http://dx.doi.org/10.1186/1748-7188-8-17 Text en Copyright © 2013 Halldórsson et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Halldórsson, Bjarni V Blokh, Dima Sharan, Roded Estimating population size via line graph reconstruction |
title | Estimating population size via line graph reconstruction |
title_full | Estimating population size via line graph reconstruction |
title_fullStr | Estimating population size via line graph reconstruction |
title_full_unstemmed | Estimating population size via line graph reconstruction |
title_short | Estimating population size via line graph reconstruction |
title_sort | estimating population size via line graph reconstruction |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3716786/ https://www.ncbi.nlm.nih.gov/pubmed/23829669 http://dx.doi.org/10.1186/1748-7188-8-17 |
work_keys_str_mv | AT halldorssonbjarniv estimatingpopulationsizevialinegraphreconstruction AT blokhdima estimatingpopulationsizevialinegraphreconstruction AT sharanroded estimatingpopulationsizevialinegraphreconstruction |