Cargando…

cFinder: definition and quantification of multiple haplotypes in a mixed sample

BACKGROUND: Next-generation sequencing allows for determining the genetic composition of a mixed sample. For instance, when performing resistance testing for BCR-ABL1 it is necessary to identify clones and define compound mutations; together with an exact quantification this may complement diagnosis...

Descripción completa

Detalles Bibliográficos
Autores principales: Niklas, Norbert, Hafenscher, Julia, Barna, Agnes, Wiesinger, Karin, Pröll, Johannes, Dreiseitl, Stephan, Preuner-Stix, Sandra, Valent, Peter, Lion, Thomas, Gabriel, Christian
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4562109/
https://www.ncbi.nlm.nih.gov/pubmed/26346608
http://dx.doi.org/10.1186/s13104-015-1382-7
_version_ 1782389114807517184
author Niklas, Norbert
Hafenscher, Julia
Barna, Agnes
Wiesinger, Karin
Pröll, Johannes
Dreiseitl, Stephan
Preuner-Stix, Sandra
Valent, Peter
Lion, Thomas
Gabriel, Christian
author_facet Niklas, Norbert
Hafenscher, Julia
Barna, Agnes
Wiesinger, Karin
Pröll, Johannes
Dreiseitl, Stephan
Preuner-Stix, Sandra
Valent, Peter
Lion, Thomas
Gabriel, Christian
author_sort Niklas, Norbert
collection PubMed
description BACKGROUND: Next-generation sequencing allows for determining the genetic composition of a mixed sample. For instance, when performing resistance testing for BCR-ABL1 it is necessary to identify clones and define compound mutations; together with an exact quantification this may complement diagnosis and therapy decisions with additional information. Moreover, that applies not only to oncological issues but also determination of viral, bacterial or fungal infection. The efforts to retrieve multiple haplotypes (more than two) and proportion information from data with conventional software are difficult, cumbersome and demand multiple manual steps. RESULTS: Therefore, we developed a tool called cFinder that is capable of automatic detection of haplotypes and their accurate quantification within one sample. BCR-ABL1 samples containing multiple clones were used for testing and our cFinder could identify all previously found clones together with their abundance and even refine some results. Additionally, reads were simulated using GemSIM with multiple haplotypes, the detection was very close to linear (R(2) = 0.96). Our aim is not to deduce haploblocks over statistics, but to characterize one sample’s composition precisely. As a result the cFinder reports the connections of variants (haplotypes) with their readcount and relative occurrence (percentage). Download is available at http://sourceforge.net/projects/cfinder/. CONCLUSIONS: Our cFinder is implemented in an efficient algorithm that can be run on a low-performance desktop computer. Furthermore, it considers paired-end information (if available) and is generally open for any current next-generation sequencing technology and alignment strategy. To our knowledge, this is the first software that enables researchers without extensive bioinformatic support to designate multiple haplotypes and how they constitute to a sample. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13104-015-1382-7) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4562109
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-45621092015-09-09 cFinder: definition and quantification of multiple haplotypes in a mixed sample Niklas, Norbert Hafenscher, Julia Barna, Agnes Wiesinger, Karin Pröll, Johannes Dreiseitl, Stephan Preuner-Stix, Sandra Valent, Peter Lion, Thomas Gabriel, Christian BMC Res Notes Technical Note BACKGROUND: Next-generation sequencing allows for determining the genetic composition of a mixed sample. For instance, when performing resistance testing for BCR-ABL1 it is necessary to identify clones and define compound mutations; together with an exact quantification this may complement diagnosis and therapy decisions with additional information. Moreover, that applies not only to oncological issues but also determination of viral, bacterial or fungal infection. The efforts to retrieve multiple haplotypes (more than two) and proportion information from data with conventional software are difficult, cumbersome and demand multiple manual steps. RESULTS: Therefore, we developed a tool called cFinder that is capable of automatic detection of haplotypes and their accurate quantification within one sample. BCR-ABL1 samples containing multiple clones were used for testing and our cFinder could identify all previously found clones together with their abundance and even refine some results. Additionally, reads were simulated using GemSIM with multiple haplotypes, the detection was very close to linear (R(2) = 0.96). Our aim is not to deduce haploblocks over statistics, but to characterize one sample’s composition precisely. As a result the cFinder reports the connections of variants (haplotypes) with their readcount and relative occurrence (percentage). Download is available at http://sourceforge.net/projects/cfinder/. CONCLUSIONS: Our cFinder is implemented in an efficient algorithm that can be run on a low-performance desktop computer. Furthermore, it considers paired-end information (if available) and is generally open for any current next-generation sequencing technology and alignment strategy. To our knowledge, this is the first software that enables researchers without extensive bioinformatic support to designate multiple haplotypes and how they constitute to a sample. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13104-015-1382-7) contains supplementary material, which is available to authorized users. BioMed Central 2015-09-07 /pmc/articles/PMC4562109/ /pubmed/26346608 http://dx.doi.org/10.1186/s13104-015-1382-7 Text en © Niklas et al. 2015 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Technical Note
Niklas, Norbert
Hafenscher, Julia
Barna, Agnes
Wiesinger, Karin
Pröll, Johannes
Dreiseitl, Stephan
Preuner-Stix, Sandra
Valent, Peter
Lion, Thomas
Gabriel, Christian
cFinder: definition and quantification of multiple haplotypes in a mixed sample
title cFinder: definition and quantification of multiple haplotypes in a mixed sample
title_full cFinder: definition and quantification of multiple haplotypes in a mixed sample
title_fullStr cFinder: definition and quantification of multiple haplotypes in a mixed sample
title_full_unstemmed cFinder: definition and quantification of multiple haplotypes in a mixed sample
title_short cFinder: definition and quantification of multiple haplotypes in a mixed sample
title_sort cfinder: definition and quantification of multiple haplotypes in a mixed sample
topic Technical Note
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4562109/
https://www.ncbi.nlm.nih.gov/pubmed/26346608
http://dx.doi.org/10.1186/s13104-015-1382-7
work_keys_str_mv AT niklasnorbert cfinderdefinitionandquantificationofmultiplehaplotypesinamixedsample
AT hafenscherjulia cfinderdefinitionandquantificationofmultiplehaplotypesinamixedsample
AT barnaagnes cfinderdefinitionandquantificationofmultiplehaplotypesinamixedsample
AT wiesingerkarin cfinderdefinitionandquantificationofmultiplehaplotypesinamixedsample
AT prolljohannes cfinderdefinitionandquantificationofmultiplehaplotypesinamixedsample
AT dreiseitlstephan cfinderdefinitionandquantificationofmultiplehaplotypesinamixedsample
AT preunerstixsandra cfinderdefinitionandquantificationofmultiplehaplotypesinamixedsample
AT valentpeter cfinderdefinitionandquantificationofmultiplehaplotypesinamixedsample
AT lionthomas cfinderdefinitionandquantificationofmultiplehaplotypesinamixedsample
AT gabrielchristian cfinderdefinitionandquantificationofmultiplehaplotypesinamixedsample