Cargando…
cFinder: definition and quantification of multiple haplotypes in a mixed sample
BACKGROUND: Next-generation sequencing allows for determining the genetic composition of a mixed sample. For instance, when performing resistance testing for BCR-ABL1 it is necessary to identify clones and define compound mutations; together with an exact quantification this may complement diagnosis...
Autores principales: | , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4562109/ https://www.ncbi.nlm.nih.gov/pubmed/26346608 http://dx.doi.org/10.1186/s13104-015-1382-7 |
_version_ | 1782389114807517184 |
---|---|
author | Niklas, Norbert Hafenscher, Julia Barna, Agnes Wiesinger, Karin Pröll, Johannes Dreiseitl, Stephan Preuner-Stix, Sandra Valent, Peter Lion, Thomas Gabriel, Christian |
author_facet | Niklas, Norbert Hafenscher, Julia Barna, Agnes Wiesinger, Karin Pröll, Johannes Dreiseitl, Stephan Preuner-Stix, Sandra Valent, Peter Lion, Thomas Gabriel, Christian |
author_sort | Niklas, Norbert |
collection | PubMed |
description | BACKGROUND: Next-generation sequencing allows for determining the genetic composition of a mixed sample. For instance, when performing resistance testing for BCR-ABL1 it is necessary to identify clones and define compound mutations; together with an exact quantification this may complement diagnosis and therapy decisions with additional information. Moreover, that applies not only to oncological issues but also determination of viral, bacterial or fungal infection. The efforts to retrieve multiple haplotypes (more than two) and proportion information from data with conventional software are difficult, cumbersome and demand multiple manual steps. RESULTS: Therefore, we developed a tool called cFinder that is capable of automatic detection of haplotypes and their accurate quantification within one sample. BCR-ABL1 samples containing multiple clones were used for testing and our cFinder could identify all previously found clones together with their abundance and even refine some results. Additionally, reads were simulated using GemSIM with multiple haplotypes, the detection was very close to linear (R(2) = 0.96). Our aim is not to deduce haploblocks over statistics, but to characterize one sample’s composition precisely. As a result the cFinder reports the connections of variants (haplotypes) with their readcount and relative occurrence (percentage). Download is available at http://sourceforge.net/projects/cfinder/. CONCLUSIONS: Our cFinder is implemented in an efficient algorithm that can be run on a low-performance desktop computer. Furthermore, it considers paired-end information (if available) and is generally open for any current next-generation sequencing technology and alignment strategy. To our knowledge, this is the first software that enables researchers without extensive bioinformatic support to designate multiple haplotypes and how they constitute to a sample. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13104-015-1382-7) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-4562109 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-45621092015-09-09 cFinder: definition and quantification of multiple haplotypes in a mixed sample Niklas, Norbert Hafenscher, Julia Barna, Agnes Wiesinger, Karin Pröll, Johannes Dreiseitl, Stephan Preuner-Stix, Sandra Valent, Peter Lion, Thomas Gabriel, Christian BMC Res Notes Technical Note BACKGROUND: Next-generation sequencing allows for determining the genetic composition of a mixed sample. For instance, when performing resistance testing for BCR-ABL1 it is necessary to identify clones and define compound mutations; together with an exact quantification this may complement diagnosis and therapy decisions with additional information. Moreover, that applies not only to oncological issues but also determination of viral, bacterial or fungal infection. The efforts to retrieve multiple haplotypes (more than two) and proportion information from data with conventional software are difficult, cumbersome and demand multiple manual steps. RESULTS: Therefore, we developed a tool called cFinder that is capable of automatic detection of haplotypes and their accurate quantification within one sample. BCR-ABL1 samples containing multiple clones were used for testing and our cFinder could identify all previously found clones together with their abundance and even refine some results. Additionally, reads were simulated using GemSIM with multiple haplotypes, the detection was very close to linear (R(2) = 0.96). Our aim is not to deduce haploblocks over statistics, but to characterize one sample’s composition precisely. As a result the cFinder reports the connections of variants (haplotypes) with their readcount and relative occurrence (percentage). Download is available at http://sourceforge.net/projects/cfinder/. CONCLUSIONS: Our cFinder is implemented in an efficient algorithm that can be run on a low-performance desktop computer. Furthermore, it considers paired-end information (if available) and is generally open for any current next-generation sequencing technology and alignment strategy. To our knowledge, this is the first software that enables researchers without extensive bioinformatic support to designate multiple haplotypes and how they constitute to a sample. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13104-015-1382-7) contains supplementary material, which is available to authorized users. BioMed Central 2015-09-07 /pmc/articles/PMC4562109/ /pubmed/26346608 http://dx.doi.org/10.1186/s13104-015-1382-7 Text en © Niklas et al. 2015 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Technical Note Niklas, Norbert Hafenscher, Julia Barna, Agnes Wiesinger, Karin Pröll, Johannes Dreiseitl, Stephan Preuner-Stix, Sandra Valent, Peter Lion, Thomas Gabriel, Christian cFinder: definition and quantification of multiple haplotypes in a mixed sample |
title | cFinder: definition and quantification of multiple haplotypes in a mixed sample |
title_full | cFinder: definition and quantification of multiple haplotypes in a mixed sample |
title_fullStr | cFinder: definition and quantification of multiple haplotypes in a mixed sample |
title_full_unstemmed | cFinder: definition and quantification of multiple haplotypes in a mixed sample |
title_short | cFinder: definition and quantification of multiple haplotypes in a mixed sample |
title_sort | cfinder: definition and quantification of multiple haplotypes in a mixed sample |
topic | Technical Note |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4562109/ https://www.ncbi.nlm.nih.gov/pubmed/26346608 http://dx.doi.org/10.1186/s13104-015-1382-7 |
work_keys_str_mv | AT niklasnorbert cfinderdefinitionandquantificationofmultiplehaplotypesinamixedsample AT hafenscherjulia cfinderdefinitionandquantificationofmultiplehaplotypesinamixedsample AT barnaagnes cfinderdefinitionandquantificationofmultiplehaplotypesinamixedsample AT wiesingerkarin cfinderdefinitionandquantificationofmultiplehaplotypesinamixedsample AT prolljohannes cfinderdefinitionandquantificationofmultiplehaplotypesinamixedsample AT dreiseitlstephan cfinderdefinitionandquantificationofmultiplehaplotypesinamixedsample AT preunerstixsandra cfinderdefinitionandquantificationofmultiplehaplotypesinamixedsample AT valentpeter cfinderdefinitionandquantificationofmultiplehaplotypesinamixedsample AT lionthomas cfinderdefinitionandquantificationofmultiplehaplotypesinamixedsample AT gabrielchristian cfinderdefinitionandquantificationofmultiplehaplotypesinamixedsample |