Cargando…
MICADo – Looking for Mutations in Targeted PacBio Cancer Data: An Alignment-Free Method
Targeted sequencing is commonly used in clinical application of NGS technology since it enables generation of sufficient sequencing depth in the targeted genes of interest and thus ensures the best possible downstream analysis. This notwithstanding, the accurate discovery and annotation of disease c...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5143680/ https://www.ncbi.nlm.nih.gov/pubmed/28008336 http://dx.doi.org/10.3389/fgene.2016.00214 |
_version_ | 1782472975531900928 |
---|---|
author | Rudewicz, Justine Soueidan, Hayssam Uricaru, Raluca Bonnefoi, Hervé Iggo, Richard Bergh, Jonas Nikolski, Macha |
author_facet | Rudewicz, Justine Soueidan, Hayssam Uricaru, Raluca Bonnefoi, Hervé Iggo, Richard Bergh, Jonas Nikolski, Macha |
author_sort | Rudewicz, Justine |
collection | PubMed |
description | Targeted sequencing is commonly used in clinical application of NGS technology since it enables generation of sufficient sequencing depth in the targeted genes of interest and thus ensures the best possible downstream analysis. This notwithstanding, the accurate discovery and annotation of disease causing mutations remains a challenging problem even in such favorable context. The difficulty is particularly salient in the case of third generation sequencing technology, such as PacBio. We present MICADo, a de Bruijn graph based method, implemented in python, that makes possible to distinguish between patient specific mutations and other alterations for targeted sequencing of a cohort of patients. MICADo analyses NGS reads for each sample within the context of the data of the whole cohort in order to capture the differences between specificities of the sample with respect to the cohort. MICADo is particularly suitable for sequencing data from highly heterogeneous samples, especially when it involves high rates of non-uniform sequencing errors. It was validated on PacBio sequencing datasets from several cohorts of patients. The comparison with two widely used available tools, namely VarScan and GATK, shows that MICADo is more accurate, especially when true mutations have frequencies close to backgound noise. The source code is available at http://github.com/cbib/MICADo. |
format | Online Article Text |
id | pubmed-5143680 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-51436802016-12-22 MICADo – Looking for Mutations in Targeted PacBio Cancer Data: An Alignment-Free Method Rudewicz, Justine Soueidan, Hayssam Uricaru, Raluca Bonnefoi, Hervé Iggo, Richard Bergh, Jonas Nikolski, Macha Front Genet Genetics Targeted sequencing is commonly used in clinical application of NGS technology since it enables generation of sufficient sequencing depth in the targeted genes of interest and thus ensures the best possible downstream analysis. This notwithstanding, the accurate discovery and annotation of disease causing mutations remains a challenging problem even in such favorable context. The difficulty is particularly salient in the case of third generation sequencing technology, such as PacBio. We present MICADo, a de Bruijn graph based method, implemented in python, that makes possible to distinguish between patient specific mutations and other alterations for targeted sequencing of a cohort of patients. MICADo analyses NGS reads for each sample within the context of the data of the whole cohort in order to capture the differences between specificities of the sample with respect to the cohort. MICADo is particularly suitable for sequencing data from highly heterogeneous samples, especially when it involves high rates of non-uniform sequencing errors. It was validated on PacBio sequencing datasets from several cohorts of patients. The comparison with two widely used available tools, namely VarScan and GATK, shows that MICADo is more accurate, especially when true mutations have frequencies close to backgound noise. The source code is available at http://github.com/cbib/MICADo. Frontiers Media S.A. 2016-12-08 /pmc/articles/PMC5143680/ /pubmed/28008336 http://dx.doi.org/10.3389/fgene.2016.00214 Text en Copyright © 2016 Rudewicz, Soueidan, Uricaru, Bonnefoi, Iggo, Bergh and Nikolski. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Genetics Rudewicz, Justine Soueidan, Hayssam Uricaru, Raluca Bonnefoi, Hervé Iggo, Richard Bergh, Jonas Nikolski, Macha MICADo – Looking for Mutations in Targeted PacBio Cancer Data: An Alignment-Free Method |
title | MICADo – Looking for Mutations in Targeted PacBio Cancer Data: An Alignment-Free Method |
title_full | MICADo – Looking for Mutations in Targeted PacBio Cancer Data: An Alignment-Free Method |
title_fullStr | MICADo – Looking for Mutations in Targeted PacBio Cancer Data: An Alignment-Free Method |
title_full_unstemmed | MICADo – Looking for Mutations in Targeted PacBio Cancer Data: An Alignment-Free Method |
title_short | MICADo – Looking for Mutations in Targeted PacBio Cancer Data: An Alignment-Free Method |
title_sort | micado – looking for mutations in targeted pacbio cancer data: an alignment-free method |
topic | Genetics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5143680/ https://www.ncbi.nlm.nih.gov/pubmed/28008336 http://dx.doi.org/10.3389/fgene.2016.00214 |
work_keys_str_mv | AT rudewiczjustine micadolookingformutationsintargetedpacbiocancerdataanalignmentfreemethod AT soueidanhayssam micadolookingformutationsintargetedpacbiocancerdataanalignmentfreemethod AT uricaruraluca micadolookingformutationsintargetedpacbiocancerdataanalignmentfreemethod AT bonnefoiherve micadolookingformutationsintargetedpacbiocancerdataanalignmentfreemethod AT iggorichard micadolookingformutationsintargetedpacbiocancerdataanalignmentfreemethod AT berghjonas micadolookingformutationsintargetedpacbiocancerdataanalignmentfreemethod AT nikolskimacha micadolookingformutationsintargetedpacbiocancerdataanalignmentfreemethod |