Cargando…

MICADo – Looking for Mutations in Targeted PacBio Cancer Data: An Alignment-Free Method

Targeted sequencing is commonly used in clinical application of NGS technology since it enables generation of sufficient sequencing depth in the targeted genes of interest and thus ensures the best possible downstream analysis. This notwithstanding, the accurate discovery and annotation of disease c...

Descripción completa

Detalles Bibliográficos
Autores principales: Rudewicz, Justine, Soueidan, Hayssam, Uricaru, Raluca, Bonnefoi, Hervé, Iggo, Richard, Bergh, Jonas, Nikolski, Macha
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5143680/
https://www.ncbi.nlm.nih.gov/pubmed/28008336
http://dx.doi.org/10.3389/fgene.2016.00214
_version_ 1782472975531900928
author Rudewicz, Justine
Soueidan, Hayssam
Uricaru, Raluca
Bonnefoi, Hervé
Iggo, Richard
Bergh, Jonas
Nikolski, Macha
author_facet Rudewicz, Justine
Soueidan, Hayssam
Uricaru, Raluca
Bonnefoi, Hervé
Iggo, Richard
Bergh, Jonas
Nikolski, Macha
author_sort Rudewicz, Justine
collection PubMed
description Targeted sequencing is commonly used in clinical application of NGS technology since it enables generation of sufficient sequencing depth in the targeted genes of interest and thus ensures the best possible downstream analysis. This notwithstanding, the accurate discovery and annotation of disease causing mutations remains a challenging problem even in such favorable context. The difficulty is particularly salient in the case of third generation sequencing technology, such as PacBio. We present MICADo, a de Bruijn graph based method, implemented in python, that makes possible to distinguish between patient specific mutations and other alterations for targeted sequencing of a cohort of patients. MICADo analyses NGS reads for each sample within the context of the data of the whole cohort in order to capture the differences between specificities of the sample with respect to the cohort. MICADo is particularly suitable for sequencing data from highly heterogeneous samples, especially when it involves high rates of non-uniform sequencing errors. It was validated on PacBio sequencing datasets from several cohorts of patients. The comparison with two widely used available tools, namely VarScan and GATK, shows that MICADo is more accurate, especially when true mutations have frequencies close to backgound noise. The source code is available at http://github.com/cbib/MICADo.
format Online
Article
Text
id pubmed-5143680
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-51436802016-12-22 MICADo – Looking for Mutations in Targeted PacBio Cancer Data: An Alignment-Free Method Rudewicz, Justine Soueidan, Hayssam Uricaru, Raluca Bonnefoi, Hervé Iggo, Richard Bergh, Jonas Nikolski, Macha Front Genet Genetics Targeted sequencing is commonly used in clinical application of NGS technology since it enables generation of sufficient sequencing depth in the targeted genes of interest and thus ensures the best possible downstream analysis. This notwithstanding, the accurate discovery and annotation of disease causing mutations remains a challenging problem even in such favorable context. The difficulty is particularly salient in the case of third generation sequencing technology, such as PacBio. We present MICADo, a de Bruijn graph based method, implemented in python, that makes possible to distinguish between patient specific mutations and other alterations for targeted sequencing of a cohort of patients. MICADo analyses NGS reads for each sample within the context of the data of the whole cohort in order to capture the differences between specificities of the sample with respect to the cohort. MICADo is particularly suitable for sequencing data from highly heterogeneous samples, especially when it involves high rates of non-uniform sequencing errors. It was validated on PacBio sequencing datasets from several cohorts of patients. The comparison with two widely used available tools, namely VarScan and GATK, shows that MICADo is more accurate, especially when true mutations have frequencies close to backgound noise. The source code is available at http://github.com/cbib/MICADo. Frontiers Media S.A. 2016-12-08 /pmc/articles/PMC5143680/ /pubmed/28008336 http://dx.doi.org/10.3389/fgene.2016.00214 Text en Copyright © 2016 Rudewicz, Soueidan, Uricaru, Bonnefoi, Iggo, Bergh and Nikolski. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Rudewicz, Justine
Soueidan, Hayssam
Uricaru, Raluca
Bonnefoi, Hervé
Iggo, Richard
Bergh, Jonas
Nikolski, Macha
MICADo – Looking for Mutations in Targeted PacBio Cancer Data: An Alignment-Free Method
title MICADo – Looking for Mutations in Targeted PacBio Cancer Data: An Alignment-Free Method
title_full MICADo – Looking for Mutations in Targeted PacBio Cancer Data: An Alignment-Free Method
title_fullStr MICADo – Looking for Mutations in Targeted PacBio Cancer Data: An Alignment-Free Method
title_full_unstemmed MICADo – Looking for Mutations in Targeted PacBio Cancer Data: An Alignment-Free Method
title_short MICADo – Looking for Mutations in Targeted PacBio Cancer Data: An Alignment-Free Method
title_sort micado – looking for mutations in targeted pacbio cancer data: an alignment-free method
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5143680/
https://www.ncbi.nlm.nih.gov/pubmed/28008336
http://dx.doi.org/10.3389/fgene.2016.00214
work_keys_str_mv AT rudewiczjustine micadolookingformutationsintargetedpacbiocancerdataanalignmentfreemethod
AT soueidanhayssam micadolookingformutationsintargetedpacbiocancerdataanalignmentfreemethod
AT uricaruraluca micadolookingformutationsintargetedpacbiocancerdataanalignmentfreemethod
AT bonnefoiherve micadolookingformutationsintargetedpacbiocancerdataanalignmentfreemethod
AT iggorichard micadolookingformutationsintargetedpacbiocancerdataanalignmentfreemethod
AT berghjonas micadolookingformutationsintargetedpacbiocancerdataanalignmentfreemethod
AT nikolskimacha micadolookingformutationsintargetedpacbiocancerdataanalignmentfreemethod