Cargando…

RFcaller: a machine learning approach combined with read-level features to detect somatic mutations

The cost reduction in sequencing and the extensive genomic characterization of a wide variety of cancers are expanding tumor sequencing to a wide number of research groups and the clinical practice. Although specific pipelines have been generated for the identification of somatic mutations, their re...

Descripción completa

Detalles Bibliográficos
Autores principales: Díaz-Navarro, Ander, Bousquets-Muñoz, Pablo, Nadeu, Ferran, López-Tamargo, Sara, Beà, Silvia, Campo, Elias, Puente, Xose S
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10227442/
https://www.ncbi.nlm.nih.gov/pubmed/37260508
http://dx.doi.org/10.1093/nargab/lqad056
_version_ 1785050773161246720
author Díaz-Navarro, Ander
Bousquets-Muñoz, Pablo
Nadeu, Ferran
López-Tamargo, Sara
Beà, Silvia
Campo, Elias
Puente, Xose S
author_facet Díaz-Navarro, Ander
Bousquets-Muñoz, Pablo
Nadeu, Ferran
López-Tamargo, Sara
Beà, Silvia
Campo, Elias
Puente, Xose S
author_sort Díaz-Navarro, Ander
collection PubMed
description The cost reduction in sequencing and the extensive genomic characterization of a wide variety of cancers are expanding tumor sequencing to a wide number of research groups and the clinical practice. Although specific pipelines have been generated for the identification of somatic mutations, their results usually differ considerably, and a common approach is to use several callers to achieve a more reliable set of mutations. This procedure is computationally expensive and time-consuming, and it suffers from the same limitations in sensitivity and specificity as other approaches. Expert revision of mutant calls is therefore required to verify calls that might be used for clinical diagnosis. This step could take advantage of machine learning techniques, as they provide a useful approach to incorporate expert-reviewed information for the identification of somatic mutations. Here we present RFcaller, a pipeline based on machine learning algorithms, for the detection of somatic mutations in tumor–normal paired samples that does not require large computing resources. RFcaller shows high accuracy for the detection of substitutions and insertions/deletions from whole genome or exome data. It allows the detection of mutations in driver genes missed by other approaches, and has been validated by comparison to deep and Sanger sequencing.
format Online
Article
Text
id pubmed-10227442
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-102274422023-05-31 RFcaller: a machine learning approach combined with read-level features to detect somatic mutations Díaz-Navarro, Ander Bousquets-Muñoz, Pablo Nadeu, Ferran López-Tamargo, Sara Beà, Silvia Campo, Elias Puente, Xose S NAR Genom Bioinform Standard Article The cost reduction in sequencing and the extensive genomic characterization of a wide variety of cancers are expanding tumor sequencing to a wide number of research groups and the clinical practice. Although specific pipelines have been generated for the identification of somatic mutations, their results usually differ considerably, and a common approach is to use several callers to achieve a more reliable set of mutations. This procedure is computationally expensive and time-consuming, and it suffers from the same limitations in sensitivity and specificity as other approaches. Expert revision of mutant calls is therefore required to verify calls that might be used for clinical diagnosis. This step could take advantage of machine learning techniques, as they provide a useful approach to incorporate expert-reviewed information for the identification of somatic mutations. Here we present RFcaller, a pipeline based on machine learning algorithms, for the detection of somatic mutations in tumor–normal paired samples that does not require large computing resources. RFcaller shows high accuracy for the detection of substitutions and insertions/deletions from whole genome or exome data. It allows the detection of mutations in driver genes missed by other approaches, and has been validated by comparison to deep and Sanger sequencing. Oxford University Press 2023-05-30 /pmc/articles/PMC10227442/ /pubmed/37260508 http://dx.doi.org/10.1093/nargab/lqad056 Text en © The Author(s) 2023. Published by Oxford University Press on behalf of NAR Genomics and Bioinformatics. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Standard Article
Díaz-Navarro, Ander
Bousquets-Muñoz, Pablo
Nadeu, Ferran
López-Tamargo, Sara
Beà, Silvia
Campo, Elias
Puente, Xose S
RFcaller: a machine learning approach combined with read-level features to detect somatic mutations
title RFcaller: a machine learning approach combined with read-level features to detect somatic mutations
title_full RFcaller: a machine learning approach combined with read-level features to detect somatic mutations
title_fullStr RFcaller: a machine learning approach combined with read-level features to detect somatic mutations
title_full_unstemmed RFcaller: a machine learning approach combined with read-level features to detect somatic mutations
title_short RFcaller: a machine learning approach combined with read-level features to detect somatic mutations
title_sort rfcaller: a machine learning approach combined with read-level features to detect somatic mutations
topic Standard Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10227442/
https://www.ncbi.nlm.nih.gov/pubmed/37260508
http://dx.doi.org/10.1093/nargab/lqad056
work_keys_str_mv AT diaznavarroander rfcalleramachinelearningapproachcombinedwithreadlevelfeaturestodetectsomaticmutations
AT bousquetsmunozpablo rfcalleramachinelearningapproachcombinedwithreadlevelfeaturestodetectsomaticmutations
AT nadeuferran rfcalleramachinelearningapproachcombinedwithreadlevelfeaturestodetectsomaticmutations
AT lopeztamargosara rfcalleramachinelearningapproachcombinedwithreadlevelfeaturestodetectsomaticmutations
AT beasilvia rfcalleramachinelearningapproachcombinedwithreadlevelfeaturestodetectsomaticmutations
AT campoelias rfcalleramachinelearningapproachcombinedwithreadlevelfeaturestodetectsomaticmutations
AT puentexoses rfcalleramachinelearningapproachcombinedwithreadlevelfeaturestodetectsomaticmutations