DeepSom: a CNN-based approach to somatic variant calling in WGS samples without a matched normal

MOTIVATION: Somatic mutations are usually called by analyzing the DNA sequence of a tumor sample in conjunction with a matched normal. However, a matched normal is not always available, for instance, in retrospective analysis or diagnostic settings. For such cases, tumor-only somatic variant calling...

Descripción completa

Detalles Bibliográficos
Autores principales: Vilov, Sergey, Heinig, Matthias
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9843587/
https://www.ncbi.nlm.nih.gov/pubmed/36637201
http://dx.doi.org/10.1093/bioinformatics/btac828
Descripción
Sumario:MOTIVATION: Somatic mutations are usually called by analyzing the DNA sequence of a tumor sample in conjunction with a matched normal. However, a matched normal is not always available, for instance, in retrospective analysis or diagnostic settings. For such cases, tumor-only somatic variant calling tools need to be designed. Previously proposed approaches demonstrate inferior performance on whole-genome sequencing (WGS) samples. RESULTS: We present the convolutional neural network-based approach called DeepSom for detecting somatic single nucleotide polymorphism and short insertion and deletion variants in tumor WGS samples without a matched normal. We validate DeepSom by reporting its performance on five different cancer datasets. We also demonstrate that on WGS samples DeepSom outperforms previously proposed methods for tumor-only somatic variant calling. AVAILABILITY AND IMPLEMENTATION: DeepSom is available as a GitHub repository at https://github.com/heiniglab/DeepSom. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.