BreakNet: detecting deletions using long reads and a deep learning approach

Bibliographic Details
Main authors: Luo, Junwei; Ding, Hongyu; Shen, Jiquan; Zhai, Haixia; Wu, Zhengjiang; Yan, Chaokun; Luo, Huimin
Format: Online Article Text
Language: English
Published: BioMed Central 2021
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8641175/
https://www.ncbi.nlm.nih.gov/pubmed/34856923
http://dx.doi.org/10.1186/s12859-021-04499-5
Description
Summary:
BACKGROUND: Structural variations (SVs) account for a large share of human genetic diversity, and deletions are an important type of SV that has been associated with genetic diseases. Although various long-read deletion calling methods have been proposed, a new approach is still needed to mine the features contained in long-read alignment information. Deep learning has recently attracted much attention in genome analysis and is a promising technique for calling SVs.
RESULTS: In this paper, we propose BreakNet, a deep learning method that detects deletions from long reads. First, BreakNet extracts feature matrices from long-read alignments. Second, it uses a time-distributed convolutional neural network (CNN) to integrate the feature matrices and map them to feature vectors. Third, it employs a bidirectional long short-term memory (BLSTM) model to analyse the resulting sequence of feature vectors in both the forward and backward directions. Finally, a classification module determines whether a region contains a deletion. On real long-read sequencing datasets, BreakNet achieves higher F1 scores than Sniffles, SVIM and cuteSV. The source code is available on GitHub at https://github.com/luojunwei/BreakNet.
CONCLUSIONS: Our work shows that deep learning can be combined with long reads to call deletions more effectively than existing methods.
SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-021-04499-5.
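
The comparison with Sniffles, SVIM and cuteSV is stated in terms of F1 score, the harmonic mean of precision and recall. The following is a minimal sketch of how that metric is computed from benchmark counts. The `CallCounts` container and its field names are hypothetical and do not come from BreakNet's code, and any numbers passed to it here are purely illustrative rather than results from the paper.

```python
from dataclasses import dataclass


@dataclass
class CallCounts:
    """Hypothetical container for the benchmark counts of one deletion caller."""
    true_positives: int   # called deletions that match a truth-set deletion
    false_positives: int  # called deletions with no truth-set match
    false_negatives: int  # truth-set deletions the caller missed


def precision(c: CallCounts) -> float:
    """Fraction of called deletions that are correct."""
    called = c.true_positives + c.false_positives
    return c.true_positives / called if called else 0.0


def recall(c: CallCounts) -> float:
    """Fraction of truth-set deletions that were recovered."""
    truth = c.true_positives + c.false_negatives
    return c.true_positives / truth if truth else 0.0


def f1_score(c: CallCounts) -> float:
    """Harmonic mean of precision and recall; 0.0 when both are zero."""
    p, r = precision(c), recall(c)
    return 2 * p * r / (p + r) if (p + r) else 0.0
```

For example, `f1_score(CallCounts(true_positives=90, false_positives=10, false_negatives=30))` combines a precision of 0.90 and a recall of 0.75 into a single score, so a caller cannot rank highly by being strong on only one of the two.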