Cargando…
BreakNet: detecting deletions using long reads and a deep learning approach
BACKGROUND: Structural variations (SVs) occupy a prominent position in human genetic diversity, and deletions form an important type of SV that has been suggested to be associated with genetic diseases. Although various deletion calling methods based on long reads have been proposed, a new approach...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8641175/ https://www.ncbi.nlm.nih.gov/pubmed/34856923 http://dx.doi.org/10.1186/s12859-021-04499-5 |
_version_ | 1784609454966177792 |
---|---|
author | Luo, Junwei Ding, Hongyu Shen, Jiquan Zhai, Haixia Wu, Zhengjiang Yan, Chaokun Luo, Huimin |
author_facet | Luo, Junwei Ding, Hongyu Shen, Jiquan Zhai, Haixia Wu, Zhengjiang Yan, Chaokun Luo, Huimin |
author_sort | Luo, Junwei |
collection | PubMed |
description | BACKGROUND: Structural variations (SVs) occupy a prominent position in human genetic diversity, and deletions form an important type of SV that has been suggested to be associated with genetic diseases. Although various deletion calling methods based on long reads have been proposed, a new approach is still needed to mine features in long-read alignment information. Recently, deep learning has attracted much attention in genome analysis, and it is a promising technique for calling SVs. RESULTS: In this paper, we propose BreakNet, a deep learning method that detects deletions by using long reads. BreakNet first extracts feature matrices from long-read alignments. Second, it uses a time-distributed convolutional neural network (CNN) to integrate and map the feature matrices to feature vectors. Third, BreakNet employs a bidirectional long short-term memory (BLSTM) model to analyse the produced set of continuous feature vectors in both the forward and backward directions. Finally, a classification module determines whether a region refers to a deletion. On real long-read sequencing datasets, we demonstrate that BreakNet outperforms Sniffles, SVIM and cuteSV in terms of their F1 scores. The source code for the proposed method is available from GitHub at https://github.com/luojunwei/BreakNet. CONCLUSIONS: Our work shows that deep learning can be combined with long reads to call deletions more effectively than existing methods. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-021-04499-5. |
format | Online Article Text |
id | pubmed-8641175 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-86411752021-12-06 BreakNet: detecting deletions using long reads and a deep learning approach Luo, Junwei Ding, Hongyu Shen, Jiquan Zhai, Haixia Wu, Zhengjiang Yan, Chaokun Luo, Huimin BMC Bioinformatics Research BACKGROUND: Structural variations (SVs) occupy a prominent position in human genetic diversity, and deletions form an important type of SV that has been suggested to be associated with genetic diseases. Although various deletion calling methods based on long reads have been proposed, a new approach is still needed to mine features in long-read alignment information. Recently, deep learning has attracted much attention in genome analysis, and it is a promising technique for calling SVs. RESULTS: In this paper, we propose BreakNet, a deep learning method that detects deletions by using long reads. BreakNet first extracts feature matrices from long-read alignments. Second, it uses a time-distributed convolutional neural network (CNN) to integrate and map the feature matrices to feature vectors. Third, BreakNet employs a bidirectional long short-term memory (BLSTM) model to analyse the produced set of continuous feature vectors in both the forward and backward directions. Finally, a classification module determines whether a region refers to a deletion. On real long-read sequencing datasets, we demonstrate that BreakNet outperforms Sniffles, SVIM and cuteSV in terms of their F1 scores. The source code for the proposed method is available from GitHub at https://github.com/luojunwei/BreakNet. CONCLUSIONS: Our work shows that deep learning can be combined with long reads to call deletions more effectively than existing methods. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-021-04499-5. BioMed Central 2021-12-02 /pmc/articles/PMC8641175/ /pubmed/34856923 http://dx.doi.org/10.1186/s12859-021-04499-5 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Luo, Junwei Ding, Hongyu Shen, Jiquan Zhai, Haixia Wu, Zhengjiang Yan, Chaokun Luo, Huimin BreakNet: detecting deletions using long reads and a deep learning approach |
title | BreakNet: detecting deletions using long reads and a deep learning approach |
title_full | BreakNet: detecting deletions using long reads and a deep learning approach |
title_fullStr | BreakNet: detecting deletions using long reads and a deep learning approach |
title_full_unstemmed | BreakNet: detecting deletions using long reads and a deep learning approach |
title_short | BreakNet: detecting deletions using long reads and a deep learning approach |
title_sort | breaknet: detecting deletions using long reads and a deep learning approach |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8641175/ https://www.ncbi.nlm.nih.gov/pubmed/34856923 http://dx.doi.org/10.1186/s12859-021-04499-5 |
work_keys_str_mv | AT luojunwei breaknetdetectingdeletionsusinglongreadsandadeeplearningapproach AT dinghongyu breaknetdetectingdeletionsusinglongreadsandadeeplearningapproach AT shenjiquan breaknetdetectingdeletionsusinglongreadsandadeeplearningapproach AT zhaihaixia breaknetdetectingdeletionsusinglongreadsandadeeplearningapproach AT wuzhengjiang breaknetdetectingdeletionsusinglongreadsandadeeplearningapproach AT yanchaokun breaknetdetectingdeletionsusinglongreadsandadeeplearningapproach AT luohuimin breaknetdetectingdeletionsusinglongreadsandadeeplearningapproach |