Cargando…
DNAscent v2: detecting replication forks in nanopore sequencing data with deep learning
BACKGROUND: Measuring DNA replication dynamics with high throughput and single-molecule resolution is critical for understanding both the basic biology behind how cells replicate their DNA and how DNA replication can be used as a therapeutic target for diseases like cancer. In recent years, the dete...
Autor principal: | |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8191041/ https://www.ncbi.nlm.nih.gov/pubmed/34107894 http://dx.doi.org/10.1186/s12864-021-07736-6 |
_version_ | 1783705798415745024 |
---|---|
author | Boemo, Michael A. |
author_facet | Boemo, Michael A. |
author_sort | Boemo, Michael A. |
collection | PubMed |
description | BACKGROUND: Measuring DNA replication dynamics with high throughput and single-molecule resolution is critical for understanding both the basic biology behind how cells replicate their DNA and how DNA replication can be used as a therapeutic target for diseases like cancer. In recent years, the detection of base analogues in Oxford Nanopore Technologies (ONT) sequencing reads has become a promising new method to supersede existing single-molecule methods such as DNA fibre analysis: ONT sequencing yields long reads with high throughput, and sequenced molecules can be mapped to the genome using standard sequence alignment software. RESULTS: This paper introduces DNAscent v2, software that uses a residual neural network to achieve fast, accurate detection of the thymidine analogue BrdU with single-nucleotide resolution. DNAscent v2 also comes equipped with an autoencoder that interprets the pattern of BrdU incorporation on each ONT-sequenced molecule into replication fork direction to call the location of replication origins termination sites. DNAscent v2 surpasses previous versions of DNAscent in BrdU calling accuracy, origin calling accuracy, speed, and versatility across different experimental protocols. Unlike NanoMod, DNAscent v2 positively identifies BrdU without the need for sequencing unmodified DNA. Unlike RepNano, DNAscent v2 calls BrdU with single-nucleotide resolution and detects more origins than RepNano from the same sequencing data. DNAscent v2 is open-source and available at https://github.com/MBoemo/DNAscent. CONCLUSIONS: This paper shows that DNAscent v2 is the new state-of-the-art in the high-throughput, single-molecule detection of replication fork dynamics. These improvements in DNAscent v2 mark an important step towards measuring DNA replication dynamics in large genomes with single-molecule resolution. Looking forward, the increase in accuracy in single-nucleotide resolution BrdU calls will also allow DNAscent v2 to branch out into other areas of genome stability research, particularly the detection of DNA repair. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at (10.1186/s12864-021-07736-6). |
format | Online Article Text |
id | pubmed-8191041 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-81910412021-06-10 DNAscent v2: detecting replication forks in nanopore sequencing data with deep learning Boemo, Michael A. BMC Genomics Software BACKGROUND: Measuring DNA replication dynamics with high throughput and single-molecule resolution is critical for understanding both the basic biology behind how cells replicate their DNA and how DNA replication can be used as a therapeutic target for diseases like cancer. In recent years, the detection of base analogues in Oxford Nanopore Technologies (ONT) sequencing reads has become a promising new method to supersede existing single-molecule methods such as DNA fibre analysis: ONT sequencing yields long reads with high throughput, and sequenced molecules can be mapped to the genome using standard sequence alignment software. RESULTS: This paper introduces DNAscent v2, software that uses a residual neural network to achieve fast, accurate detection of the thymidine analogue BrdU with single-nucleotide resolution. DNAscent v2 also comes equipped with an autoencoder that interprets the pattern of BrdU incorporation on each ONT-sequenced molecule into replication fork direction to call the location of replication origins termination sites. DNAscent v2 surpasses previous versions of DNAscent in BrdU calling accuracy, origin calling accuracy, speed, and versatility across different experimental protocols. Unlike NanoMod, DNAscent v2 positively identifies BrdU without the need for sequencing unmodified DNA. Unlike RepNano, DNAscent v2 calls BrdU with single-nucleotide resolution and detects more origins than RepNano from the same sequencing data. DNAscent v2 is open-source and available at https://github.com/MBoemo/DNAscent. CONCLUSIONS: This paper shows that DNAscent v2 is the new state-of-the-art in the high-throughput, single-molecule detection of replication fork dynamics. These improvements in DNAscent v2 mark an important step towards measuring DNA replication dynamics in large genomes with single-molecule resolution. Looking forward, the increase in accuracy in single-nucleotide resolution BrdU calls will also allow DNAscent v2 to branch out into other areas of genome stability research, particularly the detection of DNA repair. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at (10.1186/s12864-021-07736-6). BioMed Central 2021-06-09 /pmc/articles/PMC8191041/ /pubmed/34107894 http://dx.doi.org/10.1186/s12864-021-07736-6 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Software Boemo, Michael A. DNAscent v2: detecting replication forks in nanopore sequencing data with deep learning |
title | DNAscent v2: detecting replication forks in nanopore sequencing data with deep learning |
title_full | DNAscent v2: detecting replication forks in nanopore sequencing data with deep learning |
title_fullStr | DNAscent v2: detecting replication forks in nanopore sequencing data with deep learning |
title_full_unstemmed | DNAscent v2: detecting replication forks in nanopore sequencing data with deep learning |
title_short | DNAscent v2: detecting replication forks in nanopore sequencing data with deep learning |
title_sort | dnascent v2: detecting replication forks in nanopore sequencing data with deep learning |
topic | Software |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8191041/ https://www.ncbi.nlm.nih.gov/pubmed/34107894 http://dx.doi.org/10.1186/s12864-021-07736-6 |
work_keys_str_mv | AT boemomichaela dnascentv2detectingreplicationforksinnanoporesequencingdatawithdeeplearning |