Cargando…

DNAscent v2: detecting replication forks in nanopore sequencing data with deep learning

BACKGROUND: Measuring DNA replication dynamics with high throughput and single-molecule resolution is critical for understanding both the basic biology behind how cells replicate their DNA and how DNA replication can be used as a therapeutic target for diseases like cancer. In recent years, the dete...

Descripción completa

Detalles Bibliográficos
Autor principal: Boemo, Michael A.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8191041/
https://www.ncbi.nlm.nih.gov/pubmed/34107894
http://dx.doi.org/10.1186/s12864-021-07736-6
_version_ 1783705798415745024
author Boemo, Michael A.
author_facet Boemo, Michael A.
author_sort Boemo, Michael A.
collection PubMed
description BACKGROUND: Measuring DNA replication dynamics with high throughput and single-molecule resolution is critical for understanding both the basic biology behind how cells replicate their DNA and how DNA replication can be used as a therapeutic target for diseases like cancer. In recent years, the detection of base analogues in Oxford Nanopore Technologies (ONT) sequencing reads has become a promising new method to supersede existing single-molecule methods such as DNA fibre analysis: ONT sequencing yields long reads with high throughput, and sequenced molecules can be mapped to the genome using standard sequence alignment software. RESULTS: This paper introduces DNAscent v2, software that uses a residual neural network to achieve fast, accurate detection of the thymidine analogue BrdU with single-nucleotide resolution. DNAscent v2 also comes equipped with an autoencoder that interprets the pattern of BrdU incorporation on each ONT-sequenced molecule into replication fork direction to call the location of replication origins termination sites. DNAscent v2 surpasses previous versions of DNAscent in BrdU calling accuracy, origin calling accuracy, speed, and versatility across different experimental protocols. Unlike NanoMod, DNAscent v2 positively identifies BrdU without the need for sequencing unmodified DNA. Unlike RepNano, DNAscent v2 calls BrdU with single-nucleotide resolution and detects more origins than RepNano from the same sequencing data. DNAscent v2 is open-source and available at https://github.com/MBoemo/DNAscent. CONCLUSIONS: This paper shows that DNAscent v2 is the new state-of-the-art in the high-throughput, single-molecule detection of replication fork dynamics. These improvements in DNAscent v2 mark an important step towards measuring DNA replication dynamics in large genomes with single-molecule resolution. Looking forward, the increase in accuracy in single-nucleotide resolution BrdU calls will also allow DNAscent v2 to branch out into other areas of genome stability research, particularly the detection of DNA repair. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at (10.1186/s12864-021-07736-6).
format Online
Article
Text
id pubmed-8191041
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-81910412021-06-10 DNAscent v2: detecting replication forks in nanopore sequencing data with deep learning Boemo, Michael A. BMC Genomics Software BACKGROUND: Measuring DNA replication dynamics with high throughput and single-molecule resolution is critical for understanding both the basic biology behind how cells replicate their DNA and how DNA replication can be used as a therapeutic target for diseases like cancer. In recent years, the detection of base analogues in Oxford Nanopore Technologies (ONT) sequencing reads has become a promising new method to supersede existing single-molecule methods such as DNA fibre analysis: ONT sequencing yields long reads with high throughput, and sequenced molecules can be mapped to the genome using standard sequence alignment software. RESULTS: This paper introduces DNAscent v2, software that uses a residual neural network to achieve fast, accurate detection of the thymidine analogue BrdU with single-nucleotide resolution. DNAscent v2 also comes equipped with an autoencoder that interprets the pattern of BrdU incorporation on each ONT-sequenced molecule into replication fork direction to call the location of replication origins termination sites. DNAscent v2 surpasses previous versions of DNAscent in BrdU calling accuracy, origin calling accuracy, speed, and versatility across different experimental protocols. Unlike NanoMod, DNAscent v2 positively identifies BrdU without the need for sequencing unmodified DNA. Unlike RepNano, DNAscent v2 calls BrdU with single-nucleotide resolution and detects more origins than RepNano from the same sequencing data. DNAscent v2 is open-source and available at https://github.com/MBoemo/DNAscent. CONCLUSIONS: This paper shows that DNAscent v2 is the new state-of-the-art in the high-throughput, single-molecule detection of replication fork dynamics. These improvements in DNAscent v2 mark an important step towards measuring DNA replication dynamics in large genomes with single-molecule resolution. Looking forward, the increase in accuracy in single-nucleotide resolution BrdU calls will also allow DNAscent v2 to branch out into other areas of genome stability research, particularly the detection of DNA repair. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at (10.1186/s12864-021-07736-6). BioMed Central 2021-06-09 /pmc/articles/PMC8191041/ /pubmed/34107894 http://dx.doi.org/10.1186/s12864-021-07736-6 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Software
Boemo, Michael A.
DNAscent v2: detecting replication forks in nanopore sequencing data with deep learning
title DNAscent v2: detecting replication forks in nanopore sequencing data with deep learning
title_full DNAscent v2: detecting replication forks in nanopore sequencing data with deep learning
title_fullStr DNAscent v2: detecting replication forks in nanopore sequencing data with deep learning
title_full_unstemmed DNAscent v2: detecting replication forks in nanopore sequencing data with deep learning
title_short DNAscent v2: detecting replication forks in nanopore sequencing data with deep learning
title_sort dnascent v2: detecting replication forks in nanopore sequencing data with deep learning
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8191041/
https://www.ncbi.nlm.nih.gov/pubmed/34107894
http://dx.doi.org/10.1186/s12864-021-07736-6
work_keys_str_mv AT boemomichaela dnascentv2detectingreplicationforksinnanoporesequencingdatawithdeeplearning