Cargando…

Fibertools: fast and accurate DNA-m6A calling using single-molecule long-read sequencing

Single-molecule chromatin fiber sequencing is based on the single-nucleotide resolution identification of DNA N(6)-methyladenine (m6A) along individual sequencing reads. We present fibertools, a semi-supervised convolutional neural network that permits the fast and accurate identification of both en...

Descripción completa

Detalles Bibliográficos
Autores principales: Jha, Anupama, Bohaczuk, Stephanie C., Mao, Yizi, Ranchalis, Jane, Mallory, Benjamin J., Min, Alan T., Hamm, Morgan O., Swanson, Elliott, Finkbeiner, Connor, Li, Tony, Whittington, Dale, Noble, William Stafford, Stergachis, Andrew B., Vollger, Mitchell R.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cold Spring Harbor Laboratory 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10153250/
https://www.ncbi.nlm.nih.gov/pubmed/37131601
http://dx.doi.org/10.1101/2023.04.20.537673
_version_ 1785035895314841600
author Jha, Anupama
Bohaczuk, Stephanie C.
Mao, Yizi
Ranchalis, Jane
Mallory, Benjamin J.
Min, Alan T.
Hamm, Morgan O.
Swanson, Elliott
Finkbeiner, Connor
Li, Tony
Whittington, Dale
Noble, William Stafford
Stergachis, Andrew B.
Vollger, Mitchell R.
author_facet Jha, Anupama
Bohaczuk, Stephanie C.
Mao, Yizi
Ranchalis, Jane
Mallory, Benjamin J.
Min, Alan T.
Hamm, Morgan O.
Swanson, Elliott
Finkbeiner, Connor
Li, Tony
Whittington, Dale
Noble, William Stafford
Stergachis, Andrew B.
Vollger, Mitchell R.
author_sort Jha, Anupama
collection PubMed
description Single-molecule chromatin fiber sequencing is based on the single-nucleotide resolution identification of DNA N(6)-methyladenine (m6A) along individual sequencing reads. We present fibertools, a semi-supervised convolutional neural network that permits the fast and accurate identification of both endogenous and exogenous m6A-marked bases using single-molecule long-read sequencing. Fibertools enables highly accurate (>90% precision and recall) m6A identification along multi-kilobase DNA molecules with a ~1,000-fold improvement in speed and the capacity to generalize to new sequencing chemistries.
format Online
Article
Text
id pubmed-10153250
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Cold Spring Harbor Laboratory
record_format MEDLINE/PubMed
spelling pubmed-101532502023-05-03 Fibertools: fast and accurate DNA-m6A calling using single-molecule long-read sequencing Jha, Anupama Bohaczuk, Stephanie C. Mao, Yizi Ranchalis, Jane Mallory, Benjamin J. Min, Alan T. Hamm, Morgan O. Swanson, Elliott Finkbeiner, Connor Li, Tony Whittington, Dale Noble, William Stafford Stergachis, Andrew B. Vollger, Mitchell R. bioRxiv Article Single-molecule chromatin fiber sequencing is based on the single-nucleotide resolution identification of DNA N(6)-methyladenine (m6A) along individual sequencing reads. We present fibertools, a semi-supervised convolutional neural network that permits the fast and accurate identification of both endogenous and exogenous m6A-marked bases using single-molecule long-read sequencing. Fibertools enables highly accurate (>90% precision and recall) m6A identification along multi-kilobase DNA molecules with a ~1,000-fold improvement in speed and the capacity to generalize to new sequencing chemistries. Cold Spring Harbor Laboratory 2023-07-06 /pmc/articles/PMC10153250/ /pubmed/37131601 http://dx.doi.org/10.1101/2023.04.20.537673 Text en https://creativecommons.org/licenses/by/4.0/This work is licensed under a Creative Commons Attribution 4.0 International License (https://creativecommons.org/licenses/by/4.0/) , which allows reusers to distribute, remix, adapt, and build upon the material in any medium or format, so long as attribution is given to the creator. The license allows for commercial use.
spellingShingle Article
Jha, Anupama
Bohaczuk, Stephanie C.
Mao, Yizi
Ranchalis, Jane
Mallory, Benjamin J.
Min, Alan T.
Hamm, Morgan O.
Swanson, Elliott
Finkbeiner, Connor
Li, Tony
Whittington, Dale
Noble, William Stafford
Stergachis, Andrew B.
Vollger, Mitchell R.
Fibertools: fast and accurate DNA-m6A calling using single-molecule long-read sequencing
title Fibertools: fast and accurate DNA-m6A calling using single-molecule long-read sequencing
title_full Fibertools: fast and accurate DNA-m6A calling using single-molecule long-read sequencing
title_fullStr Fibertools: fast and accurate DNA-m6A calling using single-molecule long-read sequencing
title_full_unstemmed Fibertools: fast and accurate DNA-m6A calling using single-molecule long-read sequencing
title_short Fibertools: fast and accurate DNA-m6A calling using single-molecule long-read sequencing
title_sort fibertools: fast and accurate dna-m6a calling using single-molecule long-read sequencing
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10153250/
https://www.ncbi.nlm.nih.gov/pubmed/37131601
http://dx.doi.org/10.1101/2023.04.20.537673
work_keys_str_mv AT jhaanupama fibertoolsfastandaccuratednam6acallingusingsinglemoleculelongreadsequencing
AT bohaczukstephaniec fibertoolsfastandaccuratednam6acallingusingsinglemoleculelongreadsequencing
AT maoyizi fibertoolsfastandaccuratednam6acallingusingsinglemoleculelongreadsequencing
AT ranchalisjane fibertoolsfastandaccuratednam6acallingusingsinglemoleculelongreadsequencing
AT mallorybenjaminj fibertoolsfastandaccuratednam6acallingusingsinglemoleculelongreadsequencing
AT minalant fibertoolsfastandaccuratednam6acallingusingsinglemoleculelongreadsequencing
AT hammmorgano fibertoolsfastandaccuratednam6acallingusingsinglemoleculelongreadsequencing
AT swansonelliott fibertoolsfastandaccuratednam6acallingusingsinglemoleculelongreadsequencing
AT finkbeinerconnor fibertoolsfastandaccuratednam6acallingusingsinglemoleculelongreadsequencing
AT litony fibertoolsfastandaccuratednam6acallingusingsinglemoleculelongreadsequencing
AT whittingtondale fibertoolsfastandaccuratednam6acallingusingsinglemoleculelongreadsequencing
AT noblewilliamstafford fibertoolsfastandaccuratednam6acallingusingsinglemoleculelongreadsequencing
AT stergachisandrewb fibertoolsfastandaccuratednam6acallingusingsinglemoleculelongreadsequencing
AT vollgermitchellr fibertoolsfastandaccuratednam6acallingusingsinglemoleculelongreadsequencing