Cargando…

Rapid and accurate alignment of nucleotide conversion sequencing reads with HISAT-3N

Sequencing technologies using nucleotide conversion techniques such as cytosine to thymine in bisulfite-seq and thymine to cytosine in SLAM seq are powerful tools to explore the chemical intricacies of cellular processes. To date, no one has developed a unified methodology for aligning converted seq...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Yun, Park, Chanhee, Bennett, Christopher, Thornton, Micah, Kim, Daehwan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cold Spring Harbor Laboratory Press 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8256862/
https://www.ncbi.nlm.nih.gov/pubmed/34103331
http://dx.doi.org/10.1101/gr.275193.120
_version_ 1783718183016857600
author Zhang, Yun
Park, Chanhee
Bennett, Christopher
Thornton, Micah
Kim, Daehwan
author_facet Zhang, Yun
Park, Chanhee
Bennett, Christopher
Thornton, Micah
Kim, Daehwan
author_sort Zhang, Yun
collection PubMed
description Sequencing technologies using nucleotide conversion techniques such as cytosine to thymine in bisulfite-seq and thymine to cytosine in SLAM seq are powerful tools to explore the chemical intricacies of cellular processes. To date, no one has developed a unified methodology for aligning converted sequences and consolidating alignment of these technologies in one package. In this paper, we describe hierarchical indexing for spliced alignment of transcripts–3 nucleotides (HISAT-3N), which can rapidly and accurately align sequences consisting of any nucleotide conversion by leveraging the powerful hierarchical index and repeat index algorithms originally developed for the HISAT software. Tests on real and simulated data sets show that HISAT-3N is faster than other modern systems, with greater alignment accuracy, higher scalability, and smaller memory requirements. HISAT-3N therefore becomes an ideal aligner when used with converted sequence technologies.
format Online
Article
Text
id pubmed-8256862
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Cold Spring Harbor Laboratory Press
record_format MEDLINE/PubMed
spelling pubmed-82568622022-01-01 Rapid and accurate alignment of nucleotide conversion sequencing reads with HISAT-3N Zhang, Yun Park, Chanhee Bennett, Christopher Thornton, Micah Kim, Daehwan Genome Res Method Sequencing technologies using nucleotide conversion techniques such as cytosine to thymine in bisulfite-seq and thymine to cytosine in SLAM seq are powerful tools to explore the chemical intricacies of cellular processes. To date, no one has developed a unified methodology for aligning converted sequences and consolidating alignment of these technologies in one package. In this paper, we describe hierarchical indexing for spliced alignment of transcripts–3 nucleotides (HISAT-3N), which can rapidly and accurately align sequences consisting of any nucleotide conversion by leveraging the powerful hierarchical index and repeat index algorithms originally developed for the HISAT software. Tests on real and simulated data sets show that HISAT-3N is faster than other modern systems, with greater alignment accuracy, higher scalability, and smaller memory requirements. HISAT-3N therefore becomes an ideal aligner when used with converted sequence technologies. Cold Spring Harbor Laboratory Press 2021-07 /pmc/articles/PMC8256862/ /pubmed/34103331 http://dx.doi.org/10.1101/gr.275193.120 Text en © 2021 Zhang et al.; Published by Cold Spring Harbor Laboratory Press https://creativecommons.org/licenses/by-nc/4.0/This article is distributed exclusively by Cold Spring Harbor Laboratory Press for the first six months after the full-issue publication date (see https://genome.cshlp.org/site/misc/terms.xhtml). After six months, it is available under a Creative Commons License (Attribution-NonCommercial 4.0 International), as described at http://creativecommons.org/licenses/by-nc/4.0/ (https://creativecommons.org/licenses/by-nc/4.0/) .
spellingShingle Method
Zhang, Yun
Park, Chanhee
Bennett, Christopher
Thornton, Micah
Kim, Daehwan
Rapid and accurate alignment of nucleotide conversion sequencing reads with HISAT-3N
title Rapid and accurate alignment of nucleotide conversion sequencing reads with HISAT-3N
title_full Rapid and accurate alignment of nucleotide conversion sequencing reads with HISAT-3N
title_fullStr Rapid and accurate alignment of nucleotide conversion sequencing reads with HISAT-3N
title_full_unstemmed Rapid and accurate alignment of nucleotide conversion sequencing reads with HISAT-3N
title_short Rapid and accurate alignment of nucleotide conversion sequencing reads with HISAT-3N
title_sort rapid and accurate alignment of nucleotide conversion sequencing reads with hisat-3n
topic Method
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8256862/
https://www.ncbi.nlm.nih.gov/pubmed/34103331
http://dx.doi.org/10.1101/gr.275193.120
work_keys_str_mv AT zhangyun rapidandaccuratealignmentofnucleotideconversionsequencingreadswithhisat3n
AT parkchanhee rapidandaccuratealignmentofnucleotideconversionsequencingreadswithhisat3n
AT bennettchristopher rapidandaccuratealignmentofnucleotideconversionsequencingreadswithhisat3n
AT thorntonmicah rapidandaccuratealignmentofnucleotideconversionsequencingreadswithhisat3n
AT kimdaehwan rapidandaccuratealignmentofnucleotideconversionsequencingreadswithhisat3n