Cargando…
HQAlign: Aligning nanopore reads for SV detection using current-level modeling
MOTIVATION: Detection of structural variants (SV) from the alignment of sample DNA reads to the reference genome is an important problem in understanding human diseases. Long reads that can span repeat regions, along with an accurate alignment of these long reads play an important role in identifyin...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Cornell University
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9882582/ https://www.ncbi.nlm.nih.gov/pubmed/36713252 |
_version_ | 1784879321606782976 |
---|---|
author | Joshi, Dhaivat Diggavi, Suhas Chaisson, Mark J.P. Kannan, Sreeram |
author_facet | Joshi, Dhaivat Diggavi, Suhas Chaisson, Mark J.P. Kannan, Sreeram |
author_sort | Joshi, Dhaivat |
collection | PubMed |
description | MOTIVATION: Detection of structural variants (SV) from the alignment of sample DNA reads to the reference genome is an important problem in understanding human diseases. Long reads that can span repeat regions, along with an accurate alignment of these long reads play an important role in identifying novel SVs. Long read sequencers such as nanopore sequencing can address this problem by providing very long reads but with high error rates, making accurate alignment challenging. Many errors induced by nanopore sequencing have a bias because of the physics of the sequencing process and proper utilization of these error characteristics can play an important role in designing a robust aligner for SV detection problems. In this paper, we design and evaluate HQAlign, an aligner for SV detection using nanopore sequenced reads. The key ideas of HQAlign include (i) using basecalled nanopore reads along with the nanopore physics to improve alignments for SVs (ii) incorporating SV specific changes to the alignment pipeline (iii) adapting these into existing state-of-the-art long read aligner pipeline, minimap2 (v2.24), for efficient alignments. RESULTS: We show that HQAlign captures about 4 – 6% complementary SVs across different datasets which are missed by minimap2 alignments while having a standalone performance at par with minimap2 for real nanopore reads data. For the common SV calls between HQAlign and minimap2, HQAlign improves the start and the end breakpoint accuracy for about 10 – 50% of SVs across different datasets. Moreover, HQAlign improves the alignment rate to 89.35% from minimap2 85.64% for nanopore reads alignment to recent telomere-to-telomere CHM13 assembly, and it improves to 86.65% from 83.48% for nanopore reads alignment to GRCh37 human genome. AVAILABILITY: https://github.com/joshidhaivat/HQAlign.git |
format | Online Article Text |
id | pubmed-9882582 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Cornell University |
record_format | MEDLINE/PubMed |
spelling | pubmed-98825822023-01-28 HQAlign: Aligning nanopore reads for SV detection using current-level modeling Joshi, Dhaivat Diggavi, Suhas Chaisson, Mark J.P. Kannan, Sreeram ArXiv Article MOTIVATION: Detection of structural variants (SV) from the alignment of sample DNA reads to the reference genome is an important problem in understanding human diseases. Long reads that can span repeat regions, along with an accurate alignment of these long reads play an important role in identifying novel SVs. Long read sequencers such as nanopore sequencing can address this problem by providing very long reads but with high error rates, making accurate alignment challenging. Many errors induced by nanopore sequencing have a bias because of the physics of the sequencing process and proper utilization of these error characteristics can play an important role in designing a robust aligner for SV detection problems. In this paper, we design and evaluate HQAlign, an aligner for SV detection using nanopore sequenced reads. The key ideas of HQAlign include (i) using basecalled nanopore reads along with the nanopore physics to improve alignments for SVs (ii) incorporating SV specific changes to the alignment pipeline (iii) adapting these into existing state-of-the-art long read aligner pipeline, minimap2 (v2.24), for efficient alignments. RESULTS: We show that HQAlign captures about 4 – 6% complementary SVs across different datasets which are missed by minimap2 alignments while having a standalone performance at par with minimap2 for real nanopore reads data. For the common SV calls between HQAlign and minimap2, HQAlign improves the start and the end breakpoint accuracy for about 10 – 50% of SVs across different datasets. Moreover, HQAlign improves the alignment rate to 89.35% from minimap2 85.64% for nanopore reads alignment to recent telomere-to-telomere CHM13 assembly, and it improves to 86.65% from 83.48% for nanopore reads alignment to GRCh37 human genome. AVAILABILITY: https://github.com/joshidhaivat/HQAlign.git Cornell University 2023-01-10 /pmc/articles/PMC9882582/ /pubmed/36713252 Text en https://creativecommons.org/licenses/by/4.0/This work is licensed under a Creative Commons Attribution 4.0 International License (https://creativecommons.org/licenses/by/4.0/) , which allows reusers to distribute, remix, adapt, and build upon the material in any medium or format, so long as attribution is given to the creator. The license allows for commercial use. |
spellingShingle | Article Joshi, Dhaivat Diggavi, Suhas Chaisson, Mark J.P. Kannan, Sreeram HQAlign: Aligning nanopore reads for SV detection using current-level modeling |
title | HQAlign: Aligning nanopore reads for SV detection using current-level modeling |
title_full | HQAlign: Aligning nanopore reads for SV detection using current-level modeling |
title_fullStr | HQAlign: Aligning nanopore reads for SV detection using current-level modeling |
title_full_unstemmed | HQAlign: Aligning nanopore reads for SV detection using current-level modeling |
title_short | HQAlign: Aligning nanopore reads for SV detection using current-level modeling |
title_sort | hqalign: aligning nanopore reads for sv detection using current-level modeling |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9882582/ https://www.ncbi.nlm.nih.gov/pubmed/36713252 |
work_keys_str_mv | AT joshidhaivat hqalignaligningnanoporereadsforsvdetectionusingcurrentlevelmodeling AT diggavisuhas hqalignaligningnanoporereadsforsvdetectionusingcurrentlevelmodeling AT chaissonmarkjp hqalignaligningnanoporereadsforsvdetectionusingcurrentlevelmodeling AT kannansreeram hqalignaligningnanoporereadsforsvdetectionusingcurrentlevelmodeling |