Cargando…

PanSVR: Pan-Genome Augmented Short Read Realignment for Sensitive Detection of Structural Variations

The comprehensive discovery of structure variations (SVs) is fundamental to many genomics studies and high-throughput sequencing has become a common approach to this task. However, due the limited length, it is still non-trivial to state-of-the-art tools to accurately align short reads and produce h...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Gaoyang, Jiang, Tao, Li, Junyi, Wang, Yadong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8417358/
https://www.ncbi.nlm.nih.gov/pubmed/34490049
http://dx.doi.org/10.3389/fgene.2021.731515
_version_ 1783748363573788672
author Li, Gaoyang
Jiang, Tao
Li, Junyi
Wang, Yadong
author_facet Li, Gaoyang
Jiang, Tao
Li, Junyi
Wang, Yadong
author_sort Li, Gaoyang
collection PubMed
description The comprehensive discovery of structure variations (SVs) is fundamental to many genomics studies and high-throughput sequencing has become a common approach to this task. However, due the limited length, it is still non-trivial to state-of-the-art tools to accurately align short reads and produce high-quality SV callsets. Pan-genome provides a novel and promising framework to short read-based SV calling since it enables to comprehensively integrate known variants to reduce the incompleteness and bias of single reference to breakthrough the bottlenecks of short read alignments and provide new evidences to the detection of SVs. However, it is still an open problem to develop effective computational approaches to fully take the advantage of pan-genomes. Herein, we propose Pan-genome augmented Structure Variation calling tool with read Re-alignment (PanSVR), a novel pan-genome-based SV calling approach. PanSVR uses several tailored methods to implement precise re-alignment for SV-spanning reads against well-organized pan-genome reference with plenty of known SVs. PanSVR enables to greatly improve the quality of short read alignments and produce clear and homogenous SV signatures which facilitate SV calling. Benchmark results on real sequencing data suggest that PanSVR is able to largely improve the sensitivity of SV calling than that of state-of-the-art SV callers, especially for the SVs from repeat-rich regions and/or novel insertions which are difficult to existing tools.
format Online
Article
Text
id pubmed-8417358
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-84173582021-09-05 PanSVR: Pan-Genome Augmented Short Read Realignment for Sensitive Detection of Structural Variations Li, Gaoyang Jiang, Tao Li, Junyi Wang, Yadong Front Genet Genetics The comprehensive discovery of structure variations (SVs) is fundamental to many genomics studies and high-throughput sequencing has become a common approach to this task. However, due the limited length, it is still non-trivial to state-of-the-art tools to accurately align short reads and produce high-quality SV callsets. Pan-genome provides a novel and promising framework to short read-based SV calling since it enables to comprehensively integrate known variants to reduce the incompleteness and bias of single reference to breakthrough the bottlenecks of short read alignments and provide new evidences to the detection of SVs. However, it is still an open problem to develop effective computational approaches to fully take the advantage of pan-genomes. Herein, we propose Pan-genome augmented Structure Variation calling tool with read Re-alignment (PanSVR), a novel pan-genome-based SV calling approach. PanSVR uses several tailored methods to implement precise re-alignment for SV-spanning reads against well-organized pan-genome reference with plenty of known SVs. PanSVR enables to greatly improve the quality of short read alignments and produce clear and homogenous SV signatures which facilitate SV calling. Benchmark results on real sequencing data suggest that PanSVR is able to largely improve the sensitivity of SV calling than that of state-of-the-art SV callers, especially for the SVs from repeat-rich regions and/or novel insertions which are difficult to existing tools. Frontiers Media S.A. 2021-08-19 /pmc/articles/PMC8417358/ /pubmed/34490049 http://dx.doi.org/10.3389/fgene.2021.731515 Text en Copyright © 2021 Li, Jiang, Li and Wang. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Li, Gaoyang
Jiang, Tao
Li, Junyi
Wang, Yadong
PanSVR: Pan-Genome Augmented Short Read Realignment for Sensitive Detection of Structural Variations
title PanSVR: Pan-Genome Augmented Short Read Realignment for Sensitive Detection of Structural Variations
title_full PanSVR: Pan-Genome Augmented Short Read Realignment for Sensitive Detection of Structural Variations
title_fullStr PanSVR: Pan-Genome Augmented Short Read Realignment for Sensitive Detection of Structural Variations
title_full_unstemmed PanSVR: Pan-Genome Augmented Short Read Realignment for Sensitive Detection of Structural Variations
title_short PanSVR: Pan-Genome Augmented Short Read Realignment for Sensitive Detection of Structural Variations
title_sort pansvr: pan-genome augmented short read realignment for sensitive detection of structural variations
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8417358/
https://www.ncbi.nlm.nih.gov/pubmed/34490049
http://dx.doi.org/10.3389/fgene.2021.731515
work_keys_str_mv AT ligaoyang pansvrpangenomeaugmentedshortreadrealignmentforsensitivedetectionofstructuralvariations
AT jiangtao pansvrpangenomeaugmentedshortreadrealignmentforsensitivedetectionofstructuralvariations
AT lijunyi pansvrpangenomeaugmentedshortreadrealignmentforsensitivedetectionofstructuralvariations
AT wangyadong pansvrpangenomeaugmentedshortreadrealignmentforsensitivedetectionofstructuralvariations