Cargando…
PanSVR: Pan-Genome Augmented Short Read Realignment for Sensitive Detection of Structural Variations
The comprehensive discovery of structure variations (SVs) is fundamental to many genomics studies and high-throughput sequencing has become a common approach to this task. However, due the limited length, it is still non-trivial to state-of-the-art tools to accurately align short reads and produce h...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8417358/ https://www.ncbi.nlm.nih.gov/pubmed/34490049 http://dx.doi.org/10.3389/fgene.2021.731515 |
_version_ | 1783748363573788672 |
---|---|
author | Li, Gaoyang Jiang, Tao Li, Junyi Wang, Yadong |
author_facet | Li, Gaoyang Jiang, Tao Li, Junyi Wang, Yadong |
author_sort | Li, Gaoyang |
collection | PubMed |
description | The comprehensive discovery of structure variations (SVs) is fundamental to many genomics studies and high-throughput sequencing has become a common approach to this task. However, due the limited length, it is still non-trivial to state-of-the-art tools to accurately align short reads and produce high-quality SV callsets. Pan-genome provides a novel and promising framework to short read-based SV calling since it enables to comprehensively integrate known variants to reduce the incompleteness and bias of single reference to breakthrough the bottlenecks of short read alignments and provide new evidences to the detection of SVs. However, it is still an open problem to develop effective computational approaches to fully take the advantage of pan-genomes. Herein, we propose Pan-genome augmented Structure Variation calling tool with read Re-alignment (PanSVR), a novel pan-genome-based SV calling approach. PanSVR uses several tailored methods to implement precise re-alignment for SV-spanning reads against well-organized pan-genome reference with plenty of known SVs. PanSVR enables to greatly improve the quality of short read alignments and produce clear and homogenous SV signatures which facilitate SV calling. Benchmark results on real sequencing data suggest that PanSVR is able to largely improve the sensitivity of SV calling than that of state-of-the-art SV callers, especially for the SVs from repeat-rich regions and/or novel insertions which are difficult to existing tools. |
format | Online Article Text |
id | pubmed-8417358 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-84173582021-09-05 PanSVR: Pan-Genome Augmented Short Read Realignment for Sensitive Detection of Structural Variations Li, Gaoyang Jiang, Tao Li, Junyi Wang, Yadong Front Genet Genetics The comprehensive discovery of structure variations (SVs) is fundamental to many genomics studies and high-throughput sequencing has become a common approach to this task. However, due the limited length, it is still non-trivial to state-of-the-art tools to accurately align short reads and produce high-quality SV callsets. Pan-genome provides a novel and promising framework to short read-based SV calling since it enables to comprehensively integrate known variants to reduce the incompleteness and bias of single reference to breakthrough the bottlenecks of short read alignments and provide new evidences to the detection of SVs. However, it is still an open problem to develop effective computational approaches to fully take the advantage of pan-genomes. Herein, we propose Pan-genome augmented Structure Variation calling tool with read Re-alignment (PanSVR), a novel pan-genome-based SV calling approach. PanSVR uses several tailored methods to implement precise re-alignment for SV-spanning reads against well-organized pan-genome reference with plenty of known SVs. PanSVR enables to greatly improve the quality of short read alignments and produce clear and homogenous SV signatures which facilitate SV calling. Benchmark results on real sequencing data suggest that PanSVR is able to largely improve the sensitivity of SV calling than that of state-of-the-art SV callers, especially for the SVs from repeat-rich regions and/or novel insertions which are difficult to existing tools. Frontiers Media S.A. 2021-08-19 /pmc/articles/PMC8417358/ /pubmed/34490049 http://dx.doi.org/10.3389/fgene.2021.731515 Text en Copyright © 2021 Li, Jiang, Li and Wang. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Genetics Li, Gaoyang Jiang, Tao Li, Junyi Wang, Yadong PanSVR: Pan-Genome Augmented Short Read Realignment for Sensitive Detection of Structural Variations |
title | PanSVR: Pan-Genome Augmented Short Read Realignment for Sensitive Detection of Structural Variations |
title_full | PanSVR: Pan-Genome Augmented Short Read Realignment for Sensitive Detection of Structural Variations |
title_fullStr | PanSVR: Pan-Genome Augmented Short Read Realignment for Sensitive Detection of Structural Variations |
title_full_unstemmed | PanSVR: Pan-Genome Augmented Short Read Realignment for Sensitive Detection of Structural Variations |
title_short | PanSVR: Pan-Genome Augmented Short Read Realignment for Sensitive Detection of Structural Variations |
title_sort | pansvr: pan-genome augmented short read realignment for sensitive detection of structural variations |
topic | Genetics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8417358/ https://www.ncbi.nlm.nih.gov/pubmed/34490049 http://dx.doi.org/10.3389/fgene.2021.731515 |
work_keys_str_mv | AT ligaoyang pansvrpangenomeaugmentedshortreadrealignmentforsensitivedetectionofstructuralvariations AT jiangtao pansvrpangenomeaugmentedshortreadrealignmentforsensitivedetectionofstructuralvariations AT lijunyi pansvrpangenomeaugmentedshortreadrealignmentforsensitivedetectionofstructuralvariations AT wangyadong pansvrpangenomeaugmentedshortreadrealignmentforsensitivedetectionofstructuralvariations |