Cargando…

ClipSV: improving structural variation detection by read extension, spliced alignment and tree-based decision rules

Structural variation (SV), which consists of genomic variation from 50 to millions of base pairs, confers considerable impacts on human diseases, complex traits and evolution. Accurately detecting SV is a fundamental step to characterize the features of individual genomes. Currently, several methods...

Descripción completa

Detalles Bibliográficos
Autores principales: Xu, Peng, chen, Yu, Gao, Min, Chong, Zechen
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7850140/
https://www.ncbi.nlm.nih.gov/pubmed/33554118
http://dx.doi.org/10.1093/nargab/lqab003
_version_ 1783645408922173440
author Xu, Peng
chen, Yu
Gao, Min
Chong, Zechen
author_facet Xu, Peng
chen, Yu
Gao, Min
Chong, Zechen
author_sort Xu, Peng
collection PubMed
description Structural variation (SV), which consists of genomic variation from 50 to millions of base pairs, confers considerable impacts on human diseases, complex traits and evolution. Accurately detecting SV is a fundamental step to characterize the features of individual genomes. Currently, several methods have been proposed to detect SVs using the next-generation sequencing (NGS) platform. However, due to the short length of sequencing reads and the complexity of SV content, the SV-detecting tools are still limited by low sensitivity, especially for insertion detection. In this study, we developed a novel tool, ClipSV, to improve SV discovery. ClipSV utilizes a read extension and spliced alignment approach to overcoming the limitation of read length. By reconstructing long sequences from SV-associated short reads, ClipSV discovers deletions and short insertions from the long sequence alignments. To comprehensively characterize insertions, ClipSV implements tree-based decision rules that can efficiently utilize SV-containing reads. Based on the evaluations of both simulated and real sequencing data, ClipSV exhibited an overall better performance compared to currently popular tools, especially for insertion detection. As NGS platform represents the mainstream sequencing capacity for routine genomic applications, we anticipate ClipSV will serve as an important tool for SV characterization in future genomic studies.
format Online
Article
Text
id pubmed-7850140
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-78501402021-02-04 ClipSV: improving structural variation detection by read extension, spliced alignment and tree-based decision rules Xu, Peng chen, Yu Gao, Min Chong, Zechen NAR Genom Bioinform Methods Article Structural variation (SV), which consists of genomic variation from 50 to millions of base pairs, confers considerable impacts on human diseases, complex traits and evolution. Accurately detecting SV is a fundamental step to characterize the features of individual genomes. Currently, several methods have been proposed to detect SVs using the next-generation sequencing (NGS) platform. However, due to the short length of sequencing reads and the complexity of SV content, the SV-detecting tools are still limited by low sensitivity, especially for insertion detection. In this study, we developed a novel tool, ClipSV, to improve SV discovery. ClipSV utilizes a read extension and spliced alignment approach to overcoming the limitation of read length. By reconstructing long sequences from SV-associated short reads, ClipSV discovers deletions and short insertions from the long sequence alignments. To comprehensively characterize insertions, ClipSV implements tree-based decision rules that can efficiently utilize SV-containing reads. Based on the evaluations of both simulated and real sequencing data, ClipSV exhibited an overall better performance compared to currently popular tools, especially for insertion detection. As NGS platform represents the mainstream sequencing capacity for routine genomic applications, we anticipate ClipSV will serve as an important tool for SV characterization in future genomic studies. Oxford University Press 2021-02-01 /pmc/articles/PMC7850140/ /pubmed/33554118 http://dx.doi.org/10.1093/nargab/lqab003 Text en © The Author(s) 2021. Published by Oxford University Press on behalf of NAR Genomics and Bioinformatics. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methods Article
Xu, Peng
chen, Yu
Gao, Min
Chong, Zechen
ClipSV: improving structural variation detection by read extension, spliced alignment and tree-based decision rules
title ClipSV: improving structural variation detection by read extension, spliced alignment and tree-based decision rules
title_full ClipSV: improving structural variation detection by read extension, spliced alignment and tree-based decision rules
title_fullStr ClipSV: improving structural variation detection by read extension, spliced alignment and tree-based decision rules
title_full_unstemmed ClipSV: improving structural variation detection by read extension, spliced alignment and tree-based decision rules
title_short ClipSV: improving structural variation detection by read extension, spliced alignment and tree-based decision rules
title_sort clipsv: improving structural variation detection by read extension, spliced alignment and tree-based decision rules
topic Methods Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7850140/
https://www.ncbi.nlm.nih.gov/pubmed/33554118
http://dx.doi.org/10.1093/nargab/lqab003
work_keys_str_mv AT xupeng clipsvimprovingstructuralvariationdetectionbyreadextensionsplicedalignmentandtreebaseddecisionrules
AT chenyu clipsvimprovingstructuralvariationdetectionbyreadextensionsplicedalignmentandtreebaseddecisionrules
AT gaomin clipsvimprovingstructuralvariationdetectionbyreadextensionsplicedalignmentandtreebaseddecisionrules
AT chongzechen clipsvimprovingstructuralvariationdetectionbyreadextensionsplicedalignmentandtreebaseddecisionrules