Cargando…
ClipSV: improving structural variation detection by read extension, spliced alignment and tree-based decision rules
Structural variation (SV), which consists of genomic variation from 50 to millions of base pairs, confers considerable impacts on human diseases, complex traits and evolution. Accurately detecting SV is a fundamental step to characterize the features of individual genomes. Currently, several methods...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7850140/ https://www.ncbi.nlm.nih.gov/pubmed/33554118 http://dx.doi.org/10.1093/nargab/lqab003 |
_version_ | 1783645408922173440 |
---|---|
author | Xu, Peng chen, Yu Gao, Min Chong, Zechen |
author_facet | Xu, Peng chen, Yu Gao, Min Chong, Zechen |
author_sort | Xu, Peng |
collection | PubMed |
description | Structural variation (SV), which consists of genomic variation from 50 to millions of base pairs, confers considerable impacts on human diseases, complex traits and evolution. Accurately detecting SV is a fundamental step to characterize the features of individual genomes. Currently, several methods have been proposed to detect SVs using the next-generation sequencing (NGS) platform. However, due to the short length of sequencing reads and the complexity of SV content, the SV-detecting tools are still limited by low sensitivity, especially for insertion detection. In this study, we developed a novel tool, ClipSV, to improve SV discovery. ClipSV utilizes a read extension and spliced alignment approach to overcoming the limitation of read length. By reconstructing long sequences from SV-associated short reads, ClipSV discovers deletions and short insertions from the long sequence alignments. To comprehensively characterize insertions, ClipSV implements tree-based decision rules that can efficiently utilize SV-containing reads. Based on the evaluations of both simulated and real sequencing data, ClipSV exhibited an overall better performance compared to currently popular tools, especially for insertion detection. As NGS platform represents the mainstream sequencing capacity for routine genomic applications, we anticipate ClipSV will serve as an important tool for SV characterization in future genomic studies. |
format | Online Article Text |
id | pubmed-7850140 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-78501402021-02-04 ClipSV: improving structural variation detection by read extension, spliced alignment and tree-based decision rules Xu, Peng chen, Yu Gao, Min Chong, Zechen NAR Genom Bioinform Methods Article Structural variation (SV), which consists of genomic variation from 50 to millions of base pairs, confers considerable impacts on human diseases, complex traits and evolution. Accurately detecting SV is a fundamental step to characterize the features of individual genomes. Currently, several methods have been proposed to detect SVs using the next-generation sequencing (NGS) platform. However, due to the short length of sequencing reads and the complexity of SV content, the SV-detecting tools are still limited by low sensitivity, especially for insertion detection. In this study, we developed a novel tool, ClipSV, to improve SV discovery. ClipSV utilizes a read extension and spliced alignment approach to overcoming the limitation of read length. By reconstructing long sequences from SV-associated short reads, ClipSV discovers deletions and short insertions from the long sequence alignments. To comprehensively characterize insertions, ClipSV implements tree-based decision rules that can efficiently utilize SV-containing reads. Based on the evaluations of both simulated and real sequencing data, ClipSV exhibited an overall better performance compared to currently popular tools, especially for insertion detection. As NGS platform represents the mainstream sequencing capacity for routine genomic applications, we anticipate ClipSV will serve as an important tool for SV characterization in future genomic studies. Oxford University Press 2021-02-01 /pmc/articles/PMC7850140/ /pubmed/33554118 http://dx.doi.org/10.1093/nargab/lqab003 Text en © The Author(s) 2021. Published by Oxford University Press on behalf of NAR Genomics and Bioinformatics. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Methods Article Xu, Peng chen, Yu Gao, Min Chong, Zechen ClipSV: improving structural variation detection by read extension, spliced alignment and tree-based decision rules |
title | ClipSV: improving structural variation detection by read extension, spliced alignment and tree-based decision rules |
title_full | ClipSV: improving structural variation detection by read extension, spliced alignment and tree-based decision rules |
title_fullStr | ClipSV: improving structural variation detection by read extension, spliced alignment and tree-based decision rules |
title_full_unstemmed | ClipSV: improving structural variation detection by read extension, spliced alignment and tree-based decision rules |
title_short | ClipSV: improving structural variation detection by read extension, spliced alignment and tree-based decision rules |
title_sort | clipsv: improving structural variation detection by read extension, spliced alignment and tree-based decision rules |
topic | Methods Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7850140/ https://www.ncbi.nlm.nih.gov/pubmed/33554118 http://dx.doi.org/10.1093/nargab/lqab003 |
work_keys_str_mv | AT xupeng clipsvimprovingstructuralvariationdetectionbyreadextensionsplicedalignmentandtreebaseddecisionrules AT chenyu clipsvimprovingstructuralvariationdetectionbyreadextensionsplicedalignmentandtreebaseddecisionrules AT gaomin clipsvimprovingstructuralvariationdetectionbyreadextensionsplicedalignmentandtreebaseddecisionrules AT chongzechen clipsvimprovingstructuralvariationdetectionbyreadextensionsplicedalignmentandtreebaseddecisionrules |