Cargando…
Aquila_stLFR: diploid genome assembly based structural variant calling package for stLFR linked-reads
MOTIVATION: Identifying structural variants (SVs) is critical in health and disease, however, detecting them remains a challenge. Several linked-read sequencing technologies, including 10X Genomics, TELL-Seq and single tube long fragment read (stLFR), have been recently developed as cost-effective a...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9710574/ https://www.ncbi.nlm.nih.gov/pubmed/36700103 http://dx.doi.org/10.1093/bioadv/vbab007 |
_version_ | 1784841396513931264 |
---|---|
author | Liu, Yichen Henry Grubbs, Griffin L Zhang, Lu Fang, Xiaodong Dill, David L Sidow, Arend Zhou, Xin |
author_facet | Liu, Yichen Henry Grubbs, Griffin L Zhang, Lu Fang, Xiaodong Dill, David L Sidow, Arend Zhou, Xin |
author_sort | Liu, Yichen Henry |
collection | PubMed |
description | MOTIVATION: Identifying structural variants (SVs) is critical in health and disease, however, detecting them remains a challenge. Several linked-read sequencing technologies, including 10X Genomics, TELL-Seq and single tube long fragment read (stLFR), have been recently developed as cost-effective approaches to reconstruct multi-megabase haplotypes (phase blocks) from sequence data of a single sample. These technologies provide an optimal sequencing platform to characterize SVs, though few computational algorithms can utilize them. Thus, we developed Aquila_stLFR, an approach that resolves SVs through haplotype-based assembly of stLFR linked-reads. RESULTS: Aquila_stLFR first partitions long fragment reads into two haplotype-specific blocks with the assistance of the high-quality reference genome, by taking advantage of the potential phasing ability of the linked-read itself. Each haplotype is then assembled independently, to achieve a complete diploid assembly to finally reconstruct the genome-wide SVs. We benchmarked Aquila_stLFR on a well-studied sample, NA24385, and showed Aquila_stLFR can detect medium to large size deletions (50 bp–10 kb) with high sensitivity and medium-size insertions (50 bp–1 kb) with high specificity. AVAILABILITY AND IMPLEMENTATION: Source code and documentation are available on https://github.com/maiziex/Aquila_stLFR. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics Advances online. |
format | Online Article Text |
id | pubmed-9710574 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-97105742023-01-24 Aquila_stLFR: diploid genome assembly based structural variant calling package for stLFR linked-reads Liu, Yichen Henry Grubbs, Griffin L Zhang, Lu Fang, Xiaodong Dill, David L Sidow, Arend Zhou, Xin Bioinform Adv Applications Note MOTIVATION: Identifying structural variants (SVs) is critical in health and disease, however, detecting them remains a challenge. Several linked-read sequencing technologies, including 10X Genomics, TELL-Seq and single tube long fragment read (stLFR), have been recently developed as cost-effective approaches to reconstruct multi-megabase haplotypes (phase blocks) from sequence data of a single sample. These technologies provide an optimal sequencing platform to characterize SVs, though few computational algorithms can utilize them. Thus, we developed Aquila_stLFR, an approach that resolves SVs through haplotype-based assembly of stLFR linked-reads. RESULTS: Aquila_stLFR first partitions long fragment reads into two haplotype-specific blocks with the assistance of the high-quality reference genome, by taking advantage of the potential phasing ability of the linked-read itself. Each haplotype is then assembled independently, to achieve a complete diploid assembly to finally reconstruct the genome-wide SVs. We benchmarked Aquila_stLFR on a well-studied sample, NA24385, and showed Aquila_stLFR can detect medium to large size deletions (50 bp–10 kb) with high sensitivity and medium-size insertions (50 bp–1 kb) with high specificity. AVAILABILITY AND IMPLEMENTATION: Source code and documentation are available on https://github.com/maiziex/Aquila_stLFR. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics Advances online. Oxford University Press 2021-06-16 /pmc/articles/PMC9710574/ /pubmed/36700103 http://dx.doi.org/10.1093/bioadv/vbab007 Text en © The Author(s) 2021. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) ), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Applications Note Liu, Yichen Henry Grubbs, Griffin L Zhang, Lu Fang, Xiaodong Dill, David L Sidow, Arend Zhou, Xin Aquila_stLFR: diploid genome assembly based structural variant calling package for stLFR linked-reads |
title | Aquila_stLFR: diploid genome assembly based structural variant calling package for stLFR linked-reads |
title_full | Aquila_stLFR: diploid genome assembly based structural variant calling package for stLFR linked-reads |
title_fullStr | Aquila_stLFR: diploid genome assembly based structural variant calling package for stLFR linked-reads |
title_full_unstemmed | Aquila_stLFR: diploid genome assembly based structural variant calling package for stLFR linked-reads |
title_short | Aquila_stLFR: diploid genome assembly based structural variant calling package for stLFR linked-reads |
title_sort | aquila_stlfr: diploid genome assembly based structural variant calling package for stlfr linked-reads |
topic | Applications Note |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9710574/ https://www.ncbi.nlm.nih.gov/pubmed/36700103 http://dx.doi.org/10.1093/bioadv/vbab007 |
work_keys_str_mv | AT liuyichenhenry aquilastlfrdiploidgenomeassemblybasedstructuralvariantcallingpackageforstlfrlinkedreads AT grubbsgriffinl aquilastlfrdiploidgenomeassemblybasedstructuralvariantcallingpackageforstlfrlinkedreads AT zhanglu aquilastlfrdiploidgenomeassemblybasedstructuralvariantcallingpackageforstlfrlinkedreads AT fangxiaodong aquilastlfrdiploidgenomeassemblybasedstructuralvariantcallingpackageforstlfrlinkedreads AT dilldavidl aquilastlfrdiploidgenomeassemblybasedstructuralvariantcallingpackageforstlfrlinkedreads AT sidowarend aquilastlfrdiploidgenomeassemblybasedstructuralvariantcallingpackageforstlfrlinkedreads AT zhouxin aquilastlfrdiploidgenomeassemblybasedstructuralvariantcallingpackageforstlfrlinkedreads |