Cargando…

Aquila_stLFR: diploid genome assembly based structural variant calling package for stLFR linked-reads

MOTIVATION: Identifying structural variants (SVs) is critical in health and disease, however, detecting them remains a challenge. Several linked-read sequencing technologies, including 10X Genomics, TELL-Seq and single tube long fragment read (stLFR), have been recently developed as cost-effective a...

Descripción completa

Detalles Bibliográficos
Autores principales: Liu, Yichen Henry, Grubbs, Griffin L, Zhang, Lu, Fang, Xiaodong, Dill, David L, Sidow, Arend, Zhou, Xin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9710574/
https://www.ncbi.nlm.nih.gov/pubmed/36700103
http://dx.doi.org/10.1093/bioadv/vbab007
_version_ 1784841396513931264
author Liu, Yichen Henry
Grubbs, Griffin L
Zhang, Lu
Fang, Xiaodong
Dill, David L
Sidow, Arend
Zhou, Xin
author_facet Liu, Yichen Henry
Grubbs, Griffin L
Zhang, Lu
Fang, Xiaodong
Dill, David L
Sidow, Arend
Zhou, Xin
author_sort Liu, Yichen Henry
collection PubMed
description MOTIVATION: Identifying structural variants (SVs) is critical in health and disease, however, detecting them remains a challenge. Several linked-read sequencing technologies, including 10X Genomics, TELL-Seq and single tube long fragment read (stLFR), have been recently developed as cost-effective approaches to reconstruct multi-megabase haplotypes (phase blocks) from sequence data of a single sample. These technologies provide an optimal sequencing platform to characterize SVs, though few computational algorithms can utilize them. Thus, we developed Aquila_stLFR, an approach that resolves SVs through haplotype-based assembly of stLFR linked-reads. RESULTS: Aquila_stLFR first partitions long fragment reads into two haplotype-specific blocks with the assistance of the high-quality reference genome, by taking advantage of the potential phasing ability of the linked-read itself. Each haplotype is then assembled independently, to achieve a complete diploid assembly to finally reconstruct the genome-wide SVs. We benchmarked Aquila_stLFR on a well-studied sample, NA24385, and showed Aquila_stLFR can detect medium to large size deletions (50 bp–10 kb) with high sensitivity and medium-size insertions (50 bp–1 kb) with high specificity. AVAILABILITY AND IMPLEMENTATION: Source code and documentation are available on https://github.com/maiziex/Aquila_stLFR. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics Advances online.
format Online
Article
Text
id pubmed-9710574
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-97105742023-01-24 Aquila_stLFR: diploid genome assembly based structural variant calling package for stLFR linked-reads Liu, Yichen Henry Grubbs, Griffin L Zhang, Lu Fang, Xiaodong Dill, David L Sidow, Arend Zhou, Xin Bioinform Adv Applications Note MOTIVATION: Identifying structural variants (SVs) is critical in health and disease, however, detecting them remains a challenge. Several linked-read sequencing technologies, including 10X Genomics, TELL-Seq and single tube long fragment read (stLFR), have been recently developed as cost-effective approaches to reconstruct multi-megabase haplotypes (phase blocks) from sequence data of a single sample. These technologies provide an optimal sequencing platform to characterize SVs, though few computational algorithms can utilize them. Thus, we developed Aquila_stLFR, an approach that resolves SVs through haplotype-based assembly of stLFR linked-reads. RESULTS: Aquila_stLFR first partitions long fragment reads into two haplotype-specific blocks with the assistance of the high-quality reference genome, by taking advantage of the potential phasing ability of the linked-read itself. Each haplotype is then assembled independently, to achieve a complete diploid assembly to finally reconstruct the genome-wide SVs. We benchmarked Aquila_stLFR on a well-studied sample, NA24385, and showed Aquila_stLFR can detect medium to large size deletions (50 bp–10 kb) with high sensitivity and medium-size insertions (50 bp–1 kb) with high specificity. AVAILABILITY AND IMPLEMENTATION: Source code and documentation are available on https://github.com/maiziex/Aquila_stLFR. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics Advances online. Oxford University Press 2021-06-16 /pmc/articles/PMC9710574/ /pubmed/36700103 http://dx.doi.org/10.1093/bioadv/vbab007 Text en © The Author(s) 2021. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) ), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Applications Note
Liu, Yichen Henry
Grubbs, Griffin L
Zhang, Lu
Fang, Xiaodong
Dill, David L
Sidow, Arend
Zhou, Xin
Aquila_stLFR: diploid genome assembly based structural variant calling package for stLFR linked-reads
title Aquila_stLFR: diploid genome assembly based structural variant calling package for stLFR linked-reads
title_full Aquila_stLFR: diploid genome assembly based structural variant calling package for stLFR linked-reads
title_fullStr Aquila_stLFR: diploid genome assembly based structural variant calling package for stLFR linked-reads
title_full_unstemmed Aquila_stLFR: diploid genome assembly based structural variant calling package for stLFR linked-reads
title_short Aquila_stLFR: diploid genome assembly based structural variant calling package for stLFR linked-reads
title_sort aquila_stlfr: diploid genome assembly based structural variant calling package for stlfr linked-reads
topic Applications Note
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9710574/
https://www.ncbi.nlm.nih.gov/pubmed/36700103
http://dx.doi.org/10.1093/bioadv/vbab007
work_keys_str_mv AT liuyichenhenry aquilastlfrdiploidgenomeassemblybasedstructuralvariantcallingpackageforstlfrlinkedreads
AT grubbsgriffinl aquilastlfrdiploidgenomeassemblybasedstructuralvariantcallingpackageforstlfrlinkedreads
AT zhanglu aquilastlfrdiploidgenomeassemblybasedstructuralvariantcallingpackageforstlfrlinkedreads
AT fangxiaodong aquilastlfrdiploidgenomeassemblybasedstructuralvariantcallingpackageforstlfrlinkedreads
AT dilldavidl aquilastlfrdiploidgenomeassemblybasedstructuralvariantcallingpackageforstlfrlinkedreads
AT sidowarend aquilastlfrdiploidgenomeassemblybasedstructuralvariantcallingpackageforstlfrlinkedreads
AT zhouxin aquilastlfrdiploidgenomeassemblybasedstructuralvariantcallingpackageforstlfrlinkedreads