Cargando…

Linked-read sequencing for detecting short tandem repeat expansions

Detection of short tandem repeat (STR) expansions with standard short-read sequencing is challenging due to the difficulty in mapping multicopy repeat sequences. In this study, we explored how the long-range sequence information of barcode linked-read sequencing (BLRS) can be leveraged to improve re...

Descripción completa

Detalles Bibliográficos
Autores principales: Chiu, Readman, Rajan-Babu, Indhu-Shree, Birol, Inanc, Friedman, Jan M.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9174224/
https://www.ncbi.nlm.nih.gov/pubmed/35672336
http://dx.doi.org/10.1038/s41598-022-13024-4
_version_ 1784722192816144384
author Chiu, Readman
Rajan-Babu, Indhu-Shree
Birol, Inanc
Friedman, Jan M.
author_facet Chiu, Readman
Rajan-Babu, Indhu-Shree
Birol, Inanc
Friedman, Jan M.
author_sort Chiu, Readman
collection PubMed
description Detection of short tandem repeat (STR) expansions with standard short-read sequencing is challenging due to the difficulty in mapping multicopy repeat sequences. In this study, we explored how the long-range sequence information of barcode linked-read sequencing (BLRS) can be leveraged to improve repeat-read detection. We also devised a novel algorithm using BLRS barcodes for distance estimation and evaluated its application for STR genotyping. Both approaches were designed for genotyping large expansions (> 1 kb) that cannot be sized accurately by existing methods. Using simulated and experimental data of genomes with STR expansions from multiple BLRS platforms, we validated the utility of barcode and phasing information in attaining better STR genotypes compared to standard short-read sequencing. Although the coverage bias of extremely GC-rich STRs is an important limitation of BLRS, BLRS is an effective strategy for genotyping many other STR loci.
format Online
Article
Text
id pubmed-9174224
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-91742242022-06-09 Linked-read sequencing for detecting short tandem repeat expansions Chiu, Readman Rajan-Babu, Indhu-Shree Birol, Inanc Friedman, Jan M. Sci Rep Article Detection of short tandem repeat (STR) expansions with standard short-read sequencing is challenging due to the difficulty in mapping multicopy repeat sequences. In this study, we explored how the long-range sequence information of barcode linked-read sequencing (BLRS) can be leveraged to improve repeat-read detection. We also devised a novel algorithm using BLRS barcodes for distance estimation and evaluated its application for STR genotyping. Both approaches were designed for genotyping large expansions (> 1 kb) that cannot be sized accurately by existing methods. Using simulated and experimental data of genomes with STR expansions from multiple BLRS platforms, we validated the utility of barcode and phasing information in attaining better STR genotypes compared to standard short-read sequencing. Although the coverage bias of extremely GC-rich STRs is an important limitation of BLRS, BLRS is an effective strategy for genotyping many other STR loci. Nature Publishing Group UK 2022-06-07 /pmc/articles/PMC9174224/ /pubmed/35672336 http://dx.doi.org/10.1038/s41598-022-13024-4 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Chiu, Readman
Rajan-Babu, Indhu-Shree
Birol, Inanc
Friedman, Jan M.
Linked-read sequencing for detecting short tandem repeat expansions
title Linked-read sequencing for detecting short tandem repeat expansions
title_full Linked-read sequencing for detecting short tandem repeat expansions
title_fullStr Linked-read sequencing for detecting short tandem repeat expansions
title_full_unstemmed Linked-read sequencing for detecting short tandem repeat expansions
title_short Linked-read sequencing for detecting short tandem repeat expansions
title_sort linked-read sequencing for detecting short tandem repeat expansions
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9174224/
https://www.ncbi.nlm.nih.gov/pubmed/35672336
http://dx.doi.org/10.1038/s41598-022-13024-4
work_keys_str_mv AT chiureadman linkedreadsequencingfordetectingshorttandemrepeatexpansions
AT rajanbabuindhushree linkedreadsequencingfordetectingshorttandemrepeatexpansions
AT birolinanc linkedreadsequencingfordetectingshorttandemrepeatexpansions
AT friedmanjanm linkedreadsequencingfordetectingshorttandemrepeatexpansions