Cargando…

Fast and accurate matching of cellular barcodes across short-reads and long-reads of single-cell RNA-seq experiments

Single-cell RNA sequencing allows for characterizing the gene expression landscape at the cell type level. However, because of its use of short-reads, it is severely limited at detecting full-length features of transcripts such as alternative splicing. New library preparation techniques attempt to e...

Descripción completa

Detalles Bibliográficos
Autores principales: Ebrahimi, Ghazal, Orabi, Baraa, Robinson, Meghan, Chauve, Cedric, Flannigan, Ryan, Hach, Faraz
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9209721/
https://www.ncbi.nlm.nih.gov/pubmed/35747387
http://dx.doi.org/10.1016/j.isci.2022.104530
_version_ 1784730008144576512
author Ebrahimi, Ghazal
Orabi, Baraa
Robinson, Meghan
Chauve, Cedric
Flannigan, Ryan
Hach, Faraz
author_facet Ebrahimi, Ghazal
Orabi, Baraa
Robinson, Meghan
Chauve, Cedric
Flannigan, Ryan
Hach, Faraz
author_sort Ebrahimi, Ghazal
collection PubMed
description Single-cell RNA sequencing allows for characterizing the gene expression landscape at the cell type level. However, because of its use of short-reads, it is severely limited at detecting full-length features of transcripts such as alternative splicing. New library preparation techniques attempt to extend single-cell sequencing by utilizing both long-reads and short-reads. These techniques split the library material, after it is tagged with cellular barcodes, into two pools: one for short-read sequencing and one for long-read sequencing. However, the challenge of utilizing these techniques is that they require matching the cellular barcodes sequenced by the erroneous long-reads to the cellular barcodes detected by the short-reads. To overcome this challenge, we introduce scTagger, a computational method to match cellular barcodes data from long-reads and short-reads. We tested scTagger against another state-of-the-art tool on both real and simulated datasets, and we demonstrate that scTagger has both significantly better accuracy and time efficiency.
format Online
Article
Text
id pubmed-9209721
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-92097212022-06-22 Fast and accurate matching of cellular barcodes across short-reads and long-reads of single-cell RNA-seq experiments Ebrahimi, Ghazal Orabi, Baraa Robinson, Meghan Chauve, Cedric Flannigan, Ryan Hach, Faraz iScience Article Single-cell RNA sequencing allows for characterizing the gene expression landscape at the cell type level. However, because of its use of short-reads, it is severely limited at detecting full-length features of transcripts such as alternative splicing. New library preparation techniques attempt to extend single-cell sequencing by utilizing both long-reads and short-reads. These techniques split the library material, after it is tagged with cellular barcodes, into two pools: one for short-read sequencing and one for long-read sequencing. However, the challenge of utilizing these techniques is that they require matching the cellular barcodes sequenced by the erroneous long-reads to the cellular barcodes detected by the short-reads. To overcome this challenge, we introduce scTagger, a computational method to match cellular barcodes data from long-reads and short-reads. We tested scTagger against another state-of-the-art tool on both real and simulated datasets, and we demonstrate that scTagger has both significantly better accuracy and time efficiency. Elsevier 2022-06-07 /pmc/articles/PMC9209721/ /pubmed/35747387 http://dx.doi.org/10.1016/j.isci.2022.104530 Text en © 2022 The Authors https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Article
Ebrahimi, Ghazal
Orabi, Baraa
Robinson, Meghan
Chauve, Cedric
Flannigan, Ryan
Hach, Faraz
Fast and accurate matching of cellular barcodes across short-reads and long-reads of single-cell RNA-seq experiments
title Fast and accurate matching of cellular barcodes across short-reads and long-reads of single-cell RNA-seq experiments
title_full Fast and accurate matching of cellular barcodes across short-reads and long-reads of single-cell RNA-seq experiments
title_fullStr Fast and accurate matching of cellular barcodes across short-reads and long-reads of single-cell RNA-seq experiments
title_full_unstemmed Fast and accurate matching of cellular barcodes across short-reads and long-reads of single-cell RNA-seq experiments
title_short Fast and accurate matching of cellular barcodes across short-reads and long-reads of single-cell RNA-seq experiments
title_sort fast and accurate matching of cellular barcodes across short-reads and long-reads of single-cell rna-seq experiments
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9209721/
https://www.ncbi.nlm.nih.gov/pubmed/35747387
http://dx.doi.org/10.1016/j.isci.2022.104530
work_keys_str_mv AT ebrahimighazal fastandaccuratematchingofcellularbarcodesacrossshortreadsandlongreadsofsinglecellrnaseqexperiments
AT orabibaraa fastandaccuratematchingofcellularbarcodesacrossshortreadsandlongreadsofsinglecellrnaseqexperiments
AT robinsonmeghan fastandaccuratematchingofcellularbarcodesacrossshortreadsandlongreadsofsinglecellrnaseqexperiments
AT chauvecedric fastandaccuratematchingofcellularbarcodesacrossshortreadsandlongreadsofsinglecellrnaseqexperiments
AT flanniganryan fastandaccuratematchingofcellularbarcodesacrossshortreadsandlongreadsofsinglecellrnaseqexperiments
AT hachfaraz fastandaccuratematchingofcellularbarcodesacrossshortreadsandlongreadsofsinglecellrnaseqexperiments