Fast and accurate matching of cellular barcodes across short-reads and long-reads of single-cell RNA-seq experiments
Single-cell RNA sequencing allows for characterizing the gene expression landscape at the cell type level. However, because of its use of short-reads, it is severely limited at detecting full-length features of transcripts such as alternative splicing. New library preparation techniques attempt to e...
Main authors: | Ebrahimi, Ghazal; Orabi, Baraa; Robinson, Meghan; Chauve, Cedric; Flannigan, Ryan; Hach, Faraz |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | Elsevier 2022 |
Subjects: | |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9209721/ https://www.ncbi.nlm.nih.gov/pubmed/35747387 http://dx.doi.org/10.1016/j.isci.2022.104530 |
_version_ | 1784730008144576512 |
---|---|
author | Ebrahimi, Ghazal; Orabi, Baraa; Robinson, Meghan; Chauve, Cedric; Flannigan, Ryan; Hach, Faraz |
author_facet | Ebrahimi, Ghazal; Orabi, Baraa; Robinson, Meghan; Chauve, Cedric; Flannigan, Ryan; Hach, Faraz |
author_sort | Ebrahimi, Ghazal |
collection | PubMed |
description | Single-cell RNA sequencing allows for characterizing the gene expression landscape at the cell type level. However, because of its use of short-reads, it is severely limited at detecting full-length features of transcripts such as alternative splicing. New library preparation techniques attempt to extend single-cell sequencing by utilizing both long-reads and short-reads. These techniques split the library material, after it is tagged with cellular barcodes, into two pools: one for short-read sequencing and one for long-read sequencing. However, the challenge of utilizing these techniques is that they require matching the cellular barcodes sequenced by the erroneous long-reads to the cellular barcodes detected by the short-reads. To overcome this challenge, we introduce scTagger, a computational method to match cellular barcodes data from long-reads and short-reads. We tested scTagger against another state-of-the-art tool on both real and simulated datasets, and we demonstrate that scTagger has both significantly better accuracy and time efficiency. |
format | Online Article Text |
id | pubmed-9209721 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Elsevier |
record_format | MEDLINE/PubMed |
spelling | pubmed-9209721 2022-06-22 Fast and accurate matching of cellular barcodes across short-reads and long-reads of single-cell RNA-seq experiments Ebrahimi, Ghazal Orabi, Baraa Robinson, Meghan Chauve, Cedric Flannigan, Ryan Hach, Faraz iScience Article Single-cell RNA sequencing allows for characterizing the gene expression landscape at the cell type level. However, because of its use of short-reads, it is severely limited at detecting full-length features of transcripts such as alternative splicing. New library preparation techniques attempt to extend single-cell sequencing by utilizing both long-reads and short-reads. These techniques split the library material, after it is tagged with cellular barcodes, into two pools: one for short-read sequencing and one for long-read sequencing. However, the challenge of utilizing these techniques is that they require matching the cellular barcodes sequenced by the erroneous long-reads to the cellular barcodes detected by the short-reads. To overcome this challenge, we introduce scTagger, a computational method to match cellular barcodes data from long-reads and short-reads. We tested scTagger against another state-of-the-art tool on both real and simulated datasets, and we demonstrate that scTagger has both significantly better accuracy and time efficiency. Elsevier 2022-06-07 /pmc/articles/PMC9209721/ /pubmed/35747387 http://dx.doi.org/10.1016/j.isci.2022.104530 Text en © 2022 The Authors https://creativecommons.org/licenses/by-nc-nd/4.0/ This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/). |
spellingShingle | Article Ebrahimi, Ghazal Orabi, Baraa Robinson, Meghan Chauve, Cedric Flannigan, Ryan Hach, Faraz Fast and accurate matching of cellular barcodes across short-reads and long-reads of single-cell RNA-seq experiments |
title | Fast and accurate matching of cellular barcodes across short-reads and long-reads of single-cell RNA-seq experiments |
title_full | Fast and accurate matching of cellular barcodes across short-reads and long-reads of single-cell RNA-seq experiments |
title_fullStr | Fast and accurate matching of cellular barcodes across short-reads and long-reads of single-cell RNA-seq experiments |
title_full_unstemmed | Fast and accurate matching of cellular barcodes across short-reads and long-reads of single-cell RNA-seq experiments |
title_short | Fast and accurate matching of cellular barcodes across short-reads and long-reads of single-cell RNA-seq experiments |
title_sort | fast and accurate matching of cellular barcodes across short-reads and long-reads of single-cell rna-seq experiments |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9209721/ https://www.ncbi.nlm.nih.gov/pubmed/35747387 http://dx.doi.org/10.1016/j.isci.2022.104530 |
work_keys_str_mv | AT ebrahimighazal fastandaccuratematchingofcellularbarcodesacrossshortreadsandlongreadsofsinglecellrnaseqexperiments AT orabibaraa fastandaccuratematchingofcellularbarcodesacrossshortreadsandlongreadsofsinglecellrnaseqexperiments AT robinsonmeghan fastandaccuratematchingofcellularbarcodesacrossshortreadsandlongreadsofsinglecellrnaseqexperiments AT chauvecedric fastandaccuratematchingofcellularbarcodesacrossshortreadsandlongreadsofsinglecellrnaseqexperiments AT flanniganryan fastandaccuratematchingofcellularbarcodesacrossshortreadsandlongreadsofsinglecellrnaseqexperiments AT hachfaraz fastandaccuratematchingofcellularbarcodesacrossshortreadsandlongreadsofsinglecellrnaseqexperiments |
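
The description above frames the core task as matching error-prone long-read barcodes to the barcodes detected from short reads. The sketch below illustrates that problem in its simplest form and is not scTagger's algorithm: it assigns a long-read barcode candidate to the unique closest short-read barcode by brute-force Levenshtein distance. The function names, the `max_dist` threshold, and the example sequences are all hypothetical, chosen only for illustration.

```python
from typing import Iterable, Optional


def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance via the classic two-row dynamic program."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # delete a character of a
                            curr[j - 1] + 1,      # insert a character of b
                            prev[j - 1] + cost))  # match or substitute
        prev = curr
    return prev[-1]


def match_barcode(candidate: str,
                  whitelist: Iterable[str],
                  max_dist: int = 2) -> Optional[str]:
    """Return the unique closest whitelist barcode within max_dist, else None.

    Hypothetical helper: `whitelist` stands in for the barcodes detected
    from short reads; `max_dist` is an assumed error tolerance.
    """
    best, best_dist, tie = None, max_dist + 1, False
    for bc in whitelist:
        d = edit_distance(candidate, bc)
        if d < best_dist:
            best, best_dist, tie = bc, d, False
        elif d == best_dist and best is not None:
            tie = True  # two barcodes are equally close: ambiguous
    return None if tie else best


# Made-up 16-nt barcodes standing in for a short-read barcode list.
short_read_barcodes = ["ACGTACGTACGTACGT", "TTTTCCCCGGGGAAAA"]
print(match_barcode("ACGTACGAACGTACGT", short_read_barcodes))  # one substitution
print(match_barcode("ACGTACGTACGTACG", short_read_barcodes))   # one deletion
print(match_barcode("GGGGGGGGGGGGGGGG", short_read_barcodes))  # too distant
```

Comparing every candidate against every barcode costs time proportional to the product of the two set sizes, so this form is only practical for small inputs. Ambiguous candidates are left unassigned rather than guessed.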