Cargando…
The long and the short of it: unlocking nanopore long-read RNA sequencing data with short-read differential expression analysis tools
Application of Oxford Nanopore Technologies’ long-read sequencing platform to transcriptomic analysis is increasing in popularity. However, such analysis can be challenging due to the high sequence error and small library sizes, which decreases quantification accuracy and reduces power for statistic...
Autores principales: | , , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8074342/ https://www.ncbi.nlm.nih.gov/pubmed/33937765 http://dx.doi.org/10.1093/nargab/lqab028 |
_version_ | 1783684333649788928 |
---|---|
author | Dong, Xueyi Tian, Luyi Gouil, Quentin Kariyawasam, Hasaru Su, Shian De Paoli-Iseppi, Ricardo Prawer, Yair David Joseph Clark, Michael B Breslin, Kelsey Iminitoff, Megan Blewitt, Marnie E Law, Charity W Ritchie, Matthew E |
author_facet | Dong, Xueyi Tian, Luyi Gouil, Quentin Kariyawasam, Hasaru Su, Shian De Paoli-Iseppi, Ricardo Prawer, Yair David Joseph Clark, Michael B Breslin, Kelsey Iminitoff, Megan Blewitt, Marnie E Law, Charity W Ritchie, Matthew E |
author_sort | Dong, Xueyi |
collection | PubMed |
description | Application of Oxford Nanopore Technologies’ long-read sequencing platform to transcriptomic analysis is increasing in popularity. However, such analysis can be challenging due to the high sequence error and small library sizes, which decreases quantification accuracy and reduces power for statistical testing. Here, we report the analysis of two nanopore RNA-seq datasets with the goal of obtaining gene- and isoform-level differential expression information. A dataset of synthetic, spliced, spike-in RNAs (‘sequins’) as well as a mouse neural stem cell dataset from samples with a null mutation of the epigenetic regulator Smchd1 was analysed using a mix of long-read specific tools for preprocessing together with established short-read RNA-seq methods for downstream analysis. We used limma-voom to perform differential gene expression analysis, and the novel FLAMES pipeline to perform isoform identification and quantification, followed by DRIMSeq and limma-diffSplice (with stageR) to perform differential transcript usage analysis. We compared results from the sequins dataset to the ground truth, and results of the mouse dataset to a previous short-read study on equivalent samples. Overall, our work shows that transcriptomic analysis of long-read nanopore data using long-read specific preprocessing methods together with short-read differential expression methods and software that are already in wide use can yield meaningful results. |
format | Online Article Text |
id | pubmed-8074342 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-80743422021-04-29 The long and the short of it: unlocking nanopore long-read RNA sequencing data with short-read differential expression analysis tools Dong, Xueyi Tian, Luyi Gouil, Quentin Kariyawasam, Hasaru Su, Shian De Paoli-Iseppi, Ricardo Prawer, Yair David Joseph Clark, Michael B Breslin, Kelsey Iminitoff, Megan Blewitt, Marnie E Law, Charity W Ritchie, Matthew E NAR Genom Bioinform APP Notes Application of Oxford Nanopore Technologies’ long-read sequencing platform to transcriptomic analysis is increasing in popularity. However, such analysis can be challenging due to the high sequence error and small library sizes, which decreases quantification accuracy and reduces power for statistical testing. Here, we report the analysis of two nanopore RNA-seq datasets with the goal of obtaining gene- and isoform-level differential expression information. A dataset of synthetic, spliced, spike-in RNAs (‘sequins’) as well as a mouse neural stem cell dataset from samples with a null mutation of the epigenetic regulator Smchd1 was analysed using a mix of long-read specific tools for preprocessing together with established short-read RNA-seq methods for downstream analysis. We used limma-voom to perform differential gene expression analysis, and the novel FLAMES pipeline to perform isoform identification and quantification, followed by DRIMSeq and limma-diffSplice (with stageR) to perform differential transcript usage analysis. We compared results from the sequins dataset to the ground truth, and results of the mouse dataset to a previous short-read study on equivalent samples. Overall, our work shows that transcriptomic analysis of long-read nanopore data using long-read specific preprocessing methods together with short-read differential expression methods and software that are already in wide use can yield meaningful results. Oxford University Press 2021-04-26 /pmc/articles/PMC8074342/ /pubmed/33937765 http://dx.doi.org/10.1093/nargab/lqab028 Text en © The Author(s) 2021. Published by Oxford University Press on behalf of NAR Genomics and Bioinformatics. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) ), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | APP Notes Dong, Xueyi Tian, Luyi Gouil, Quentin Kariyawasam, Hasaru Su, Shian De Paoli-Iseppi, Ricardo Prawer, Yair David Joseph Clark, Michael B Breslin, Kelsey Iminitoff, Megan Blewitt, Marnie E Law, Charity W Ritchie, Matthew E The long and the short of it: unlocking nanopore long-read RNA sequencing data with short-read differential expression analysis tools |
title | The long and the short of it: unlocking nanopore long-read RNA sequencing data with short-read differential expression analysis tools |
title_full | The long and the short of it: unlocking nanopore long-read RNA sequencing data with short-read differential expression analysis tools |
title_fullStr | The long and the short of it: unlocking nanopore long-read RNA sequencing data with short-read differential expression analysis tools |
title_full_unstemmed | The long and the short of it: unlocking nanopore long-read RNA sequencing data with short-read differential expression analysis tools |
title_short | The long and the short of it: unlocking nanopore long-read RNA sequencing data with short-read differential expression analysis tools |
title_sort | long and the short of it: unlocking nanopore long-read rna sequencing data with short-read differential expression analysis tools |
topic | APP Notes |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8074342/ https://www.ncbi.nlm.nih.gov/pubmed/33937765 http://dx.doi.org/10.1093/nargab/lqab028 |
work_keys_str_mv | AT dongxueyi thelongandtheshortofitunlockingnanoporelongreadrnasequencingdatawithshortreaddifferentialexpressionanalysistools AT tianluyi thelongandtheshortofitunlockingnanoporelongreadrnasequencingdatawithshortreaddifferentialexpressionanalysistools AT gouilquentin thelongandtheshortofitunlockingnanoporelongreadrnasequencingdatawithshortreaddifferentialexpressionanalysistools AT kariyawasamhasaru thelongandtheshortofitunlockingnanoporelongreadrnasequencingdatawithshortreaddifferentialexpressionanalysistools AT sushian thelongandtheshortofitunlockingnanoporelongreadrnasequencingdatawithshortreaddifferentialexpressionanalysistools AT depaoliiseppiricardo thelongandtheshortofitunlockingnanoporelongreadrnasequencingdatawithshortreaddifferentialexpressionanalysistools AT praweryairdavidjoseph thelongandtheshortofitunlockingnanoporelongreadrnasequencingdatawithshortreaddifferentialexpressionanalysistools AT clarkmichaelb thelongandtheshortofitunlockingnanoporelongreadrnasequencingdatawithshortreaddifferentialexpressionanalysistools AT breslinkelsey thelongandtheshortofitunlockingnanoporelongreadrnasequencingdatawithshortreaddifferentialexpressionanalysistools AT iminitoffmegan thelongandtheshortofitunlockingnanoporelongreadrnasequencingdatawithshortreaddifferentialexpressionanalysistools AT blewittmarniee thelongandtheshortofitunlockingnanoporelongreadrnasequencingdatawithshortreaddifferentialexpressionanalysistools AT lawcharityw thelongandtheshortofitunlockingnanoporelongreadrnasequencingdatawithshortreaddifferentialexpressionanalysistools AT ritchiematthewe thelongandtheshortofitunlockingnanoporelongreadrnasequencingdatawithshortreaddifferentialexpressionanalysistools AT dongxueyi longandtheshortofitunlockingnanoporelongreadrnasequencingdatawithshortreaddifferentialexpressionanalysistools AT tianluyi longandtheshortofitunlockingnanoporelongreadrnasequencingdatawithshortreaddifferentialexpressionanalysistools AT gouilquentin longandtheshortofitunlockingnanoporelongreadrnasequencingdatawithshortreaddifferentialexpressionanalysistools AT kariyawasamhasaru longandtheshortofitunlockingnanoporelongreadrnasequencingdatawithshortreaddifferentialexpressionanalysistools AT sushian longandtheshortofitunlockingnanoporelongreadrnasequencingdatawithshortreaddifferentialexpressionanalysistools AT depaoliiseppiricardo longandtheshortofitunlockingnanoporelongreadrnasequencingdatawithshortreaddifferentialexpressionanalysistools AT praweryairdavidjoseph longandtheshortofitunlockingnanoporelongreadrnasequencingdatawithshortreaddifferentialexpressionanalysistools AT clarkmichaelb longandtheshortofitunlockingnanoporelongreadrnasequencingdatawithshortreaddifferentialexpressionanalysistools AT breslinkelsey longandtheshortofitunlockingnanoporelongreadrnasequencingdatawithshortreaddifferentialexpressionanalysistools AT iminitoffmegan longandtheshortofitunlockingnanoporelongreadrnasequencingdatawithshortreaddifferentialexpressionanalysistools AT blewittmarniee longandtheshortofitunlockingnanoporelongreadrnasequencingdatawithshortreaddifferentialexpressionanalysistools AT lawcharityw longandtheshortofitunlockingnanoporelongreadrnasequencingdatawithshortreaddifferentialexpressionanalysistools AT ritchiematthewe longandtheshortofitunlockingnanoporelongreadrnasequencingdatawithshortreaddifferentialexpressionanalysistools |