Cargando…

Systematic benchmarking of statistical methods to assess differential expression of circular RNAs

Circular RNAs (circRNAs) are covalently closed transcripts involved in critical regulatory axes, cancer pathways and disease mechanisms. CircRNA expression measured with RNA-seq has particular characteristics that might hamper the performance of standard biostatistical differential expression assess...

Descripción completa

Detalles Bibliográficos
Autores principales: Buratin, Alessia, Bortoluzzi, Stefania, Gaffo, Enrico
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9851295/
https://www.ncbi.nlm.nih.gov/pubmed/36592056
http://dx.doi.org/10.1093/bib/bbac612
_version_ 1784872366171488256
author Buratin, Alessia
Bortoluzzi, Stefania
Gaffo, Enrico
author_facet Buratin, Alessia
Bortoluzzi, Stefania
Gaffo, Enrico
author_sort Buratin, Alessia
collection PubMed
description Circular RNAs (circRNAs) are covalently closed transcripts involved in critical regulatory axes, cancer pathways and disease mechanisms. CircRNA expression measured with RNA-seq has particular characteristics that might hamper the performance of standard biostatistical differential expression assessment methods (DEMs). We compared 38 DEM pipelines configured to fit circRNA expression data’s statistical properties, including bulk RNA-seq, single-cell RNA-seq (scRNA-seq) and metagenomics DEMs. The DEMs performed poorly on data sets of typical size. Widely used DEMs, such as DESeq2, edgeR and Limma-Voom, gave scarce results, unreliable predictions or even contravened the expected behaviour with some parameter configurations. Limma-Voom achieved the most consistent performance throughout different benchmark data sets and, as well as SAMseq, reasonably balanced false discovery rate (FDR) and recall rate. Interestingly, a few scRNA-seq DEMs obtained results comparable with the best-performing bulk RNA-seq tools. Almost all DEMs’ performance improved when increasing the number of replicates. CircRNA expression studies require careful design, choice of DEM and DEM configuration. This analysis can guide scientists in selecting the appropriate tools to investigate circRNA differential expression with RNA-seq experiments.
format Online
Article
Text
id pubmed-9851295
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-98512952023-01-20 Systematic benchmarking of statistical methods to assess differential expression of circular RNAs Buratin, Alessia Bortoluzzi, Stefania Gaffo, Enrico Brief Bioinform Problem Solving Protocol Circular RNAs (circRNAs) are covalently closed transcripts involved in critical regulatory axes, cancer pathways and disease mechanisms. CircRNA expression measured with RNA-seq has particular characteristics that might hamper the performance of standard biostatistical differential expression assessment methods (DEMs). We compared 38 DEM pipelines configured to fit circRNA expression data’s statistical properties, including bulk RNA-seq, single-cell RNA-seq (scRNA-seq) and metagenomics DEMs. The DEMs performed poorly on data sets of typical size. Widely used DEMs, such as DESeq2, edgeR and Limma-Voom, gave scarce results, unreliable predictions or even contravened the expected behaviour with some parameter configurations. Limma-Voom achieved the most consistent performance throughout different benchmark data sets and, as well as SAMseq, reasonably balanced false discovery rate (FDR) and recall rate. Interestingly, a few scRNA-seq DEMs obtained results comparable with the best-performing bulk RNA-seq tools. Almost all DEMs’ performance improved when increasing the number of replicates. CircRNA expression studies require careful design, choice of DEM and DEM configuration. This analysis can guide scientists in selecting the appropriate tools to investigate circRNA differential expression with RNA-seq experiments. Oxford University Press 2023-01-02 /pmc/articles/PMC9851295/ /pubmed/36592056 http://dx.doi.org/10.1093/bib/bbac612 Text en © The Author(s) 2023. Published by Oxford University Press. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Problem Solving Protocol
Buratin, Alessia
Bortoluzzi, Stefania
Gaffo, Enrico
Systematic benchmarking of statistical methods to assess differential expression of circular RNAs
title Systematic benchmarking of statistical methods to assess differential expression of circular RNAs
title_full Systematic benchmarking of statistical methods to assess differential expression of circular RNAs
title_fullStr Systematic benchmarking of statistical methods to assess differential expression of circular RNAs
title_full_unstemmed Systematic benchmarking of statistical methods to assess differential expression of circular RNAs
title_short Systematic benchmarking of statistical methods to assess differential expression of circular RNAs
title_sort systematic benchmarking of statistical methods to assess differential expression of circular rnas
topic Problem Solving Protocol
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9851295/
https://www.ncbi.nlm.nih.gov/pubmed/36592056
http://dx.doi.org/10.1093/bib/bbac612
work_keys_str_mv AT buratinalessia systematicbenchmarkingofstatisticalmethodstoassessdifferentialexpressionofcircularrnas
AT bortoluzzistefania systematicbenchmarkingofstatisticalmethodstoassessdifferentialexpressionofcircularrnas
AT gaffoenrico systematicbenchmarkingofstatisticalmethodstoassessdifferentialexpressionofcircularrnas