Cargando…

Computational comparison of common event-based differential splicing tools: practical considerations for laboratory researchers

BACKGROUND: Computational tools analyzing RNA-sequencing data have boosted alternative splicing research by identifying and assessing differentially spliced genes. However, common alternative splicing analysis tools differ substantially in their statistical analyses and general performance. This rep...

Descripción completa

Detalles Bibliográficos
Autores principales: Muller, Ittai B., Meijers, Stijn, Kampstra, Peter, van Dijk, Steven, van Elswijk, Michel, Lin, Marry, Wojtuszkiewicz, Anna M., Jansen, Gerrit, de Jonge, Robert, Cloos, Jacqueline
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8236165/
https://www.ncbi.nlm.nih.gov/pubmed/34174808
http://dx.doi.org/10.1186/s12859-021-04263-9
Descripción
Sumario:BACKGROUND: Computational tools analyzing RNA-sequencing data have boosted alternative splicing research by identifying and assessing differentially spliced genes. However, common alternative splicing analysis tools differ substantially in their statistical analyses and general performance. This report compares the computational performance (CPU utilization and RAM usage) of three event-level splicing tools; rMATS, MISO, and SUPPA2. Additionally, concordance between tool outputs was investigated. RESULTS: Log-linear relations were found between job times and dataset size in all splicing tools and all virtual machine (VM) configurations. MISO had the highest job times for all analyses, irrespective of VM size, while MISO analyses also exceeded maximum CPU utilization on all VM sizes. rMATS and SUPPA2 load averages were relatively low in both size and replicate comparisons, not nearing maximum CPU utilization in the VM simulating the lowest computational power (D2 VM). RAM usage in rMATS and SUPPA2 did not exceed 20% of maximum RAM in both size and replicate comparisons while MISO reached maximum RAM usage in D2 VM analyses for input size. Correlation coefficients of differential splicing analyses showed high correlation (β > 80%) between different tool outputs with the exception of comparisons of retained intron (RI) events between rMATS/MISO and rMATS/SUPPA2 (β < 60%). CONCLUSIONS: Prior to RNA-seq analyses, users should consider job time, amount of replicates and splice event type of interest to determine the optimal alternative splicing tool. In general, rMATS is superior to both MISO and SUPPA2 in computational performance. Analysis outputs show high concordance between tools, with the exception of RI events. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-021-04263-9.