Cargando…

DiCoExpress: a tool to process multifactorial RNAseq experiments from quality controls to co-expression analysis through differential analysis based on contrasts inside GLM models

BACKGROUND: RNAseq is nowadays the method of choice for transcriptome analysis. In the last decades, a high number of statistical methods, and associated bioinformatics tools, for RNAseq analysis were developed. More recently, statistical studies realised neutral comparison studies using benchmark d...

Descripción completa

Detalles Bibliográficos
Autores principales: Lambert, Ilana, Paysant-Le Roux, Christine, Colella, Stefano, Martin-Magniette, Marie-Laure
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7216733/
https://www.ncbi.nlm.nih.gov/pubmed/32426025
http://dx.doi.org/10.1186/s13007-020-00611-7
_version_ 1783532473179701248
author Lambert, Ilana
Paysant-Le Roux, Christine
Colella, Stefano
Martin-Magniette, Marie-Laure
author_facet Lambert, Ilana
Paysant-Le Roux, Christine
Colella, Stefano
Martin-Magniette, Marie-Laure
author_sort Lambert, Ilana
collection PubMed
description BACKGROUND: RNAseq is nowadays the method of choice for transcriptome analysis. In the last decades, a high number of statistical methods, and associated bioinformatics tools, for RNAseq analysis were developed. More recently, statistical studies realised neutral comparison studies using benchmark datasets, shedding light on the most appropriate approaches for RNAseq data analysis. RESULTS: DiCoExpress is a script-based tool implemented in R that includes methods chosen based on their performance in neutral comparisons studies. DiCoExpress uses pre-existing R packages including FactoMineR, edgeR and coseq, to perform quality control, differential, and co-expression analysis of RNAseq data. Users can perform the full analysis, providing a mapped read expression data file and a file containing the information on the experimental design. Following the quality control step, the user can move on to the differential expression analysis performed using generalized linear models thanks to the automated contrast writing function. A co-expression analysis is implemented using the coseq package. Lists of differentially expressed genes and identified co-expression clusters are automatically analyzed for enrichment of annotations provided by the user. We used DiCoExpress to analyze a publicly available RNAseq dataset on the transcriptional response of Brassica napus L. to silicon treatment in plant roots and mature leaves. This dataset, including two biological factors and three replicates for each condition, allowed us to demonstrate in a tutorial all the features of DiCoExpress. CONCLUSIONS: DiCoExpress is an R script-based tool allowing users to perform a full RNAseq analysis from quality controls to co-expression analysis through differential analysis based on contrasts inside generalized linear models. DiCoExpress focuses on the statistical modelling of gene expression according to the experimental design and facilitates the data analysis leading the biological interpretation of the results.
format Online
Article
Text
id pubmed-7216733
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-72167332020-05-18 DiCoExpress: a tool to process multifactorial RNAseq experiments from quality controls to co-expression analysis through differential analysis based on contrasts inside GLM models Lambert, Ilana Paysant-Le Roux, Christine Colella, Stefano Martin-Magniette, Marie-Laure Plant Methods Software BACKGROUND: RNAseq is nowadays the method of choice for transcriptome analysis. In the last decades, a high number of statistical methods, and associated bioinformatics tools, for RNAseq analysis were developed. More recently, statistical studies realised neutral comparison studies using benchmark datasets, shedding light on the most appropriate approaches for RNAseq data analysis. RESULTS: DiCoExpress is a script-based tool implemented in R that includes methods chosen based on their performance in neutral comparisons studies. DiCoExpress uses pre-existing R packages including FactoMineR, edgeR and coseq, to perform quality control, differential, and co-expression analysis of RNAseq data. Users can perform the full analysis, providing a mapped read expression data file and a file containing the information on the experimental design. Following the quality control step, the user can move on to the differential expression analysis performed using generalized linear models thanks to the automated contrast writing function. A co-expression analysis is implemented using the coseq package. Lists of differentially expressed genes and identified co-expression clusters are automatically analyzed for enrichment of annotations provided by the user. We used DiCoExpress to analyze a publicly available RNAseq dataset on the transcriptional response of Brassica napus L. to silicon treatment in plant roots and mature leaves. This dataset, including two biological factors and three replicates for each condition, allowed us to demonstrate in a tutorial all the features of DiCoExpress. CONCLUSIONS: DiCoExpress is an R script-based tool allowing users to perform a full RNAseq analysis from quality controls to co-expression analysis through differential analysis based on contrasts inside generalized linear models. DiCoExpress focuses on the statistical modelling of gene expression according to the experimental design and facilitates the data analysis leading the biological interpretation of the results. BioMed Central 2020-05-12 /pmc/articles/PMC7216733/ /pubmed/32426025 http://dx.doi.org/10.1186/s13007-020-00611-7 Text en © The Author(s) 2020 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Software
Lambert, Ilana
Paysant-Le Roux, Christine
Colella, Stefano
Martin-Magniette, Marie-Laure
DiCoExpress: a tool to process multifactorial RNAseq experiments from quality controls to co-expression analysis through differential analysis based on contrasts inside GLM models
title DiCoExpress: a tool to process multifactorial RNAseq experiments from quality controls to co-expression analysis through differential analysis based on contrasts inside GLM models
title_full DiCoExpress: a tool to process multifactorial RNAseq experiments from quality controls to co-expression analysis through differential analysis based on contrasts inside GLM models
title_fullStr DiCoExpress: a tool to process multifactorial RNAseq experiments from quality controls to co-expression analysis through differential analysis based on contrasts inside GLM models
title_full_unstemmed DiCoExpress: a tool to process multifactorial RNAseq experiments from quality controls to co-expression analysis through differential analysis based on contrasts inside GLM models
title_short DiCoExpress: a tool to process multifactorial RNAseq experiments from quality controls to co-expression analysis through differential analysis based on contrasts inside GLM models
title_sort dicoexpress: a tool to process multifactorial rnaseq experiments from quality controls to co-expression analysis through differential analysis based on contrasts inside glm models
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7216733/
https://www.ncbi.nlm.nih.gov/pubmed/32426025
http://dx.doi.org/10.1186/s13007-020-00611-7
work_keys_str_mv AT lambertilana dicoexpressatooltoprocessmultifactorialrnaseqexperimentsfromqualitycontrolstocoexpressionanalysisthroughdifferentialanalysisbasedoncontrastsinsideglmmodels
AT paysantlerouxchristine dicoexpressatooltoprocessmultifactorialrnaseqexperimentsfromqualitycontrolstocoexpressionanalysisthroughdifferentialanalysisbasedoncontrastsinsideglmmodels
AT colellastefano dicoexpressatooltoprocessmultifactorialrnaseqexperimentsfromqualitycontrolstocoexpressionanalysisthroughdifferentialanalysisbasedoncontrastsinsideglmmodels
AT martinmagniettemarielaure dicoexpressatooltoprocessmultifactorialrnaseqexperimentsfromqualitycontrolstocoexpressionanalysisthroughdifferentialanalysisbasedoncontrastsinsideglmmodels