Cargando…

A multivariate statistical test for differential expression analysis

Statistical tests of differential expression usually suffer from two problems. Firstly, their statistical power is often limited when applied to small and skewed data sets. Secondly, gene expression data are usually discretized by applying arbitrary criteria to limit the number of false positives. I...

Descripción completa

Detalles Bibliográficos
Autores principales: Tumminello, Michele, Bertolazzi, Giorgio, Sottile, Gianluca, Sciaraffa, Nicolina, Arancio, Walter, Coronnello, Claudia
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9117296/
https://www.ncbi.nlm.nih.gov/pubmed/35585166
http://dx.doi.org/10.1038/s41598-022-12246-w
_version_ 1784710302484398080
author Tumminello, Michele
Bertolazzi, Giorgio
Sottile, Gianluca
Sciaraffa, Nicolina
Arancio, Walter
Coronnello, Claudia
author_facet Tumminello, Michele
Bertolazzi, Giorgio
Sottile, Gianluca
Sciaraffa, Nicolina
Arancio, Walter
Coronnello, Claudia
author_sort Tumminello, Michele
collection PubMed
description Statistical tests of differential expression usually suffer from two problems. Firstly, their statistical power is often limited when applied to small and skewed data sets. Secondly, gene expression data are usually discretized by applying arbitrary criteria to limit the number of false positives. In this work, a new statistical test obtained from a convolution of multivariate hypergeometric distributions, the Hy-test, is proposed to address these issues. Hy-test has been carried out on transcriptomic data from breast and kidney cancer tissues, and it has been compared with other differential expression analysis methods. Hy-test allows implicit discretization of the expression profiles and is more selective in retrieving both differential expressed genes and terms of Gene Ontology. Hy-test can be adopted together with other tests to retrieve information that would remain hidden otherwise, e.g., terms of (1) cell cycle deregulation for breast cancer and (2) “programmed cell death” for kidney cancer.
format Online
Article
Text
id pubmed-9117296
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-91172962022-05-20 A multivariate statistical test for differential expression analysis Tumminello, Michele Bertolazzi, Giorgio Sottile, Gianluca Sciaraffa, Nicolina Arancio, Walter Coronnello, Claudia Sci Rep Article Statistical tests of differential expression usually suffer from two problems. Firstly, their statistical power is often limited when applied to small and skewed data sets. Secondly, gene expression data are usually discretized by applying arbitrary criteria to limit the number of false positives. In this work, a new statistical test obtained from a convolution of multivariate hypergeometric distributions, the Hy-test, is proposed to address these issues. Hy-test has been carried out on transcriptomic data from breast and kidney cancer tissues, and it has been compared with other differential expression analysis methods. Hy-test allows implicit discretization of the expression profiles and is more selective in retrieving both differential expressed genes and terms of Gene Ontology. Hy-test can be adopted together with other tests to retrieve information that would remain hidden otherwise, e.g., terms of (1) cell cycle deregulation for breast cancer and (2) “programmed cell death” for kidney cancer. Nature Publishing Group UK 2022-05-18 /pmc/articles/PMC9117296/ /pubmed/35585166 http://dx.doi.org/10.1038/s41598-022-12246-w Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Tumminello, Michele
Bertolazzi, Giorgio
Sottile, Gianluca
Sciaraffa, Nicolina
Arancio, Walter
Coronnello, Claudia
A multivariate statistical test for differential expression analysis
title A multivariate statistical test for differential expression analysis
title_full A multivariate statistical test for differential expression analysis
title_fullStr A multivariate statistical test for differential expression analysis
title_full_unstemmed A multivariate statistical test for differential expression analysis
title_short A multivariate statistical test for differential expression analysis
title_sort multivariate statistical test for differential expression analysis
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9117296/
https://www.ncbi.nlm.nih.gov/pubmed/35585166
http://dx.doi.org/10.1038/s41598-022-12246-w
work_keys_str_mv AT tumminellomichele amultivariatestatisticaltestfordifferentialexpressionanalysis
AT bertolazzigiorgio amultivariatestatisticaltestfordifferentialexpressionanalysis
AT sottilegianluca amultivariatestatisticaltestfordifferentialexpressionanalysis
AT sciaraffanicolina amultivariatestatisticaltestfordifferentialexpressionanalysis
AT aranciowalter amultivariatestatisticaltestfordifferentialexpressionanalysis
AT coronnelloclaudia amultivariatestatisticaltestfordifferentialexpressionanalysis
AT tumminellomichele multivariatestatisticaltestfordifferentialexpressionanalysis
AT bertolazzigiorgio multivariatestatisticaltestfordifferentialexpressionanalysis
AT sottilegianluca multivariatestatisticaltestfordifferentialexpressionanalysis
AT sciaraffanicolina multivariatestatisticaltestfordifferentialexpressionanalysis
AT aranciowalter multivariatestatisticaltestfordifferentialexpressionanalysis
AT coronnelloclaudia multivariatestatisticaltestfordifferentialexpressionanalysis