Cargando…
Investigating skewness to understand gene expression heterogeneity in large patient cohorts
BACKGROUND: Skewness is an under-utilized statistical measure that captures the degree of asymmetry in the distribution of any dataset. This study applied a new metric based on skewness to identify regulators or genes that have outlier expression in large patient cohorts. RESULTS: We investigated wh...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6923883/ https://www.ncbi.nlm.nih.gov/pubmed/31861976 http://dx.doi.org/10.1186/s12859-019-3252-0 |
_version_ | 1783481613211926528 |
---|---|
author | Church, Benjamin V. Williams, Henry T. Mar, Jessica C. |
author_facet | Church, Benjamin V. Williams, Henry T. Mar, Jessica C. |
author_sort | Church, Benjamin V. |
collection | PubMed |
description | BACKGROUND: Skewness is an under-utilized statistical measure that captures the degree of asymmetry in the distribution of any dataset. This study applied a new metric based on skewness to identify regulators or genes that have outlier expression in large patient cohorts. RESULTS: We investigated whether specific patterns of skewed expression were related to the enrichment of biological pathways or genomic properties like DNA methylation status. Our study used publicly available datasets that were generated using both RNA-sequencing and microarray technology platforms. For comparison, the datasets selected for this study also included different samples derived from control donors and cancer patients. When comparing the shift in expression skewness between cancer and control datasets, we observed an enrichment of pathways related to the immune function that reflects an increase towards positive skewness in the cancer relative to control datasets. A significant correlation was also detected between expression skewness and the top 500 genes corresponding to the most significant differential DNA methylation occurring in the promotor regions for four Cancer Genome Atlas cancer cohorts. CONCLUSIONS: Our results indicate that expression skewness can reveal new insights into transcription based on outlier and asymmetrical behaviour in large patient cohorts. |
format | Online Article Text |
id | pubmed-6923883 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-69238832019-12-30 Investigating skewness to understand gene expression heterogeneity in large patient cohorts Church, Benjamin V. Williams, Henry T. Mar, Jessica C. BMC Bioinformatics Research BACKGROUND: Skewness is an under-utilized statistical measure that captures the degree of asymmetry in the distribution of any dataset. This study applied a new metric based on skewness to identify regulators or genes that have outlier expression in large patient cohorts. RESULTS: We investigated whether specific patterns of skewed expression were related to the enrichment of biological pathways or genomic properties like DNA methylation status. Our study used publicly available datasets that were generated using both RNA-sequencing and microarray technology platforms. For comparison, the datasets selected for this study also included different samples derived from control donors and cancer patients. When comparing the shift in expression skewness between cancer and control datasets, we observed an enrichment of pathways related to the immune function that reflects an increase towards positive skewness in the cancer relative to control datasets. A significant correlation was also detected between expression skewness and the top 500 genes corresponding to the most significant differential DNA methylation occurring in the promotor regions for four Cancer Genome Atlas cancer cohorts. CONCLUSIONS: Our results indicate that expression skewness can reveal new insights into transcription based on outlier and asymmetrical behaviour in large patient cohorts. BioMed Central 2019-12-20 /pmc/articles/PMC6923883/ /pubmed/31861976 http://dx.doi.org/10.1186/s12859-019-3252-0 Text en © The Author(s) 2019 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Church, Benjamin V. Williams, Henry T. Mar, Jessica C. Investigating skewness to understand gene expression heterogeneity in large patient cohorts |
title | Investigating skewness to understand gene expression heterogeneity in large patient cohorts |
title_full | Investigating skewness to understand gene expression heterogeneity in large patient cohorts |
title_fullStr | Investigating skewness to understand gene expression heterogeneity in large patient cohorts |
title_full_unstemmed | Investigating skewness to understand gene expression heterogeneity in large patient cohorts |
title_short | Investigating skewness to understand gene expression heterogeneity in large patient cohorts |
title_sort | investigating skewness to understand gene expression heterogeneity in large patient cohorts |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6923883/ https://www.ncbi.nlm.nih.gov/pubmed/31861976 http://dx.doi.org/10.1186/s12859-019-3252-0 |
work_keys_str_mv | AT churchbenjaminv investigatingskewnesstounderstandgeneexpressionheterogeneityinlargepatientcohorts AT williamshenryt investigatingskewnesstounderstandgeneexpressionheterogeneityinlargepatientcohorts AT marjessicac investigatingskewnesstounderstandgeneexpressionheterogeneityinlargepatientcohorts |