Cargando…

Investigating skewness to understand gene expression heterogeneity in large patient cohorts

BACKGROUND: Skewness is an under-utilized statistical measure that captures the degree of asymmetry in the distribution of any dataset. This study applied a new metric based on skewness to identify regulators or genes that have outlier expression in large patient cohorts. RESULTS: We investigated wh...

Descripción completa

Detalles Bibliográficos
Autores principales: Church, Benjamin V., Williams, Henry T., Mar, Jessica C.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6923883/
https://www.ncbi.nlm.nih.gov/pubmed/31861976
http://dx.doi.org/10.1186/s12859-019-3252-0
_version_ 1783481613211926528
author Church, Benjamin V.
Williams, Henry T.
Mar, Jessica C.
author_facet Church, Benjamin V.
Williams, Henry T.
Mar, Jessica C.
author_sort Church, Benjamin V.
collection PubMed
description BACKGROUND: Skewness is an under-utilized statistical measure that captures the degree of asymmetry in the distribution of any dataset. This study applied a new metric based on skewness to identify regulators or genes that have outlier expression in large patient cohorts. RESULTS: We investigated whether specific patterns of skewed expression were related to the enrichment of biological pathways or genomic properties like DNA methylation status. Our study used publicly available datasets that were generated using both RNA-sequencing and microarray technology platforms. For comparison, the datasets selected for this study also included different samples derived from control donors and cancer patients. When comparing the shift in expression skewness between cancer and control datasets, we observed an enrichment of pathways related to the immune function that reflects an increase towards positive skewness in the cancer relative to control datasets. A significant correlation was also detected between expression skewness and the top 500 genes corresponding to the most significant differential DNA methylation occurring in the promotor regions for four Cancer Genome Atlas cancer cohorts. CONCLUSIONS: Our results indicate that expression skewness can reveal new insights into transcription based on outlier and asymmetrical behaviour in large patient cohorts.
format Online
Article
Text
id pubmed-6923883
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-69238832019-12-30 Investigating skewness to understand gene expression heterogeneity in large patient cohorts Church, Benjamin V. Williams, Henry T. Mar, Jessica C. BMC Bioinformatics Research BACKGROUND: Skewness is an under-utilized statistical measure that captures the degree of asymmetry in the distribution of any dataset. This study applied a new metric based on skewness to identify regulators or genes that have outlier expression in large patient cohorts. RESULTS: We investigated whether specific patterns of skewed expression were related to the enrichment of biological pathways or genomic properties like DNA methylation status. Our study used publicly available datasets that were generated using both RNA-sequencing and microarray technology platforms. For comparison, the datasets selected for this study also included different samples derived from control donors and cancer patients. When comparing the shift in expression skewness between cancer and control datasets, we observed an enrichment of pathways related to the immune function that reflects an increase towards positive skewness in the cancer relative to control datasets. A significant correlation was also detected between expression skewness and the top 500 genes corresponding to the most significant differential DNA methylation occurring in the promotor regions for four Cancer Genome Atlas cancer cohorts. CONCLUSIONS: Our results indicate that expression skewness can reveal new insights into transcription based on outlier and asymmetrical behaviour in large patient cohorts. BioMed Central 2019-12-20 /pmc/articles/PMC6923883/ /pubmed/31861976 http://dx.doi.org/10.1186/s12859-019-3252-0 Text en © The Author(s) 2019 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Church, Benjamin V.
Williams, Henry T.
Mar, Jessica C.
Investigating skewness to understand gene expression heterogeneity in large patient cohorts
title Investigating skewness to understand gene expression heterogeneity in large patient cohorts
title_full Investigating skewness to understand gene expression heterogeneity in large patient cohorts
title_fullStr Investigating skewness to understand gene expression heterogeneity in large patient cohorts
title_full_unstemmed Investigating skewness to understand gene expression heterogeneity in large patient cohorts
title_short Investigating skewness to understand gene expression heterogeneity in large patient cohorts
title_sort investigating skewness to understand gene expression heterogeneity in large patient cohorts
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6923883/
https://www.ncbi.nlm.nih.gov/pubmed/31861976
http://dx.doi.org/10.1186/s12859-019-3252-0
work_keys_str_mv AT churchbenjaminv investigatingskewnesstounderstandgeneexpressionheterogeneityinlargepatientcohorts
AT williamshenryt investigatingskewnesstounderstandgeneexpressionheterogeneityinlargepatientcohorts
AT marjessicac investigatingskewnesstounderstandgeneexpressionheterogeneityinlargepatientcohorts