Cargando…

Self-Contained Statistical Analysis of Gene Sets

Microarrays are a powerful tool for studying differential gene expression. However, lists of many differentially expressed genes are often generated, and unraveling meaningful biological processes from the lists can be challenging. For this reason, investigators have sought to quantify the statistic...

Descripción completa

Detalles Bibliográficos
Autores principales: Torres, David J., Cannon, Judy L., Ricoy, Ulises M., Johnson, Christopher
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5053608/
https://www.ncbi.nlm.nih.gov/pubmed/27711232
http://dx.doi.org/10.1371/journal.pone.0163918
_version_ 1782458445190922240
author Torres, David J.
Cannon, Judy L.
Ricoy, Ulises M.
Johnson, Christopher
author_facet Torres, David J.
Cannon, Judy L.
Ricoy, Ulises M.
Johnson, Christopher
author_sort Torres, David J.
collection PubMed
description Microarrays are a powerful tool for studying differential gene expression. However, lists of many differentially expressed genes are often generated, and unraveling meaningful biological processes from the lists can be challenging. For this reason, investigators have sought to quantify the statistical probability of compiled gene sets rather than individual genes. The gene sets typically are organized around a biological theme or pathway. We compute correlations between different gene set tests and elect to use Fisher’s self-contained method for gene set analysis. We improve Fisher’s differential expression analysis of a gene set by limiting the p-value of an individual gene within the gene set to prevent a small percentage of genes from determining the statistical significance of the entire set. In addition, we also compute dependencies among genes within the set to determine which genes are statistically linked. The method is applied to T-ALL (T-lineage Acute Lymphoblastic Leukemia) to identify differentially expressed gene sets between T-ALL and normal patients and T-ALL and AML (Acute Myeloid Leukemia) patients.
format Online
Article
Text
id pubmed-5053608
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-50536082016-10-27 Self-Contained Statistical Analysis of Gene Sets Torres, David J. Cannon, Judy L. Ricoy, Ulises M. Johnson, Christopher PLoS One Research Article Microarrays are a powerful tool for studying differential gene expression. However, lists of many differentially expressed genes are often generated, and unraveling meaningful biological processes from the lists can be challenging. For this reason, investigators have sought to quantify the statistical probability of compiled gene sets rather than individual genes. The gene sets typically are organized around a biological theme or pathway. We compute correlations between different gene set tests and elect to use Fisher’s self-contained method for gene set analysis. We improve Fisher’s differential expression analysis of a gene set by limiting the p-value of an individual gene within the gene set to prevent a small percentage of genes from determining the statistical significance of the entire set. In addition, we also compute dependencies among genes within the set to determine which genes are statistically linked. The method is applied to T-ALL (T-lineage Acute Lymphoblastic Leukemia) to identify differentially expressed gene sets between T-ALL and normal patients and T-ALL and AML (Acute Myeloid Leukemia) patients. Public Library of Science 2016-10-06 /pmc/articles/PMC5053608/ /pubmed/27711232 http://dx.doi.org/10.1371/journal.pone.0163918 Text en © 2016 Torres et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Torres, David J.
Cannon, Judy L.
Ricoy, Ulises M.
Johnson, Christopher
Self-Contained Statistical Analysis of Gene Sets
title Self-Contained Statistical Analysis of Gene Sets
title_full Self-Contained Statistical Analysis of Gene Sets
title_fullStr Self-Contained Statistical Analysis of Gene Sets
title_full_unstemmed Self-Contained Statistical Analysis of Gene Sets
title_short Self-Contained Statistical Analysis of Gene Sets
title_sort self-contained statistical analysis of gene sets
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5053608/
https://www.ncbi.nlm.nih.gov/pubmed/27711232
http://dx.doi.org/10.1371/journal.pone.0163918
work_keys_str_mv AT torresdavidj selfcontainedstatisticalanalysisofgenesets
AT cannonjudyl selfcontainedstatisticalanalysisofgenesets
AT ricoyulisesm selfcontainedstatisticalanalysisofgenesets
AT johnsonchristopher selfcontainedstatisticalanalysisofgenesets