Cargando…
Self-Contained Statistical Analysis of Gene Sets
Microarrays are a powerful tool for studying differential gene expression. However, lists of many differentially expressed genes are often generated, and unraveling meaningful biological processes from the lists can be challenging. For this reason, investigators have sought to quantify the statistic...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5053608/ https://www.ncbi.nlm.nih.gov/pubmed/27711232 http://dx.doi.org/10.1371/journal.pone.0163918 |
_version_ | 1782458445190922240 |
---|---|
author | Torres, David J. Cannon, Judy L. Ricoy, Ulises M. Johnson, Christopher |
author_facet | Torres, David J. Cannon, Judy L. Ricoy, Ulises M. Johnson, Christopher |
author_sort | Torres, David J. |
collection | PubMed |
description | Microarrays are a powerful tool for studying differential gene expression. However, lists of many differentially expressed genes are often generated, and unraveling meaningful biological processes from the lists can be challenging. For this reason, investigators have sought to quantify the statistical probability of compiled gene sets rather than individual genes. The gene sets typically are organized around a biological theme or pathway. We compute correlations between different gene set tests and elect to use Fisher’s self-contained method for gene set analysis. We improve Fisher’s differential expression analysis of a gene set by limiting the p-value of an individual gene within the gene set to prevent a small percentage of genes from determining the statistical significance of the entire set. In addition, we also compute dependencies among genes within the set to determine which genes are statistically linked. The method is applied to T-ALL (T-lineage Acute Lymphoblastic Leukemia) to identify differentially expressed gene sets between T-ALL and normal patients and T-ALL and AML (Acute Myeloid Leukemia) patients. |
format | Online Article Text |
id | pubmed-5053608 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-50536082016-10-27 Self-Contained Statistical Analysis of Gene Sets Torres, David J. Cannon, Judy L. Ricoy, Ulises M. Johnson, Christopher PLoS One Research Article Microarrays are a powerful tool for studying differential gene expression. However, lists of many differentially expressed genes are often generated, and unraveling meaningful biological processes from the lists can be challenging. For this reason, investigators have sought to quantify the statistical probability of compiled gene sets rather than individual genes. The gene sets typically are organized around a biological theme or pathway. We compute correlations between different gene set tests and elect to use Fisher’s self-contained method for gene set analysis. We improve Fisher’s differential expression analysis of a gene set by limiting the p-value of an individual gene within the gene set to prevent a small percentage of genes from determining the statistical significance of the entire set. In addition, we also compute dependencies among genes within the set to determine which genes are statistically linked. The method is applied to T-ALL (T-lineage Acute Lymphoblastic Leukemia) to identify differentially expressed gene sets between T-ALL and normal patients and T-ALL and AML (Acute Myeloid Leukemia) patients. Public Library of Science 2016-10-06 /pmc/articles/PMC5053608/ /pubmed/27711232 http://dx.doi.org/10.1371/journal.pone.0163918 Text en © 2016 Torres et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Torres, David J. Cannon, Judy L. Ricoy, Ulises M. Johnson, Christopher Self-Contained Statistical Analysis of Gene Sets |
title | Self-Contained Statistical Analysis of Gene Sets |
title_full | Self-Contained Statistical Analysis of Gene Sets |
title_fullStr | Self-Contained Statistical Analysis of Gene Sets |
title_full_unstemmed | Self-Contained Statistical Analysis of Gene Sets |
title_short | Self-Contained Statistical Analysis of Gene Sets |
title_sort | self-contained statistical analysis of gene sets |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5053608/ https://www.ncbi.nlm.nih.gov/pubmed/27711232 http://dx.doi.org/10.1371/journal.pone.0163918 |
work_keys_str_mv | AT torresdavidj selfcontainedstatisticalanalysisofgenesets AT cannonjudyl selfcontainedstatisticalanalysisofgenesets AT ricoyulisesm selfcontainedstatisticalanalysisofgenesets AT johnsonchristopher selfcontainedstatisticalanalysisofgenesets |