Cargando…

Improved identification of concordant and discordant gene expression signatures using an updated rank-rank hypergeometric overlap approach

Recent advances in large-scale gene expression profiling necessitate concurrent development of biostatistical approaches to reveal meaningful biological relationships. Most analyses rely on significance thresholds for identifying differentially expressed genes. We use an approach to compare gene exp...

Descripción completa

Detalles Bibliográficos
Autores principales: Cahill, Kelly M., Huo, Zhiguang, Tseng, George C., Logan, Ryan W., Seney, Marianne L.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6018631/
https://www.ncbi.nlm.nih.gov/pubmed/29942049
http://dx.doi.org/10.1038/s41598-018-27903-2
_version_ 1783334994871058432
author Cahill, Kelly M.
Huo, Zhiguang
Tseng, George C.
Logan, Ryan W.
Seney, Marianne L.
author_facet Cahill, Kelly M.
Huo, Zhiguang
Tseng, George C.
Logan, Ryan W.
Seney, Marianne L.
author_sort Cahill, Kelly M.
collection PubMed
description Recent advances in large-scale gene expression profiling necessitate concurrent development of biostatistical approaches to reveal meaningful biological relationships. Most analyses rely on significance thresholds for identifying differentially expressed genes. We use an approach to compare gene expression datasets using ‘threshold-free’ comparisons. Significance cut-offs to identify genes shared between datasets may be too stringent and may miss concordant patterns of gene expression with potential biological relevance. A threshold-free approach gaining popularity in several research areas, including neuroscience, is Rank–Rank Hypergeometric Overlap (RRHO). Genes are ranked by their p-value and effect size direction, and ranked lists are compared to identify significantly overlapping genes across a continuous significance gradient rather than at a single arbitrary cut-off. We have updated the previous RRHO analysis by accurately detecting overlap of genes changed in the same and opposite directions between two datasets. Here, we use simulated and real data to show the drawbacks of the previous algorithm as well as the utility of our new algorithm. For example, we show the power of detecting discordant transcriptional patterns in the postmortem brain of subjects with psychiatric disorders. The new R package, RRHO2, offers a new, more intuitive visualization of concordant and discordant gene overlap.
format Online
Article
Text
id pubmed-6018631
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-60186312018-07-06 Improved identification of concordant and discordant gene expression signatures using an updated rank-rank hypergeometric overlap approach Cahill, Kelly M. Huo, Zhiguang Tseng, George C. Logan, Ryan W. Seney, Marianne L. Sci Rep Article Recent advances in large-scale gene expression profiling necessitate concurrent development of biostatistical approaches to reveal meaningful biological relationships. Most analyses rely on significance thresholds for identifying differentially expressed genes. We use an approach to compare gene expression datasets using ‘threshold-free’ comparisons. Significance cut-offs to identify genes shared between datasets may be too stringent and may miss concordant patterns of gene expression with potential biological relevance. A threshold-free approach gaining popularity in several research areas, including neuroscience, is Rank–Rank Hypergeometric Overlap (RRHO). Genes are ranked by their p-value and effect size direction, and ranked lists are compared to identify significantly overlapping genes across a continuous significance gradient rather than at a single arbitrary cut-off. We have updated the previous RRHO analysis by accurately detecting overlap of genes changed in the same and opposite directions between two datasets. Here, we use simulated and real data to show the drawbacks of the previous algorithm as well as the utility of our new algorithm. For example, we show the power of detecting discordant transcriptional patterns in the postmortem brain of subjects with psychiatric disorders. The new R package, RRHO2, offers a new, more intuitive visualization of concordant and discordant gene overlap. Nature Publishing Group UK 2018-06-25 /pmc/articles/PMC6018631/ /pubmed/29942049 http://dx.doi.org/10.1038/s41598-018-27903-2 Text en © The Author(s) 2018 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Cahill, Kelly M.
Huo, Zhiguang
Tseng, George C.
Logan, Ryan W.
Seney, Marianne L.
Improved identification of concordant and discordant gene expression signatures using an updated rank-rank hypergeometric overlap approach
title Improved identification of concordant and discordant gene expression signatures using an updated rank-rank hypergeometric overlap approach
title_full Improved identification of concordant and discordant gene expression signatures using an updated rank-rank hypergeometric overlap approach
title_fullStr Improved identification of concordant and discordant gene expression signatures using an updated rank-rank hypergeometric overlap approach
title_full_unstemmed Improved identification of concordant and discordant gene expression signatures using an updated rank-rank hypergeometric overlap approach
title_short Improved identification of concordant and discordant gene expression signatures using an updated rank-rank hypergeometric overlap approach
title_sort improved identification of concordant and discordant gene expression signatures using an updated rank-rank hypergeometric overlap approach
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6018631/
https://www.ncbi.nlm.nih.gov/pubmed/29942049
http://dx.doi.org/10.1038/s41598-018-27903-2
work_keys_str_mv AT cahillkellym improvedidentificationofconcordantanddiscordantgeneexpressionsignaturesusinganupdatedrankrankhypergeometricoverlapapproach
AT huozhiguang improvedidentificationofconcordantanddiscordantgeneexpressionsignaturesusinganupdatedrankrankhypergeometricoverlapapproach
AT tsenggeorgec improvedidentificationofconcordantanddiscordantgeneexpressionsignaturesusinganupdatedrankrankhypergeometricoverlapapproach
AT loganryanw improvedidentificationofconcordantanddiscordantgeneexpressionsignaturesusinganupdatedrankrankhypergeometricoverlapapproach
AT seneymariannel improvedidentificationofconcordantanddiscordantgeneexpressionsignaturesusinganupdatedrankrankhypergeometricoverlapapproach