Cargando…
A ratiometric-based measure of gene co-expression
BACKGROUND: Gene co-expression analysis has previously been based on measures that include correlation coefficients and mutual information, as well as newcomers such as MIC. These measures depend primarily on the degree of association between the RNA levels of two genes and to a lesser extent on the...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4289233/ https://www.ncbi.nlm.nih.gov/pubmed/25411051 http://dx.doi.org/10.1186/1471-2105-15-331 |
_version_ | 1782352071906820096 |
---|---|
author | Abelin, Anna CT Marinov, Georgi K Williams, Brian A McCue, Kenneth Wold, Barbara J |
author_facet | Abelin, Anna CT Marinov, Georgi K Williams, Brian A McCue, Kenneth Wold, Barbara J |
author_sort | Abelin, Anna CT |
collection | PubMed |
description | BACKGROUND: Gene co-expression analysis has previously been based on measures that include correlation coefficients and mutual information, as well as newcomers such as MIC. These measures depend primarily on the degree of association between the RNA levels of two genes and to a lesser extent on their variability. They focus on the similarity of expression value trajectories that change in like manner across samples. However there are relationships of biological interest for which these classical measures are expected to be insensitive. These include genes whose expression levels are ratiometrically stable and genes whose variance is tightly constrained. Large-scale studies of relatively homogeneous samples, including single cell RNA-seq, are experimental settings in which such relationships might be especially pertinent. RESULTS: We develop and implement a ratiometric approach for detecting gene associations (abbreviated RA). It is based on the coefficient of variation of the measured expression ratio of each pair of genes. We apply it to a collection of lymphoblastoid RNA-seq data from the 1000 Genomes Project Consortium, a typical sample set with high overall homogeneity. RA is a selective method, reporting in this case ~1/4 of all possible gene pairs, yet these relationships include a distilled picture of biological relationships previously found by other methods. In addition, RA reveals expression relationships that are not detected by traditional correlation and mutual information methods. We also analyze data from individual lymphoblastoid cells and show that desirable properties of the RA method extend to single-cell RNA-seq. CONCLUSION: We show that our ratiometric method identifies biologically significant relationships that are often missed or low-ranked by conventional association-based methods when applied to a relatively homogenous dataset. The results open new questions about the regulatory mechanisms that produce strong RA relationships. RA is scalable and potentially well suited for the analysis of thousands of bulk-RNA or single-cell transcriptomes. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/1471-2105-15-331) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-4289233 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-42892332015-01-11 A ratiometric-based measure of gene co-expression Abelin, Anna CT Marinov, Georgi K Williams, Brian A McCue, Kenneth Wold, Barbara J BMC Bioinformatics Methodology Article BACKGROUND: Gene co-expression analysis has previously been based on measures that include correlation coefficients and mutual information, as well as newcomers such as MIC. These measures depend primarily on the degree of association between the RNA levels of two genes and to a lesser extent on their variability. They focus on the similarity of expression value trajectories that change in like manner across samples. However there are relationships of biological interest for which these classical measures are expected to be insensitive. These include genes whose expression levels are ratiometrically stable and genes whose variance is tightly constrained. Large-scale studies of relatively homogeneous samples, including single cell RNA-seq, are experimental settings in which such relationships might be especially pertinent. RESULTS: We develop and implement a ratiometric approach for detecting gene associations (abbreviated RA). It is based on the coefficient of variation of the measured expression ratio of each pair of genes. We apply it to a collection of lymphoblastoid RNA-seq data from the 1000 Genomes Project Consortium, a typical sample set with high overall homogeneity. RA is a selective method, reporting in this case ~1/4 of all possible gene pairs, yet these relationships include a distilled picture of biological relationships previously found by other methods. In addition, RA reveals expression relationships that are not detected by traditional correlation and mutual information methods. We also analyze data from individual lymphoblastoid cells and show that desirable properties of the RA method extend to single-cell RNA-seq. CONCLUSION: We show that our ratiometric method identifies biologically significant relationships that are often missed or low-ranked by conventional association-based methods when applied to a relatively homogenous dataset. The results open new questions about the regulatory mechanisms that produce strong RA relationships. RA is scalable and potentially well suited for the analysis of thousands of bulk-RNA or single-cell transcriptomes. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/1471-2105-15-331) contains supplementary material, which is available to authorized users. BioMed Central 2014-11-20 /pmc/articles/PMC4289233/ /pubmed/25411051 http://dx.doi.org/10.1186/1471-2105-15-331 Text en © Abelin et al.; licensee BioMed Central Ltd. 2014 This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Methodology Article Abelin, Anna CT Marinov, Georgi K Williams, Brian A McCue, Kenneth Wold, Barbara J A ratiometric-based measure of gene co-expression |
title | A ratiometric-based measure of gene co-expression |
title_full | A ratiometric-based measure of gene co-expression |
title_fullStr | A ratiometric-based measure of gene co-expression |
title_full_unstemmed | A ratiometric-based measure of gene co-expression |
title_short | A ratiometric-based measure of gene co-expression |
title_sort | ratiometric-based measure of gene co-expression |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4289233/ https://www.ncbi.nlm.nih.gov/pubmed/25411051 http://dx.doi.org/10.1186/1471-2105-15-331 |
work_keys_str_mv | AT abelinannact aratiometricbasedmeasureofgenecoexpression AT marinovgeorgik aratiometricbasedmeasureofgenecoexpression AT williamsbriana aratiometricbasedmeasureofgenecoexpression AT mccuekenneth aratiometricbasedmeasureofgenecoexpression AT woldbarbaraj aratiometricbasedmeasureofgenecoexpression AT abelinannact ratiometricbasedmeasureofgenecoexpression AT marinovgeorgik ratiometricbasedmeasureofgenecoexpression AT williamsbriana ratiometricbasedmeasureofgenecoexpression AT mccuekenneth ratiometricbasedmeasureofgenecoexpression AT woldbarbaraj ratiometricbasedmeasureofgenecoexpression |