Cargando…

A ratiometric-based measure of gene co-expression

BACKGROUND: Gene co-expression analysis has previously been based on measures that include correlation coefficients and mutual information, as well as newcomers such as MIC. These measures depend primarily on the degree of association between the RNA levels of two genes and to a lesser extent on the...

Descripción completa

Detalles Bibliográficos
Autores principales: Abelin, Anna CT, Marinov, Georgi K, Williams, Brian A, McCue, Kenneth, Wold, Barbara J
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4289233/
https://www.ncbi.nlm.nih.gov/pubmed/25411051
http://dx.doi.org/10.1186/1471-2105-15-331
_version_ 1782352071906820096
author Abelin, Anna CT
Marinov, Georgi K
Williams, Brian A
McCue, Kenneth
Wold, Barbara J
author_facet Abelin, Anna CT
Marinov, Georgi K
Williams, Brian A
McCue, Kenneth
Wold, Barbara J
author_sort Abelin, Anna CT
collection PubMed
description BACKGROUND: Gene co-expression analysis has previously been based on measures that include correlation coefficients and mutual information, as well as newcomers such as MIC. These measures depend primarily on the degree of association between the RNA levels of two genes and to a lesser extent on their variability. They focus on the similarity of expression value trajectories that change in like manner across samples. However there are relationships of biological interest for which these classical measures are expected to be insensitive. These include genes whose expression levels are ratiometrically stable and genes whose variance is tightly constrained. Large-scale studies of relatively homogeneous samples, including single cell RNA-seq, are experimental settings in which such relationships might be especially pertinent. RESULTS: We develop and implement a ratiometric approach for detecting gene associations (abbreviated RA). It is based on the coefficient of variation of the measured expression ratio of each pair of genes. We apply it to a collection of lymphoblastoid RNA-seq data from the 1000 Genomes Project Consortium, a typical sample set with high overall homogeneity. RA is a selective method, reporting in this case ~1/4 of all possible gene pairs, yet these relationships include a distilled picture of biological relationships previously found by other methods. In addition, RA reveals expression relationships that are not detected by traditional correlation and mutual information methods. We also analyze data from individual lymphoblastoid cells and show that desirable properties of the RA method extend to single-cell RNA-seq. CONCLUSION: We show that our ratiometric method identifies biologically significant relationships that are often missed or low-ranked by conventional association-based methods when applied to a relatively homogenous dataset. The results open new questions about the regulatory mechanisms that produce strong RA relationships. RA is scalable and potentially well suited for the analysis of thousands of bulk-RNA or single-cell transcriptomes. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/1471-2105-15-331) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4289233
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-42892332015-01-11 A ratiometric-based measure of gene co-expression Abelin, Anna CT Marinov, Georgi K Williams, Brian A McCue, Kenneth Wold, Barbara J BMC Bioinformatics Methodology Article BACKGROUND: Gene co-expression analysis has previously been based on measures that include correlation coefficients and mutual information, as well as newcomers such as MIC. These measures depend primarily on the degree of association between the RNA levels of two genes and to a lesser extent on their variability. They focus on the similarity of expression value trajectories that change in like manner across samples. However there are relationships of biological interest for which these classical measures are expected to be insensitive. These include genes whose expression levels are ratiometrically stable and genes whose variance is tightly constrained. Large-scale studies of relatively homogeneous samples, including single cell RNA-seq, are experimental settings in which such relationships might be especially pertinent. RESULTS: We develop and implement a ratiometric approach for detecting gene associations (abbreviated RA). It is based on the coefficient of variation of the measured expression ratio of each pair of genes. We apply it to a collection of lymphoblastoid RNA-seq data from the 1000 Genomes Project Consortium, a typical sample set with high overall homogeneity. RA is a selective method, reporting in this case ~1/4 of all possible gene pairs, yet these relationships include a distilled picture of biological relationships previously found by other methods. In addition, RA reveals expression relationships that are not detected by traditional correlation and mutual information methods. We also analyze data from individual lymphoblastoid cells and show that desirable properties of the RA method extend to single-cell RNA-seq. CONCLUSION: We show that our ratiometric method identifies biologically significant relationships that are often missed or low-ranked by conventional association-based methods when applied to a relatively homogenous dataset. The results open new questions about the regulatory mechanisms that produce strong RA relationships. RA is scalable and potentially well suited for the analysis of thousands of bulk-RNA or single-cell transcriptomes. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/1471-2105-15-331) contains supplementary material, which is available to authorized users. BioMed Central 2014-11-20 /pmc/articles/PMC4289233/ /pubmed/25411051 http://dx.doi.org/10.1186/1471-2105-15-331 Text en © Abelin et al.; licensee BioMed Central Ltd. 2014 This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Methodology Article
Abelin, Anna CT
Marinov, Georgi K
Williams, Brian A
McCue, Kenneth
Wold, Barbara J
A ratiometric-based measure of gene co-expression
title A ratiometric-based measure of gene co-expression
title_full A ratiometric-based measure of gene co-expression
title_fullStr A ratiometric-based measure of gene co-expression
title_full_unstemmed A ratiometric-based measure of gene co-expression
title_short A ratiometric-based measure of gene co-expression
title_sort ratiometric-based measure of gene co-expression
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4289233/
https://www.ncbi.nlm.nih.gov/pubmed/25411051
http://dx.doi.org/10.1186/1471-2105-15-331
work_keys_str_mv AT abelinannact aratiometricbasedmeasureofgenecoexpression
AT marinovgeorgik aratiometricbasedmeasureofgenecoexpression
AT williamsbriana aratiometricbasedmeasureofgenecoexpression
AT mccuekenneth aratiometricbasedmeasureofgenecoexpression
AT woldbarbaraj aratiometricbasedmeasureofgenecoexpression
AT abelinannact ratiometricbasedmeasureofgenecoexpression
AT marinovgeorgik ratiometricbasedmeasureofgenecoexpression
AT williamsbriana ratiometricbasedmeasureofgenecoexpression
AT mccuekenneth ratiometricbasedmeasureofgenecoexpression
AT woldbarbaraj ratiometricbasedmeasureofgenecoexpression