Cargando…
Multivariate Analysis and Visualization of Splicing Correlations in Single-Gene Transcriptomes
BACKGROUND: RNA metabolism, through 'combinatorial splicing', can generate enormous structural diversity in the proteome. Alternative domains may interact, however, with unpredictable phenotypic consequences, necessitating integrated RNA-level regulation of molecular composition. Splicing...
Autores principales: | , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2007
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1785386/ https://www.ncbi.nlm.nih.gov/pubmed/17233916 http://dx.doi.org/10.1186/1471-2105-8-16 |
_version_ | 1782132119229693952 |
---|---|
author | Emerick, Mark C Parmigiani, Giovanni Agnew, William S |
author_facet | Emerick, Mark C Parmigiani, Giovanni Agnew, William S |
author_sort | Emerick, Mark C |
collection | PubMed |
description | BACKGROUND: RNA metabolism, through 'combinatorial splicing', can generate enormous structural diversity in the proteome. Alternative domains may interact, however, with unpredictable phenotypic consequences, necessitating integrated RNA-level regulation of molecular composition. Splicing correlations within transcripts of single genes provide valuable clues to functional relationships among molecular domains as well as genomic targets for higher-order splicing regulation. RESULTS: We present tools to visualize complex splicing patterns in full-length cDNA libraries. Developmental changes in pair-wise correlations are presented vectorially in 'clock plots' and linkage grids. Higher-order correlations are assessed statistically through Monte Carlo analysis of a log-linear model with an empirical-Bayes estimate of the true probabilities of observed and unobserved splice forms. Log-linear coefficients are visualized in a 'spliceprint,' a signature of splice correlations in the transcriptome. We present two novel metrics: the linkage change index, which measures the directional change in pair-wise correlation with tissue differentiation, and the accuracy index, a very simple goodness-of-fit metric that is more sensitive than the integrated squared error when applied to sparsely populated tables, and unlike chi-square, does not diverge at low variance. Considerable attention is given to sparse contingency tables, which are inherent to single-gene libraries. CONCLUSION: Patterns of splicing correlations are revealed, which span a broad range of interaction order and change in development. The methods have a broad scope of applicability, beyond the single gene – including, for example, multiple gene interactions in the complete transcriptome. |
format | Text |
id | pubmed-1785386 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2007 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-17853862007-02-05 Multivariate Analysis and Visualization of Splicing Correlations in Single-Gene Transcriptomes Emerick, Mark C Parmigiani, Giovanni Agnew, William S BMC Bioinformatics Methodology Article BACKGROUND: RNA metabolism, through 'combinatorial splicing', can generate enormous structural diversity in the proteome. Alternative domains may interact, however, with unpredictable phenotypic consequences, necessitating integrated RNA-level regulation of molecular composition. Splicing correlations within transcripts of single genes provide valuable clues to functional relationships among molecular domains as well as genomic targets for higher-order splicing regulation. RESULTS: We present tools to visualize complex splicing patterns in full-length cDNA libraries. Developmental changes in pair-wise correlations are presented vectorially in 'clock plots' and linkage grids. Higher-order correlations are assessed statistically through Monte Carlo analysis of a log-linear model with an empirical-Bayes estimate of the true probabilities of observed and unobserved splice forms. Log-linear coefficients are visualized in a 'spliceprint,' a signature of splice correlations in the transcriptome. We present two novel metrics: the linkage change index, which measures the directional change in pair-wise correlation with tissue differentiation, and the accuracy index, a very simple goodness-of-fit metric that is more sensitive than the integrated squared error when applied to sparsely populated tables, and unlike chi-square, does not diverge at low variance. Considerable attention is given to sparse contingency tables, which are inherent to single-gene libraries. CONCLUSION: Patterns of splicing correlations are revealed, which span a broad range of interaction order and change in development. The methods have a broad scope of applicability, beyond the single gene – including, for example, multiple gene interactions in the complete transcriptome. BioMed Central 2007-01-18 /pmc/articles/PMC1785386/ /pubmed/17233916 http://dx.doi.org/10.1186/1471-2105-8-16 Text en Copyright © 2007 Emerick et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Methodology Article Emerick, Mark C Parmigiani, Giovanni Agnew, William S Multivariate Analysis and Visualization of Splicing Correlations in Single-Gene Transcriptomes |
title | Multivariate Analysis and Visualization of Splicing Correlations in Single-Gene Transcriptomes |
title_full | Multivariate Analysis and Visualization of Splicing Correlations in Single-Gene Transcriptomes |
title_fullStr | Multivariate Analysis and Visualization of Splicing Correlations in Single-Gene Transcriptomes |
title_full_unstemmed | Multivariate Analysis and Visualization of Splicing Correlations in Single-Gene Transcriptomes |
title_short | Multivariate Analysis and Visualization of Splicing Correlations in Single-Gene Transcriptomes |
title_sort | multivariate analysis and visualization of splicing correlations in single-gene transcriptomes |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1785386/ https://www.ncbi.nlm.nih.gov/pubmed/17233916 http://dx.doi.org/10.1186/1471-2105-8-16 |
work_keys_str_mv | AT emerickmarkc multivariateanalysisandvisualizationofsplicingcorrelationsinsinglegenetranscriptomes AT parmigianigiovanni multivariateanalysisandvisualizationofsplicingcorrelationsinsinglegenetranscriptomes AT agnewwilliams multivariateanalysisandvisualizationofsplicingcorrelationsinsinglegenetranscriptomes |