Cargando…

Computational Selection of Transcriptomics Experiments Improves Guilt-by-Association Analyses

The Guilt-by-Association (GBA) principle, according to which genes with similar expression profiles are functionally associated, is widely applied for functional analyses using large heterogeneous collections of transcriptomics data. However, the use of such large collections could hamper GBA functi...

Descripción completa

Detalles Bibliográficos
Autores principales:	Bhat, Prajwal, Yang, Haixuan, Bögre, László, Devoto, Alessandra, Paccanaro, Alberto
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Public Library of Science 2012
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3413680/ https://www.ncbi.nlm.nih.gov/pubmed/22879875 http://dx.doi.org/10.1371/journal.pone.0039681

_version_	1782240098172010496
author	Bhat, Prajwal Yang, Haixuan Bögre, László Devoto, Alessandra Paccanaro, Alberto
author_facet	Bhat, Prajwal Yang, Haixuan Bögre, László Devoto, Alessandra Paccanaro, Alberto
author_sort	Bhat, Prajwal
collection	PubMed
description	The Guilt-by-Association (GBA) principle, according to which genes with similar expression profiles are functionally associated, is widely applied for functional analyses using large heterogeneous collections of transcriptomics data. However, the use of such large collections could hamper GBA functional analysis for genes whose expression is condition specific. In these cases a smaller set of condition related experiments should instead be used, but identifying such functionally relevant experiments from large collections based on literature knowledge alone is an impractical task. We begin this paper by analyzing, both from a mathematical and a biological point of view, why only condition specific experiments should be used in GBA functional analysis. We are able to show that this phenomenon is independent of the functional categorization scheme and of the organisms being analyzed. We then present a semi-supervised algorithm that can select functionally relevant experiments from large collections of transcriptomics experiments. Our algorithm is able to select experiments relevant to a given GO term, MIPS FunCat term or even KEGG pathways. We extensively test our algorithm on large dataset collections for yeast and Arabidopsis. We demonstrate that: using the selected experiments there is a statistically significant improvement in correlation between genes in the functional category of interest; the selected experiments improve GBA-based gene function prediction; the effectiveness of the selected experiments increases with annotation specificity; our algorithm can be successfully applied to GBA-based pathway reconstruction. Importantly, the set of experiments selected by the algorithm reflects the existing literature knowledge about the experiments. [A MATLAB implementation of the algorithm and all the data used in this paper can be downloaded from the paper website: http://www.paccanarolab.org/papers/CorrGene/].
format	Online Article Text
id	pubmed-3413680
institution	National Center for Biotechnology Information
language	English
publishDate	2012
publisher	Public Library of Science
record_format	MEDLINE/PubMed
spelling	pubmed-34136802012-08-09 Computational Selection of Transcriptomics Experiments Improves Guilt-by-Association Analyses Bhat, Prajwal Yang, Haixuan Bögre, László Devoto, Alessandra Paccanaro, Alberto PLoS One Research Article The Guilt-by-Association (GBA) principle, according to which genes with similar expression profiles are functionally associated, is widely applied for functional analyses using large heterogeneous collections of transcriptomics data. However, the use of such large collections could hamper GBA functional analysis for genes whose expression is condition specific. In these cases a smaller set of condition related experiments should instead be used, but identifying such functionally relevant experiments from large collections based on literature knowledge alone is an impractical task. We begin this paper by analyzing, both from a mathematical and a biological point of view, why only condition specific experiments should be used in GBA functional analysis. We are able to show that this phenomenon is independent of the functional categorization scheme and of the organisms being analyzed. We then present a semi-supervised algorithm that can select functionally relevant experiments from large collections of transcriptomics experiments. Our algorithm is able to select experiments relevant to a given GO term, MIPS FunCat term or even KEGG pathways. We extensively test our algorithm on large dataset collections for yeast and Arabidopsis. We demonstrate that: using the selected experiments there is a statistically significant improvement in correlation between genes in the functional category of interest; the selected experiments improve GBA-based gene function prediction; the effectiveness of the selected experiments increases with annotation specificity; our algorithm can be successfully applied to GBA-based pathway reconstruction. Importantly, the set of experiments selected by the algorithm reflects the existing literature knowledge about the experiments. [A MATLAB implementation of the algorithm and all the data used in this paper can be downloaded from the paper website: http://www.paccanarolab.org/papers/CorrGene/]. Public Library of Science 2012-08-07 /pmc/articles/PMC3413680/ /pubmed/22879875 http://dx.doi.org/10.1371/journal.pone.0039681 Text en © 2012 Bhat et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle	Research Article Bhat, Prajwal Yang, Haixuan Bögre, László Devoto, Alessandra Paccanaro, Alberto Computational Selection of Transcriptomics Experiments Improves Guilt-by-Association Analyses
title	Computational Selection of Transcriptomics Experiments Improves Guilt-by-Association Analyses
title_full	Computational Selection of Transcriptomics Experiments Improves Guilt-by-Association Analyses
title_fullStr	Computational Selection of Transcriptomics Experiments Improves Guilt-by-Association Analyses
title_full_unstemmed	Computational Selection of Transcriptomics Experiments Improves Guilt-by-Association Analyses
title_short	Computational Selection of Transcriptomics Experiments Improves Guilt-by-Association Analyses
title_sort	computational selection of transcriptomics experiments improves guilt-by-association analyses
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3413680/ https://www.ncbi.nlm.nih.gov/pubmed/22879875 http://dx.doi.org/10.1371/journal.pone.0039681
work_keys_str_mv	AT bhatprajwal computationalselectionoftranscriptomicsexperimentsimprovesguiltbyassociationanalyses AT yanghaixuan computationalselectionoftranscriptomicsexperimentsimprovesguiltbyassociationanalyses AT bogrelaszlo computationalselectionoftranscriptomicsexperimentsimprovesguiltbyassociationanalyses AT devotoalessandra computationalselectionoftranscriptomicsexperimentsimprovesguiltbyassociationanalyses AT paccanaroalberto computationalselectionoftranscriptomicsexperimentsimprovesguiltbyassociationanalyses

Computational Selection of Transcriptomics Experiments Improves Guilt-by-Association Analyses

Ejemplares similares