Cargando…

WeGET: predicting new genes for molecular systems by weighted co-expression

We have developed the Weighted Gene Expression Tool and database (WeGET, http://weget.cmbi.umcn.nl) for the prediction of new genes of a molecular system by correlated gene expression. WeGET utilizes a compendium of 465 human and 560 murine gene expression datasets that have been collected from mult...

Descripción completa

Detalles Bibliográficos
Autores principales: Szklarczyk, Radek, Megchelenbrink, Wout, Cizek, Pavel, Ledent, Marie, Velemans, Gonny, Szklarczyk, Damian, Huynen, Martijn A.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4702868/
https://www.ncbi.nlm.nih.gov/pubmed/26582928
http://dx.doi.org/10.1093/nar/gkv1228
_version_ 1782408668663250944
author Szklarczyk, Radek
Megchelenbrink, Wout
Cizek, Pavel
Ledent, Marie
Velemans, Gonny
Szklarczyk, Damian
Huynen, Martijn A.
author_facet Szklarczyk, Radek
Megchelenbrink, Wout
Cizek, Pavel
Ledent, Marie
Velemans, Gonny
Szklarczyk, Damian
Huynen, Martijn A.
author_sort Szklarczyk, Radek
collection PubMed
description We have developed the Weighted Gene Expression Tool and database (WeGET, http://weget.cmbi.umcn.nl) for the prediction of new genes of a molecular system by correlated gene expression. WeGET utilizes a compendium of 465 human and 560 murine gene expression datasets that have been collected from multiple tissues under a wide range of experimental conditions. It exploits this abundance of expression data by assigning a high weight to datasets in which the known genes of a molecular system are harmoniously up- and down-regulated. WeGET ranks new candidate genes by calculating their weighted co-expression with that system. A weighted rank is calculated for human genes and their mouse orthologs. Then, an integrated gene rank and p-value is computed using a rank-order statistic. We applied our method to predict novel genes that have a high degree of co-expression with Gene Ontology terms and pathways from KEGG and Reactome. For each query set we provide a list of predicted novel genes, computed weights for transcription datasets used and cell and tissue types that contributed to the final predictions. The performance for each query set is assessed by 10-fold cross-validation. Finally, users can use the WeGET to predict novel genes that co-express with a custom query set.
format Online
Article
Text
id pubmed-4702868
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-47028682016-01-07 WeGET: predicting new genes for molecular systems by weighted co-expression Szklarczyk, Radek Megchelenbrink, Wout Cizek, Pavel Ledent, Marie Velemans, Gonny Szklarczyk, Damian Huynen, Martijn A. Nucleic Acids Res Database Issue We have developed the Weighted Gene Expression Tool and database (WeGET, http://weget.cmbi.umcn.nl) for the prediction of new genes of a molecular system by correlated gene expression. WeGET utilizes a compendium of 465 human and 560 murine gene expression datasets that have been collected from multiple tissues under a wide range of experimental conditions. It exploits this abundance of expression data by assigning a high weight to datasets in which the known genes of a molecular system are harmoniously up- and down-regulated. WeGET ranks new candidate genes by calculating their weighted co-expression with that system. A weighted rank is calculated for human genes and their mouse orthologs. Then, an integrated gene rank and p-value is computed using a rank-order statistic. We applied our method to predict novel genes that have a high degree of co-expression with Gene Ontology terms and pathways from KEGG and Reactome. For each query set we provide a list of predicted novel genes, computed weights for transcription datasets used and cell and tissue types that contributed to the final predictions. The performance for each query set is assessed by 10-fold cross-validation. Finally, users can use the WeGET to predict novel genes that co-express with a custom query set. Oxford University Press 2016-01-04 2015-11-17 /pmc/articles/PMC4702868/ /pubmed/26582928 http://dx.doi.org/10.1093/nar/gkv1228 Text en © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Database Issue
Szklarczyk, Radek
Megchelenbrink, Wout
Cizek, Pavel
Ledent, Marie
Velemans, Gonny
Szklarczyk, Damian
Huynen, Martijn A.
WeGET: predicting new genes for molecular systems by weighted co-expression
title WeGET: predicting new genes for molecular systems by weighted co-expression
title_full WeGET: predicting new genes for molecular systems by weighted co-expression
title_fullStr WeGET: predicting new genes for molecular systems by weighted co-expression
title_full_unstemmed WeGET: predicting new genes for molecular systems by weighted co-expression
title_short WeGET: predicting new genes for molecular systems by weighted co-expression
title_sort weget: predicting new genes for molecular systems by weighted co-expression
topic Database Issue
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4702868/
https://www.ncbi.nlm.nih.gov/pubmed/26582928
http://dx.doi.org/10.1093/nar/gkv1228
work_keys_str_mv AT szklarczykradek wegetpredictingnewgenesformolecularsystemsbyweightedcoexpression
AT megchelenbrinkwout wegetpredictingnewgenesformolecularsystemsbyweightedcoexpression
AT cizekpavel wegetpredictingnewgenesformolecularsystemsbyweightedcoexpression
AT ledentmarie wegetpredictingnewgenesformolecularsystemsbyweightedcoexpression
AT velemansgonny wegetpredictingnewgenesformolecularsystemsbyweightedcoexpression
AT szklarczykdamian wegetpredictingnewgenesformolecularsystemsbyweightedcoexpression
AT huynenmartijna wegetpredictingnewgenesformolecularsystemsbyweightedcoexpression