Cargando…

Co-expressed Pathways DataBase for Tomato: a database to predict pathways relevant to a query gene

BACKGROUND: Gene co-expression, the similarity of gene expression profiles under various experimental conditions, has been used as an indicator of functional relationships between genes, and many co-expression databases have been developed for predicting gene functions. These databases usually provi...

Descripción completa

Detalles Bibliográficos
Autores principales: Narise, Takafumi, Sakurai, Nozomu, Obayashi, Takeshi, Ohta, Hiroyuki, Shibata, Daisuke
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5460524/
https://www.ncbi.nlm.nih.gov/pubmed/28583129
http://dx.doi.org/10.1186/s12864-017-3786-3
_version_ 1783242195897155584
author Narise, Takafumi
Sakurai, Nozomu
Obayashi, Takeshi
Ohta, Hiroyuki
Shibata, Daisuke
author_facet Narise, Takafumi
Sakurai, Nozomu
Obayashi, Takeshi
Ohta, Hiroyuki
Shibata, Daisuke
author_sort Narise, Takafumi
collection PubMed
description BACKGROUND: Gene co-expression, the similarity of gene expression profiles under various experimental conditions, has been used as an indicator of functional relationships between genes, and many co-expression databases have been developed for predicting gene functions. These databases usually provide users with a co-expression network and a list of strongly co-expressed genes for a query gene. Several of these databases also provide functional information on a set of strongly co-expressed genes (i.e., provide biological processes and pathways that are enriched in these strongly co-expressed genes), which is generally analyzed via over-representation analysis (ORA). A limitation of this approach may be that users can predict gene functions only based on the strongly co-expressed genes. RESULTS: In this study, we developed a new co-expression database that enables users to predict the function of tomato genes from the results of functional enrichment analyses of co-expressed genes while considering the genes that are not strongly co-expressed. To achieve this, we used the ORA approach with several thresholds to select co-expressed genes, and performed gene set enrichment analysis (GSEA) applied to a ranked list of genes ordered by the co-expression degree. We found that internal correlation in pathways affected the significance levels of the enrichment analyses. Therefore, we introduced a new measure for evaluating the relationship between the gene and pathway, termed the percentile (p)-score, which enables users to predict functionally relevant pathways without being affected by the internal correlation in pathways. In addition, we evaluated our approaches using receiver operating characteristic curves, which concluded that the p-score could improve the performance of the ORA. CONCLUSIONS: We developed a new database, named Co-expressed Pathways DataBase for Tomato, which is available at http://cox-path-db.kazusa.or.jp/tomato. The database allows users to predict pathways that are relevant to a query gene, which would help to infer gene functions. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-017-3786-3) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-5460524
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-54605242017-06-07 Co-expressed Pathways DataBase for Tomato: a database to predict pathways relevant to a query gene Narise, Takafumi Sakurai, Nozomu Obayashi, Takeshi Ohta, Hiroyuki Shibata, Daisuke BMC Genomics Database BACKGROUND: Gene co-expression, the similarity of gene expression profiles under various experimental conditions, has been used as an indicator of functional relationships between genes, and many co-expression databases have been developed for predicting gene functions. These databases usually provide users with a co-expression network and a list of strongly co-expressed genes for a query gene. Several of these databases also provide functional information on a set of strongly co-expressed genes (i.e., provide biological processes and pathways that are enriched in these strongly co-expressed genes), which is generally analyzed via over-representation analysis (ORA). A limitation of this approach may be that users can predict gene functions only based on the strongly co-expressed genes. RESULTS: In this study, we developed a new co-expression database that enables users to predict the function of tomato genes from the results of functional enrichment analyses of co-expressed genes while considering the genes that are not strongly co-expressed. To achieve this, we used the ORA approach with several thresholds to select co-expressed genes, and performed gene set enrichment analysis (GSEA) applied to a ranked list of genes ordered by the co-expression degree. We found that internal correlation in pathways affected the significance levels of the enrichment analyses. Therefore, we introduced a new measure for evaluating the relationship between the gene and pathway, termed the percentile (p)-score, which enables users to predict functionally relevant pathways without being affected by the internal correlation in pathways. In addition, we evaluated our approaches using receiver operating characteristic curves, which concluded that the p-score could improve the performance of the ORA. CONCLUSIONS: We developed a new database, named Co-expressed Pathways DataBase for Tomato, which is available at http://cox-path-db.kazusa.or.jp/tomato. The database allows users to predict pathways that are relevant to a query gene, which would help to infer gene functions. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-017-3786-3) contains supplementary material, which is available to authorized users. BioMed Central 2017-06-05 /pmc/articles/PMC5460524/ /pubmed/28583129 http://dx.doi.org/10.1186/s12864-017-3786-3 Text en © The Author(s) 2017 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Database
Narise, Takafumi
Sakurai, Nozomu
Obayashi, Takeshi
Ohta, Hiroyuki
Shibata, Daisuke
Co-expressed Pathways DataBase for Tomato: a database to predict pathways relevant to a query gene
title Co-expressed Pathways DataBase for Tomato: a database to predict pathways relevant to a query gene
title_full Co-expressed Pathways DataBase for Tomato: a database to predict pathways relevant to a query gene
title_fullStr Co-expressed Pathways DataBase for Tomato: a database to predict pathways relevant to a query gene
title_full_unstemmed Co-expressed Pathways DataBase for Tomato: a database to predict pathways relevant to a query gene
title_short Co-expressed Pathways DataBase for Tomato: a database to predict pathways relevant to a query gene
title_sort co-expressed pathways database for tomato: a database to predict pathways relevant to a query gene
topic Database
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5460524/
https://www.ncbi.nlm.nih.gov/pubmed/28583129
http://dx.doi.org/10.1186/s12864-017-3786-3
work_keys_str_mv AT narisetakafumi coexpressedpathwaysdatabasefortomatoadatabasetopredictpathwaysrelevanttoaquerygene
AT sakurainozomu coexpressedpathwaysdatabasefortomatoadatabasetopredictpathwaysrelevanttoaquerygene
AT obayashitakeshi coexpressedpathwaysdatabasefortomatoadatabasetopredictpathwaysrelevanttoaquerygene
AT ohtahiroyuki coexpressedpathwaysdatabasefortomatoadatabasetopredictpathwaysrelevanttoaquerygene
AT shibatadaisuke coexpressedpathwaysdatabasefortomatoadatabasetopredictpathwaysrelevanttoaquerygene