Cargando…
Predicting pathway membership via domain signatures
Motivation: Functional characterization of genes is of great importance for the understanding of complex cellular processes. Valuable information for this purpose can be obtained from pathway databases, like KEGG. However, only a small fraction of genes is annotated with pathway information up to no...
Autores principales: | , , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2008
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2553439/ https://www.ncbi.nlm.nih.gov/pubmed/18676972 http://dx.doi.org/10.1093/bioinformatics/btn403 |
_version_ | 1782159507004063744 |
---|---|
author | Fröhlich, Holger Fellmann, Mark Sültmann, Holger Poustka, Annemarie Beißbarth, Tim |
author_facet | Fröhlich, Holger Fellmann, Mark Sültmann, Holger Poustka, Annemarie Beißbarth, Tim |
author_sort | Fröhlich, Holger |
collection | PubMed |
description | Motivation: Functional characterization of genes is of great importance for the understanding of complex cellular processes. Valuable information for this purpose can be obtained from pathway databases, like KEGG. However, only a small fraction of genes is annotated with pathway information up to now. In contrast, information on contained protein domains can be obtained for a significantly higher number of genes, e.g. from the InterPro database. Results: We present a classification model, which for a specific gene of interest can predict the mapping to a KEGG pathway, based on its domain signature. The classifier makes explicit use of the hierarchical organization of pathways in the KEGG database. Furthermore, we take into account that a specific gene can be mapped to different pathways at the same time. The classification method produces a scoring of all possible mapping positions of the gene in the KEGG hierarchy. Evaluations of our model, which is a combination of a SVM and ranking perceptron approach, show a high prediction performance. Moreover, for signaling pathways we reveal that it is even possible to forecast accurately the membership to individual pathway components. Availability: The R package gene2pathway is a supplement to this article. Contact: h.froehlich@dkfz-heidelberg.de Supplementary Information: Supplementary data are available at Bioinformatics online. |
format | Text |
id | pubmed-2553439 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2008 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-25534392009-02-25 Predicting pathway membership via domain signatures Fröhlich, Holger Fellmann, Mark Sültmann, Holger Poustka, Annemarie Beißbarth, Tim Bioinformatics Original Papers Motivation: Functional characterization of genes is of great importance for the understanding of complex cellular processes. Valuable information for this purpose can be obtained from pathway databases, like KEGG. However, only a small fraction of genes is annotated with pathway information up to now. In contrast, information on contained protein domains can be obtained for a significantly higher number of genes, e.g. from the InterPro database. Results: We present a classification model, which for a specific gene of interest can predict the mapping to a KEGG pathway, based on its domain signature. The classifier makes explicit use of the hierarchical organization of pathways in the KEGG database. Furthermore, we take into account that a specific gene can be mapped to different pathways at the same time. The classification method produces a scoring of all possible mapping positions of the gene in the KEGG hierarchy. Evaluations of our model, which is a combination of a SVM and ranking perceptron approach, show a high prediction performance. Moreover, for signaling pathways we reveal that it is even possible to forecast accurately the membership to individual pathway components. Availability: The R package gene2pathway is a supplement to this article. Contact: h.froehlich@dkfz-heidelberg.de Supplementary Information: Supplementary data are available at Bioinformatics online. Oxford University Press 2008-10-01 2008-08-01 /pmc/articles/PMC2553439/ /pubmed/18676972 http://dx.doi.org/10.1093/bioinformatics/btn403 Text en © 2008 The Author(s) http://creativecommons.org/licenses/by-nc/2.0/uk/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.0/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Original Papers Fröhlich, Holger Fellmann, Mark Sültmann, Holger Poustka, Annemarie Beißbarth, Tim Predicting pathway membership via domain signatures |
title | Predicting pathway membership via domain signatures |
title_full | Predicting pathway membership via domain signatures |
title_fullStr | Predicting pathway membership via domain signatures |
title_full_unstemmed | Predicting pathway membership via domain signatures |
title_short | Predicting pathway membership via domain signatures |
title_sort | predicting pathway membership via domain signatures |
topic | Original Papers |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2553439/ https://www.ncbi.nlm.nih.gov/pubmed/18676972 http://dx.doi.org/10.1093/bioinformatics/btn403 |
work_keys_str_mv | AT frohlichholger predictingpathwaymembershipviadomainsignatures AT fellmannmark predictingpathwaymembershipviadomainsignatures AT sultmannholger predictingpathwaymembershipviadomainsignatures AT poustkaannemarie predictingpathwaymembershipviadomainsignatures AT beißbarthtim predictingpathwaymembershipviadomainsignatures |