Cargando…

Metabolic network prediction through pairwise rational kernels

BACKGROUND: Metabolic networks are represented by the set of metabolic pathways. Metabolic pathways are a series of biochemical reactions, in which the product (output) from one reaction serves as the substrate (input) to another reaction. Many pathways remain incompletely characterized. One of the...

Descripción completa

Detalles Bibliográficos
Autores principales: Roche-Lima, Abiel, Domaratzki, Michael, Fristensky, Brian
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4261252/
https://www.ncbi.nlm.nih.gov/pubmed/25260372
http://dx.doi.org/10.1186/1471-2105-15-318
_version_ 1782348279553458176
author Roche-Lima, Abiel
Domaratzki, Michael
Fristensky, Brian
author_facet Roche-Lima, Abiel
Domaratzki, Michael
Fristensky, Brian
author_sort Roche-Lima, Abiel
collection PubMed
description BACKGROUND: Metabolic networks are represented by the set of metabolic pathways. Metabolic pathways are a series of biochemical reactions, in which the product (output) from one reaction serves as the substrate (input) to another reaction. Many pathways remain incompletely characterized. One of the major challenges of computational biology is to obtain better models of metabolic pathways. Existing models are dependent on the annotation of the genes. This propagates error accumulation when the pathways are predicted by incorrectly annotated genes. Pairwise classification methods are supervised learning methods used to classify new pair of entities. Some of these classification methods, e.g., Pairwise Support Vector Machines (SVMs), use pairwise kernels. Pairwise kernels describe similarity measures between two pairs of entities. Using pairwise kernels to handle sequence data requires long processing times and large storage. Rational kernels are kernels based on weighted finite-state transducers that represent similarity measures between sequences or automata. They have been effectively used in problems that handle large amount of sequence information such as protein essentiality, natural language processing and machine translations. RESULTS: We create a new family of pairwise kernels using weighted finite-state transducers (called Pairwise Rational Kernel (PRK)) to predict metabolic pathways from a variety of biological data. PRKs take advantage of the simpler representations and faster algorithms of transducers. Because raw sequence data can be used, the predictor model avoids the errors introduced by incorrect gene annotations. We then developed several experiments with PRKs and Pairwise SVM to validate our methods using the metabolic network of Saccharomyces cerevisiae. As a result, when PRKs are used, our method executes faster in comparison with other pairwise kernels. Also, when we use PRKs combined with other simple kernels that include evolutionary information, the accuracy values have been improved, while maintaining lower construction and execution times. CONCLUSIONS: The power of using kernels is that almost any sort of data can be represented using kernels. Therefore, completely disparate types of data can be combined to add power to kernel-based machine learning methods. When we compared our proposal using PRKs with other similar kernel, the execution times were decreased, with no compromise of accuracy. We also proved that by combining PRKs with other kernels that include evolutionary information, the accuracy can also also be improved. As our proposal can use any type of sequence data, genes do not need to be properly annotated, avoiding accumulation errors because of incorrect previous annotations.
format Online
Article
Text
id pubmed-4261252
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-42612522014-12-10 Metabolic network prediction through pairwise rational kernels Roche-Lima, Abiel Domaratzki, Michael Fristensky, Brian BMC Bioinformatics Research Article BACKGROUND: Metabolic networks are represented by the set of metabolic pathways. Metabolic pathways are a series of biochemical reactions, in which the product (output) from one reaction serves as the substrate (input) to another reaction. Many pathways remain incompletely characterized. One of the major challenges of computational biology is to obtain better models of metabolic pathways. Existing models are dependent on the annotation of the genes. This propagates error accumulation when the pathways are predicted by incorrectly annotated genes. Pairwise classification methods are supervised learning methods used to classify new pair of entities. Some of these classification methods, e.g., Pairwise Support Vector Machines (SVMs), use pairwise kernels. Pairwise kernels describe similarity measures between two pairs of entities. Using pairwise kernels to handle sequence data requires long processing times and large storage. Rational kernels are kernels based on weighted finite-state transducers that represent similarity measures between sequences or automata. They have been effectively used in problems that handle large amount of sequence information such as protein essentiality, natural language processing and machine translations. RESULTS: We create a new family of pairwise kernels using weighted finite-state transducers (called Pairwise Rational Kernel (PRK)) to predict metabolic pathways from a variety of biological data. PRKs take advantage of the simpler representations and faster algorithms of transducers. Because raw sequence data can be used, the predictor model avoids the errors introduced by incorrect gene annotations. We then developed several experiments with PRKs and Pairwise SVM to validate our methods using the metabolic network of Saccharomyces cerevisiae. As a result, when PRKs are used, our method executes faster in comparison with other pairwise kernels. Also, when we use PRKs combined with other simple kernels that include evolutionary information, the accuracy values have been improved, while maintaining lower construction and execution times. CONCLUSIONS: The power of using kernels is that almost any sort of data can be represented using kernels. Therefore, completely disparate types of data can be combined to add power to kernel-based machine learning methods. When we compared our proposal using PRKs with other similar kernel, the execution times were decreased, with no compromise of accuracy. We also proved that by combining PRKs with other kernels that include evolutionary information, the accuracy can also also be improved. As our proposal can use any type of sequence data, genes do not need to be properly annotated, avoiding accumulation errors because of incorrect previous annotations. BioMed Central 2014-09-26 /pmc/articles/PMC4261252/ /pubmed/25260372 http://dx.doi.org/10.1186/1471-2105-15-318 Text en © Roche-Lima et al.; licensee BioMed Central Ltd. 2014 This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Roche-Lima, Abiel
Domaratzki, Michael
Fristensky, Brian
Metabolic network prediction through pairwise rational kernels
title Metabolic network prediction through pairwise rational kernels
title_full Metabolic network prediction through pairwise rational kernels
title_fullStr Metabolic network prediction through pairwise rational kernels
title_full_unstemmed Metabolic network prediction through pairwise rational kernels
title_short Metabolic network prediction through pairwise rational kernels
title_sort metabolic network prediction through pairwise rational kernels
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4261252/
https://www.ncbi.nlm.nih.gov/pubmed/25260372
http://dx.doi.org/10.1186/1471-2105-15-318
work_keys_str_mv AT rochelimaabiel metabolicnetworkpredictionthroughpairwiserationalkernels
AT domaratzkimichael metabolicnetworkpredictionthroughpairwiserationalkernels
AT fristenskybrian metabolicnetworkpredictionthroughpairwiserationalkernels