Cargando…

An efficient algorithm for de novo predictions of biochemical pathways between chemical compounds

BACKGROUND: Prediction of biochemical (metabolic) pathways has a wide range of applications, including the optimization of drug candidates, and the elucidation of toxicity mechanisms. Recently, several methods have been developed for pathway prediction to derive a goal compound from a start compound...

Descripción completa

Detalles Bibliográficos
Autores principales: Nakamura, Masaomi, Hachiya, Tsuyoshi, Saito, Yutaka, Sato, Kengo, Sakakibara, Yasubumi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3521390/
https://www.ncbi.nlm.nih.gov/pubmed/23282285
http://dx.doi.org/10.1186/1471-2105-13-S17-S8
_version_ 1782252946316066816
author Nakamura, Masaomi
Hachiya, Tsuyoshi
Saito, Yutaka
Sato, Kengo
Sakakibara, Yasubumi
author_facet Nakamura, Masaomi
Hachiya, Tsuyoshi
Saito, Yutaka
Sato, Kengo
Sakakibara, Yasubumi
author_sort Nakamura, Masaomi
collection PubMed
description BACKGROUND: Prediction of biochemical (metabolic) pathways has a wide range of applications, including the optimization of drug candidates, and the elucidation of toxicity mechanisms. Recently, several methods have been developed for pathway prediction to derive a goal compound from a start compound. However, these methods require high computational costs, and cannot perform comprehensive prediction of novel metabolic pathways. Our aim of this study is to develop a de novo prediction method for reconstructions of metabolic pathways and predictions of unknown biosynthetic pathways in the sense that it does not require any initial network such as KEGG metabolic network to be explored. RESULTS: We formulated pathway prediction between a start compound and a goal compound as the shortest path search problem in terms of the number of enzyme reactions applied. We propose an efficient search method based on A* algorithm and heuristic techniques utilizing Linear Programming (LP) solution for estimation of the distance to the goal. First, a chemical compound is represented by a feature vector which counts frequencies of substructure occurrences in the structural formula. Second, an enzyme reaction is represented as an operator vector by detecting the structural changes to compounds before and after the reaction. By defining compound vectors as nodes and operator vectors as edges, prediction of the reaction pathway is reduced to the shortest path search problem in the vector space. In experiments on the DDT degradation pathway, we verify that the shortest paths predicted by our method are biologically correct pathways registered in the KEGG database. The results also demonstrate that the LP heuristics can achieve significant reduction in computation time. Furthermore, we apply our method to a secondary metabolite pathway of plant origin, and successfully find a novel biochemical pathway which cannot be predicted by the existing method. For the reconstruction of a known biochemical pathway, our method is over 40 times as fast as the existing method. CONCLUSIONS: Our method enables fast and accurate de novo pathway predictions and novel pathway detection.
format Online
Article
Text
id pubmed-3521390
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-35213902012-12-14 An efficient algorithm for de novo predictions of biochemical pathways between chemical compounds Nakamura, Masaomi Hachiya, Tsuyoshi Saito, Yutaka Sato, Kengo Sakakibara, Yasubumi BMC Bioinformatics Proceedings BACKGROUND: Prediction of biochemical (metabolic) pathways has a wide range of applications, including the optimization of drug candidates, and the elucidation of toxicity mechanisms. Recently, several methods have been developed for pathway prediction to derive a goal compound from a start compound. However, these methods require high computational costs, and cannot perform comprehensive prediction of novel metabolic pathways. Our aim of this study is to develop a de novo prediction method for reconstructions of metabolic pathways and predictions of unknown biosynthetic pathways in the sense that it does not require any initial network such as KEGG metabolic network to be explored. RESULTS: We formulated pathway prediction between a start compound and a goal compound as the shortest path search problem in terms of the number of enzyme reactions applied. We propose an efficient search method based on A* algorithm and heuristic techniques utilizing Linear Programming (LP) solution for estimation of the distance to the goal. First, a chemical compound is represented by a feature vector which counts frequencies of substructure occurrences in the structural formula. Second, an enzyme reaction is represented as an operator vector by detecting the structural changes to compounds before and after the reaction. By defining compound vectors as nodes and operator vectors as edges, prediction of the reaction pathway is reduced to the shortest path search problem in the vector space. In experiments on the DDT degradation pathway, we verify that the shortest paths predicted by our method are biologically correct pathways registered in the KEGG database. The results also demonstrate that the LP heuristics can achieve significant reduction in computation time. Furthermore, we apply our method to a secondary metabolite pathway of plant origin, and successfully find a novel biochemical pathway which cannot be predicted by the existing method. For the reconstruction of a known biochemical pathway, our method is over 40 times as fast as the existing method. CONCLUSIONS: Our method enables fast and accurate de novo pathway predictions and novel pathway detection. BioMed Central 2012-12-07 /pmc/articles/PMC3521390/ /pubmed/23282285 http://dx.doi.org/10.1186/1471-2105-13-S17-S8 Text en Copyright ©2012 Nakamura et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Proceedings
Nakamura, Masaomi
Hachiya, Tsuyoshi
Saito, Yutaka
Sato, Kengo
Sakakibara, Yasubumi
An efficient algorithm for de novo predictions of biochemical pathways between chemical compounds
title An efficient algorithm for de novo predictions of biochemical pathways between chemical compounds
title_full An efficient algorithm for de novo predictions of biochemical pathways between chemical compounds
title_fullStr An efficient algorithm for de novo predictions of biochemical pathways between chemical compounds
title_full_unstemmed An efficient algorithm for de novo predictions of biochemical pathways between chemical compounds
title_short An efficient algorithm for de novo predictions of biochemical pathways between chemical compounds
title_sort efficient algorithm for de novo predictions of biochemical pathways between chemical compounds
topic Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3521390/
https://www.ncbi.nlm.nih.gov/pubmed/23282285
http://dx.doi.org/10.1186/1471-2105-13-S17-S8
work_keys_str_mv AT nakamuramasaomi anefficientalgorithmfordenovopredictionsofbiochemicalpathwaysbetweenchemicalcompounds
AT hachiyatsuyoshi anefficientalgorithmfordenovopredictionsofbiochemicalpathwaysbetweenchemicalcompounds
AT saitoyutaka anefficientalgorithmfordenovopredictionsofbiochemicalpathwaysbetweenchemicalcompounds
AT satokengo anefficientalgorithmfordenovopredictionsofbiochemicalpathwaysbetweenchemicalcompounds
AT sakakibarayasubumi anefficientalgorithmfordenovopredictionsofbiochemicalpathwaysbetweenchemicalcompounds
AT nakamuramasaomi efficientalgorithmfordenovopredictionsofbiochemicalpathwaysbetweenchemicalcompounds
AT hachiyatsuyoshi efficientalgorithmfordenovopredictionsofbiochemicalpathwaysbetweenchemicalcompounds
AT saitoyutaka efficientalgorithmfordenovopredictionsofbiochemicalpathwaysbetweenchemicalcompounds
AT satokengo efficientalgorithmfordenovopredictionsofbiochemicalpathwaysbetweenchemicalcompounds
AT sakakibarayasubumi efficientalgorithmfordenovopredictionsofbiochemicalpathwaysbetweenchemicalcompounds