Cargando…

Automation of gene assignments to metabolic pathways using high-throughput expression data

BACKGROUND: Accurate assignment of genes to pathways is essential in order to understand the functional role of genes and to map the existing pathways in a given genome. Existing algorithms predict pathways by extrapolating experimental data in one organism to other organisms for which this data is...

Descripción completa

Detalles Bibliográficos
Autores principales: Popescu, Liviu, Yona, Golan
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2005
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1239907/
https://www.ncbi.nlm.nih.gov/pubmed/16135255
http://dx.doi.org/10.1186/1471-2105-6-217
_version_ 1782125030496272384
author Popescu, Liviu
Yona, Golan
author_facet Popescu, Liviu
Yona, Golan
author_sort Popescu, Liviu
collection PubMed
description BACKGROUND: Accurate assignment of genes to pathways is essential in order to understand the functional role of genes and to map the existing pathways in a given genome. Existing algorithms predict pathways by extrapolating experimental data in one organism to other organisms for which this data is not available. However, current systems classify all genes that belong to a specific EC family to all the pathways that contain the corresponding enzymatic reaction, and thus introduce ambiguity. RESULTS: Here we describe an algorithm for assignment of genes to cellular pathways that addresses this problem by selectively assigning specific genes to pathways. Our algorithm uses the set of experimentally elucidated metabolic pathways from MetaCyc, together with statistical models of enzyme families and expression data to assign genes to enzyme families and pathways by optimizing correlated co-expression, while minimizing conflicts due to shared assignments among pathways. Our algorithm also identifies alternative ("backup") genes and addresses the multi-domain nature of proteins. We apply our model to assign genes to pathways in the Yeast genome and compare the results for genes that were assigned experimentally. Our assignments are consistent with the experimentally verified assignments and reflect characteristic properties of cellular pathways. CONCLUSION: We present an algorithm for automatic assignment of genes to metabolic pathways. The algorithm utilizes expression data and reduces the ambiguity that characterizes assignments that are based only on EC numbers.
format Text
id pubmed-1239907
institution National Center for Biotechnology Information
language English
publishDate 2005
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-12399072005-10-03 Automation of gene assignments to metabolic pathways using high-throughput expression data Popescu, Liviu Yona, Golan BMC Bioinformatics Methodology Article BACKGROUND: Accurate assignment of genes to pathways is essential in order to understand the functional role of genes and to map the existing pathways in a given genome. Existing algorithms predict pathways by extrapolating experimental data in one organism to other organisms for which this data is not available. However, current systems classify all genes that belong to a specific EC family to all the pathways that contain the corresponding enzymatic reaction, and thus introduce ambiguity. RESULTS: Here we describe an algorithm for assignment of genes to cellular pathways that addresses this problem by selectively assigning specific genes to pathways. Our algorithm uses the set of experimentally elucidated metabolic pathways from MetaCyc, together with statistical models of enzyme families and expression data to assign genes to enzyme families and pathways by optimizing correlated co-expression, while minimizing conflicts due to shared assignments among pathways. Our algorithm also identifies alternative ("backup") genes and addresses the multi-domain nature of proteins. We apply our model to assign genes to pathways in the Yeast genome and compare the results for genes that were assigned experimentally. Our assignments are consistent with the experimentally verified assignments and reflect characteristic properties of cellular pathways. CONCLUSION: We present an algorithm for automatic assignment of genes to metabolic pathways. The algorithm utilizes expression data and reduces the ambiguity that characterizes assignments that are based only on EC numbers. BioMed Central 2005-08-31 /pmc/articles/PMC1239907/ /pubmed/16135255 http://dx.doi.org/10.1186/1471-2105-6-217 Text en Copyright © 2005 Popescu and Yona; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Popescu, Liviu
Yona, Golan
Automation of gene assignments to metabolic pathways using high-throughput expression data
title Automation of gene assignments to metabolic pathways using high-throughput expression data
title_full Automation of gene assignments to metabolic pathways using high-throughput expression data
title_fullStr Automation of gene assignments to metabolic pathways using high-throughput expression data
title_full_unstemmed Automation of gene assignments to metabolic pathways using high-throughput expression data
title_short Automation of gene assignments to metabolic pathways using high-throughput expression data
title_sort automation of gene assignments to metabolic pathways using high-throughput expression data
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1239907/
https://www.ncbi.nlm.nih.gov/pubmed/16135255
http://dx.doi.org/10.1186/1471-2105-6-217
work_keys_str_mv AT popesculiviu automationofgeneassignmentstometabolicpathwaysusinghighthroughputexpressiondata
AT yonagolan automationofgeneassignmentstometabolicpathwaysusinghighthroughputexpressiondata