Cargando…

Identifying essential genes in bacterial metabolic networks with machine learning methods

BACKGROUND: Identifying essential genes in bacteria supports to identify potential drug targets and an understanding of minimal requirements for a synthetic cell. However, experimentally assaying the essentiality of their coding genes is resource intensive and not feasible for all bacterial organism...

Descripción completa

Detalles Bibliográficos
Autores principales: Plaimas, Kitiporn, Eils, Roland, König, Rainer
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2874528/
https://www.ncbi.nlm.nih.gov/pubmed/20438628
http://dx.doi.org/10.1186/1752-0509-4-56
_version_ 1782181488645636096
author Plaimas, Kitiporn
Eils, Roland
König, Rainer
author_facet Plaimas, Kitiporn
Eils, Roland
König, Rainer
author_sort Plaimas, Kitiporn
collection PubMed
description BACKGROUND: Identifying essential genes in bacteria supports to identify potential drug targets and an understanding of minimal requirements for a synthetic cell. However, experimentally assaying the essentiality of their coding genes is resource intensive and not feasible for all bacterial organisms, in particular if they are infective. RESULTS: We developed a machine learning technique to identify essential genes using the experimental data of genome-wide knock-out screens from one bacterial organism to infer essential genes of another related bacterial organism. We used a broad variety of topological features, sequence characteristics and co-expression properties potentially associated with essentiality, such as flux deviations, centrality, codon frequencies of the sequences, co-regulation and phyletic retention. An organism-wise cross-validation on bacterial species yielded reliable results with good accuracies (area under the receiver-operator-curve of 75% - 81%). Finally, it was applied to drug target predictions for Salmonella typhimurium. We compared our predictions to the viability of experimental knock-outs of S. typhimurium and identified 35 enzymes, which are highly relevant to be considered as potential drug targets. Specifically, we detected promising drug targets in the non-mevalonate pathway. CONCLUSIONS: Using elaborated features characterizing network topology, sequence information and microarray data enables to predict essential genes from a bacterial reference organism to a related query organism without any knowledge about the essentiality of genes of the query organism. In general, such a method is beneficial for inferring drug targets when experimental data about genome-wide knockout screens is not available for the investigated organism.
format Text
id pubmed-2874528
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-28745282010-05-22 Identifying essential genes in bacterial metabolic networks with machine learning methods Plaimas, Kitiporn Eils, Roland König, Rainer BMC Syst Biol Methodology article BACKGROUND: Identifying essential genes in bacteria supports to identify potential drug targets and an understanding of minimal requirements for a synthetic cell. However, experimentally assaying the essentiality of their coding genes is resource intensive and not feasible for all bacterial organisms, in particular if they are infective. RESULTS: We developed a machine learning technique to identify essential genes using the experimental data of genome-wide knock-out screens from one bacterial organism to infer essential genes of another related bacterial organism. We used a broad variety of topological features, sequence characteristics and co-expression properties potentially associated with essentiality, such as flux deviations, centrality, codon frequencies of the sequences, co-regulation and phyletic retention. An organism-wise cross-validation on bacterial species yielded reliable results with good accuracies (area under the receiver-operator-curve of 75% - 81%). Finally, it was applied to drug target predictions for Salmonella typhimurium. We compared our predictions to the viability of experimental knock-outs of S. typhimurium and identified 35 enzymes, which are highly relevant to be considered as potential drug targets. Specifically, we detected promising drug targets in the non-mevalonate pathway. CONCLUSIONS: Using elaborated features characterizing network topology, sequence information and microarray data enables to predict essential genes from a bacterial reference organism to a related query organism without any knowledge about the essentiality of genes of the query organism. In general, such a method is beneficial for inferring drug targets when experimental data about genome-wide knockout screens is not available for the investigated organism. BioMed Central 2010-05-03 /pmc/articles/PMC2874528/ /pubmed/20438628 http://dx.doi.org/10.1186/1752-0509-4-56 Text en Copyright ©2010 Plaimas et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology article
Plaimas, Kitiporn
Eils, Roland
König, Rainer
Identifying essential genes in bacterial metabolic networks with machine learning methods
title Identifying essential genes in bacterial metabolic networks with machine learning methods
title_full Identifying essential genes in bacterial metabolic networks with machine learning methods
title_fullStr Identifying essential genes in bacterial metabolic networks with machine learning methods
title_full_unstemmed Identifying essential genes in bacterial metabolic networks with machine learning methods
title_short Identifying essential genes in bacterial metabolic networks with machine learning methods
title_sort identifying essential genes in bacterial metabolic networks with machine learning methods
topic Methodology article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2874528/
https://www.ncbi.nlm.nih.gov/pubmed/20438628
http://dx.doi.org/10.1186/1752-0509-4-56
work_keys_str_mv AT plaimaskitiporn identifyingessentialgenesinbacterialmetabolicnetworkswithmachinelearningmethods
AT eilsroland identifyingessentialgenesinbacterialmetabolicnetworkswithmachinelearningmethods
AT konigrainer identifyingessentialgenesinbacterialmetabolicnetworkswithmachinelearningmethods