Gene Function Hypotheses for the Campylobacter jejuni Glycome Generated by a Logic-Based Approach

Increasingly, experimental data on biological systems are obtained from several sources and computational approaches are required to integrate this information and derive models for the function of the system. Here, we demonstrate the power of a logic-based machine learning approach to propose hypot...

Bibliographic Details
Main authors: Sternberg, Michael J.E., Tamaddoni-Nezhad, Alireza, Lesk, Victor I., Kay, Emily, Hitchen, Paul G., Cootes, Adrian, van Alphen, Lieke B., Lamoureux, Marc P., Jarrell, Harold C., Rawlings, Christopher J., Soo, Evelyn C., Szymanski, Christine M., Dell, Anne, Wren, Brendan W., Muggleton, Stephen H.
Format: Online Article Text
Language: English
Published: Elsevier 2013
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3546167/
https://www.ncbi.nlm.nih.gov/pubmed/23103756
http://dx.doi.org/10.1016/j.jmb.2012.10.014
author Sternberg, Michael J.E.
Tamaddoni-Nezhad, Alireza
Lesk, Victor I.
Kay, Emily
Hitchen, Paul G.
Cootes, Adrian
van Alphen, Lieke B.
Lamoureux, Marc P.
Jarrell, Harold C.
Rawlings, Christopher J.
Soo, Evelyn C.
Szymanski, Christine M.
Dell, Anne
Wren, Brendan W.
Muggleton, Stephen H.
author_sort Sternberg, Michael J.E.
collection PubMed
description Increasingly, experimental data on biological systems are obtained from several sources and computational approaches are required to integrate this information and derive models for the function of the system. Here, we demonstrate the power of a logic-based machine learning approach to propose hypotheses for gene function integrating information from two diverse experimental approaches. Specifically, we use inductive logic programming that automatically proposes hypotheses explaining the empirical data with respect to logically encoded background knowledge. We study the capsular polysaccharide biosynthetic pathway of the major human gastrointestinal pathogen Campylobacter jejuni. We consider several key steps in the formation of capsular polysaccharide consisting of 15 genes of which 8 have assigned function, and we explore the extent to which functions can be hypothesised for the remaining 7. Two sources of experimental data provide the information for learning—the results of knockout experiments on the genes involved in capsule formation and the absence/presence of capsule genes in a multitude of strains of different serotypes. The machine learning uses the pathway structure as background knowledge. We propose assignments of specific genes to five previously unassigned reaction steps. For four of these steps, there was an unambiguous optimal assignment of gene to reaction, and to the fifth, there were three candidate genes. Several of these assignments were consistent with additional experimental results. We therefore show that the logic-based methodology provides a robust strategy to integrate results from different experimental approaches and propose hypotheses for the behaviour of a biological system.
format Online
Article
Text
id pubmed-3546167
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-3546167 2013-01-16 J Mol Biol Article Elsevier 2013-01-09 /pmc/articles/PMC3546167/ /pubmed/23103756 http://dx.doi.org/10.1016/j.jmb.2012.10.014 Text en © 2013 Elsevier Ltd. Open Access under CC BY 3.0 (https://creativecommons.org/licenses/by/3.0/) license
title Gene Function Hypotheses for the Campylobacter jejuni Glycome Generated by a Logic-Based Approach
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3546167/
https://www.ncbi.nlm.nih.gov/pubmed/23103756
http://dx.doi.org/10.1016/j.jmb.2012.10.014