Cargando…

Machine learning techniques to identify putative genes involved in nitrogen catabolite repression in the yeast Saccharomyces cerevisiae

BACKGROUND: Nitrogen is an essential nutrient for all life forms. Like most unicellular organisms, the yeast Saccharomyces cerevisiae transports and catabolizes good nitrogen sources in preference to poor ones. Nitrogen catabolite repression (NCR) refers to this selection mechanism. All known nitrog...

Descripción completa

Detalles Bibliográficos
Autores principales: Kontos, Kevin, Godard, Patrice, André, Bruno, van Helden, Jacques, Bontempi, Gianluca
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2008
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2654973/
https://www.ncbi.nlm.nih.gov/pubmed/19091052
_version_ 1782165427219070976
author Kontos, Kevin
Godard, Patrice
André, Bruno
van Helden, Jacques
Bontempi, Gianluca
author_facet Kontos, Kevin
Godard, Patrice
André, Bruno
van Helden, Jacques
Bontempi, Gianluca
author_sort Kontos, Kevin
collection PubMed
description BACKGROUND: Nitrogen is an essential nutrient for all life forms. Like most unicellular organisms, the yeast Saccharomyces cerevisiae transports and catabolizes good nitrogen sources in preference to poor ones. Nitrogen catabolite repression (NCR) refers to this selection mechanism. All known nitrogen catabolite pathways are regulated by four regulators. The ultimate goal is to infer the complete nitrogen catabolite pathways. Bioinformatics approaches offer the possibility to identify putative NCR genes and to discard uninteresting genes. RESULTS: We present a machine learning approach where the identification of putative NCR genes in the yeast Saccharomyces cerevisiae is formulated as a supervised two-class classification problem. Classifiers predict whether genes are NCR-sensitive or not from a large number of variables related to the GATA motif in the upstream non-coding sequences of the genes. The positive and negative training sets are composed of annotated NCR genes and manually-selected genes known to be insensitive to NCR, respectively. Different classifiers and variable selection methods are compared. We show that all classifiers make significant and biologically valid predictions by comparing these predictions to annotated and putative NCR genes, and by performing several negative controls. In particular, the inferred NCR genes significantly overlap with putative NCR genes identified in three genome-wide experimental and bioinformatics studies. CONCLUSION: These results suggest that our approach can successfully identify potential NCR genes. Hence, the dimensionality of the problem of identifying all genes involved in NCR is drastically reduced.
format Text
id pubmed-2654973
institution National Center for Biotechnology Information
language English
publishDate 2008
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-26549732009-03-13 Machine learning techniques to identify putative genes involved in nitrogen catabolite repression in the yeast Saccharomyces cerevisiae Kontos, Kevin Godard, Patrice André, Bruno van Helden, Jacques Bontempi, Gianluca BMC Proc Proceedings BACKGROUND: Nitrogen is an essential nutrient for all life forms. Like most unicellular organisms, the yeast Saccharomyces cerevisiae transports and catabolizes good nitrogen sources in preference to poor ones. Nitrogen catabolite repression (NCR) refers to this selection mechanism. All known nitrogen catabolite pathways are regulated by four regulators. The ultimate goal is to infer the complete nitrogen catabolite pathways. Bioinformatics approaches offer the possibility to identify putative NCR genes and to discard uninteresting genes. RESULTS: We present a machine learning approach where the identification of putative NCR genes in the yeast Saccharomyces cerevisiae is formulated as a supervised two-class classification problem. Classifiers predict whether genes are NCR-sensitive or not from a large number of variables related to the GATA motif in the upstream non-coding sequences of the genes. The positive and negative training sets are composed of annotated NCR genes and manually-selected genes known to be insensitive to NCR, respectively. Different classifiers and variable selection methods are compared. We show that all classifiers make significant and biologically valid predictions by comparing these predictions to annotated and putative NCR genes, and by performing several negative controls. In particular, the inferred NCR genes significantly overlap with putative NCR genes identified in three genome-wide experimental and bioinformatics studies. CONCLUSION: These results suggest that our approach can successfully identify potential NCR genes. Hence, the dimensionality of the problem of identifying all genes involved in NCR is drastically reduced. BioMed Central 2008-12-17 /pmc/articles/PMC2654973/ /pubmed/19091052 Text en Copyright © 2008 Kontos et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Proceedings
Kontos, Kevin
Godard, Patrice
André, Bruno
van Helden, Jacques
Bontempi, Gianluca
Machine learning techniques to identify putative genes involved in nitrogen catabolite repression in the yeast Saccharomyces cerevisiae
title Machine learning techniques to identify putative genes involved in nitrogen catabolite repression in the yeast Saccharomyces cerevisiae
title_full Machine learning techniques to identify putative genes involved in nitrogen catabolite repression in the yeast Saccharomyces cerevisiae
title_fullStr Machine learning techniques to identify putative genes involved in nitrogen catabolite repression in the yeast Saccharomyces cerevisiae
title_full_unstemmed Machine learning techniques to identify putative genes involved in nitrogen catabolite repression in the yeast Saccharomyces cerevisiae
title_short Machine learning techniques to identify putative genes involved in nitrogen catabolite repression in the yeast Saccharomyces cerevisiae
title_sort machine learning techniques to identify putative genes involved in nitrogen catabolite repression in the yeast saccharomyces cerevisiae
topic Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2654973/
https://www.ncbi.nlm.nih.gov/pubmed/19091052
work_keys_str_mv AT kontoskevin machinelearningtechniquestoidentifyputativegenesinvolvedinnitrogencataboliterepressionintheyeastsaccharomycescerevisiae
AT godardpatrice machinelearningtechniquestoidentifyputativegenesinvolvedinnitrogencataboliterepressionintheyeastsaccharomycescerevisiae
AT andrebruno machinelearningtechniquestoidentifyputativegenesinvolvedinnitrogencataboliterepressionintheyeastsaccharomycescerevisiae
AT vanheldenjacques machinelearningtechniquestoidentifyputativegenesinvolvedinnitrogencataboliterepressionintheyeastsaccharomycescerevisiae
AT bontempigianluca machinelearningtechniquestoidentifyputativegenesinvolvedinnitrogencataboliterepressionintheyeastsaccharomycescerevisiae