Cargando…
Machine learning techniques to identify putative genes involved in nitrogen catabolite repression in the yeast Saccharomyces cerevisiae
BACKGROUND: Nitrogen is an essential nutrient for all life forms. Like most unicellular organisms, the yeast Saccharomyces cerevisiae transports and catabolizes good nitrogen sources in preference to poor ones. Nitrogen catabolite repression (NCR) refers to this selection mechanism. All known nitrog...
Autores principales: | , , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2008
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2654973/ https://www.ncbi.nlm.nih.gov/pubmed/19091052 |
_version_ | 1782165427219070976 |
---|---|
author | Kontos, Kevin Godard, Patrice André, Bruno van Helden, Jacques Bontempi, Gianluca |
author_facet | Kontos, Kevin Godard, Patrice André, Bruno van Helden, Jacques Bontempi, Gianluca |
author_sort | Kontos, Kevin |
collection | PubMed |
description | BACKGROUND: Nitrogen is an essential nutrient for all life forms. Like most unicellular organisms, the yeast Saccharomyces cerevisiae transports and catabolizes good nitrogen sources in preference to poor ones. Nitrogen catabolite repression (NCR) refers to this selection mechanism. All known nitrogen catabolite pathways are regulated by four regulators. The ultimate goal is to infer the complete nitrogen catabolite pathways. Bioinformatics approaches offer the possibility to identify putative NCR genes and to discard uninteresting genes. RESULTS: We present a machine learning approach where the identification of putative NCR genes in the yeast Saccharomyces cerevisiae is formulated as a supervised two-class classification problem. Classifiers predict whether genes are NCR-sensitive or not from a large number of variables related to the GATA motif in the upstream non-coding sequences of the genes. The positive and negative training sets are composed of annotated NCR genes and manually-selected genes known to be insensitive to NCR, respectively. Different classifiers and variable selection methods are compared. We show that all classifiers make significant and biologically valid predictions by comparing these predictions to annotated and putative NCR genes, and by performing several negative controls. In particular, the inferred NCR genes significantly overlap with putative NCR genes identified in three genome-wide experimental and bioinformatics studies. CONCLUSION: These results suggest that our approach can successfully identify potential NCR genes. Hence, the dimensionality of the problem of identifying all genes involved in NCR is drastically reduced. |
format | Text |
id | pubmed-2654973 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2008 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-26549732009-03-13 Machine learning techniques to identify putative genes involved in nitrogen catabolite repression in the yeast Saccharomyces cerevisiae Kontos, Kevin Godard, Patrice André, Bruno van Helden, Jacques Bontempi, Gianluca BMC Proc Proceedings BACKGROUND: Nitrogen is an essential nutrient for all life forms. Like most unicellular organisms, the yeast Saccharomyces cerevisiae transports and catabolizes good nitrogen sources in preference to poor ones. Nitrogen catabolite repression (NCR) refers to this selection mechanism. All known nitrogen catabolite pathways are regulated by four regulators. The ultimate goal is to infer the complete nitrogen catabolite pathways. Bioinformatics approaches offer the possibility to identify putative NCR genes and to discard uninteresting genes. RESULTS: We present a machine learning approach where the identification of putative NCR genes in the yeast Saccharomyces cerevisiae is formulated as a supervised two-class classification problem. Classifiers predict whether genes are NCR-sensitive or not from a large number of variables related to the GATA motif in the upstream non-coding sequences of the genes. The positive and negative training sets are composed of annotated NCR genes and manually-selected genes known to be insensitive to NCR, respectively. Different classifiers and variable selection methods are compared. We show that all classifiers make significant and biologically valid predictions by comparing these predictions to annotated and putative NCR genes, and by performing several negative controls. In particular, the inferred NCR genes significantly overlap with putative NCR genes identified in three genome-wide experimental and bioinformatics studies. CONCLUSION: These results suggest that our approach can successfully identify potential NCR genes. Hence, the dimensionality of the problem of identifying all genes involved in NCR is drastically reduced. BioMed Central 2008-12-17 /pmc/articles/PMC2654973/ /pubmed/19091052 Text en Copyright © 2008 Kontos et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Proceedings Kontos, Kevin Godard, Patrice André, Bruno van Helden, Jacques Bontempi, Gianluca Machine learning techniques to identify putative genes involved in nitrogen catabolite repression in the yeast Saccharomyces cerevisiae |
title | Machine learning techniques to identify putative genes involved in nitrogen catabolite repression in the yeast Saccharomyces cerevisiae |
title_full | Machine learning techniques to identify putative genes involved in nitrogen catabolite repression in the yeast Saccharomyces cerevisiae |
title_fullStr | Machine learning techniques to identify putative genes involved in nitrogen catabolite repression in the yeast Saccharomyces cerevisiae |
title_full_unstemmed | Machine learning techniques to identify putative genes involved in nitrogen catabolite repression in the yeast Saccharomyces cerevisiae |
title_short | Machine learning techniques to identify putative genes involved in nitrogen catabolite repression in the yeast Saccharomyces cerevisiae |
title_sort | machine learning techniques to identify putative genes involved in nitrogen catabolite repression in the yeast saccharomyces cerevisiae |
topic | Proceedings |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2654973/ https://www.ncbi.nlm.nih.gov/pubmed/19091052 |
work_keys_str_mv | AT kontoskevin machinelearningtechniquestoidentifyputativegenesinvolvedinnitrogencataboliterepressionintheyeastsaccharomycescerevisiae AT godardpatrice machinelearningtechniquestoidentifyputativegenesinvolvedinnitrogencataboliterepressionintheyeastsaccharomycescerevisiae AT andrebruno machinelearningtechniquestoidentifyputativegenesinvolvedinnitrogencataboliterepressionintheyeastsaccharomycescerevisiae AT vanheldenjacques machinelearningtechniquestoidentifyputativegenesinvolvedinnitrogencataboliterepressionintheyeastsaccharomycescerevisiae AT bontempigianluca machinelearningtechniquestoidentifyputativegenesinvolvedinnitrogencataboliterepressionintheyeastsaccharomycescerevisiae |