Cargando…

Broad functional profiling of fission yeast proteins using phenomics and machine learning

Many proteins remain poorly characterized even in well-studied organisms, presenting a bottleneck for research. We applied phenomics and machine-learning approaches with Schizosaccharomyces pombe for broad cues on protein functions. We assayed colony-growth phenotypes to measure the fitness of delet...

Descripción completa

Detalles Bibliográficos
Autores principales: Rodríguez-López, María, Bordin, Nicola, Lees, Jon, Scholes, Harry, Hassan, Shaimaa, Saintain, Quentin, Kamrad, Stephan, Orengo, Christine, Bähler, Jürg
Formato: Online Artículo Texto
Lenguaje:English
Publicado: eLife Sciences Publications, Ltd 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10547477/
https://www.ncbi.nlm.nih.gov/pubmed/37787768
http://dx.doi.org/10.7554/eLife.88229
_version_ 1785115066814693376
author Rodríguez-López, María
Bordin, Nicola
Lees, Jon
Scholes, Harry
Hassan, Shaimaa
Saintain, Quentin
Kamrad, Stephan
Orengo, Christine
Bähler, Jürg
author_facet Rodríguez-López, María
Bordin, Nicola
Lees, Jon
Scholes, Harry
Hassan, Shaimaa
Saintain, Quentin
Kamrad, Stephan
Orengo, Christine
Bähler, Jürg
author_sort Rodríguez-López, María
collection PubMed
description Many proteins remain poorly characterized even in well-studied organisms, presenting a bottleneck for research. We applied phenomics and machine-learning approaches with Schizosaccharomyces pombe for broad cues on protein functions. We assayed colony-growth phenotypes to measure the fitness of deletion mutants for 3509 non-essential genes in 131 conditions with different nutrients, drugs, and stresses. These analyses exposed phenotypes for 3492 mutants, including 124 mutants of ‘priority unstudied’ proteins conserved in humans, providing varied functional clues. For example, over 900 proteins were newly implicated in the resistance to oxidative stress. Phenotype-correlation networks suggested roles for poorly characterized proteins through ‘guilt by association’ with known proteins. For complementary functional insights, we predicted Gene Ontology (GO) terms using machine learning methods exploiting protein-network and protein-homology data (NET-FF). We obtained 56,594 high-scoring GO predictions, of which 22,060 also featured high information content. Our phenotype-correlation data and NET-FF predictions showed a strong concordance with existing PomBase GO annotations and protein networks, with integrated analyses revealing 1675 novel GO predictions for 783 genes, including 47 predictions for 23 priority unstudied proteins. Experimental validation identified new proteins involved in cellular aging, showing that these predictions and phenomics data provide a rich resource to uncover new protein functions.
format Online
Article
Text
id pubmed-10547477
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher eLife Sciences Publications, Ltd
record_format MEDLINE/PubMed
spelling pubmed-105474772023-10-04 Broad functional profiling of fission yeast proteins using phenomics and machine learning Rodríguez-López, María Bordin, Nicola Lees, Jon Scholes, Harry Hassan, Shaimaa Saintain, Quentin Kamrad, Stephan Orengo, Christine Bähler, Jürg eLife Computational and Systems Biology Many proteins remain poorly characterized even in well-studied organisms, presenting a bottleneck for research. We applied phenomics and machine-learning approaches with Schizosaccharomyces pombe for broad cues on protein functions. We assayed colony-growth phenotypes to measure the fitness of deletion mutants for 3509 non-essential genes in 131 conditions with different nutrients, drugs, and stresses. These analyses exposed phenotypes for 3492 mutants, including 124 mutants of ‘priority unstudied’ proteins conserved in humans, providing varied functional clues. For example, over 900 proteins were newly implicated in the resistance to oxidative stress. Phenotype-correlation networks suggested roles for poorly characterized proteins through ‘guilt by association’ with known proteins. For complementary functional insights, we predicted Gene Ontology (GO) terms using machine learning methods exploiting protein-network and protein-homology data (NET-FF). We obtained 56,594 high-scoring GO predictions, of which 22,060 also featured high information content. Our phenotype-correlation data and NET-FF predictions showed a strong concordance with existing PomBase GO annotations and protein networks, with integrated analyses revealing 1675 novel GO predictions for 783 genes, including 47 predictions for 23 priority unstudied proteins. Experimental validation identified new proteins involved in cellular aging, showing that these predictions and phenomics data provide a rich resource to uncover new protein functions. eLife Sciences Publications, Ltd 2023-10-03 /pmc/articles/PMC10547477/ /pubmed/37787768 http://dx.doi.org/10.7554/eLife.88229 Text en © 2023, Rodríguez-López, Bordin et al https://creativecommons.org/licenses/by/4.0/This article is distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use and redistribution provided that the original author and source are credited.
spellingShingle Computational and Systems Biology
Rodríguez-López, María
Bordin, Nicola
Lees, Jon
Scholes, Harry
Hassan, Shaimaa
Saintain, Quentin
Kamrad, Stephan
Orengo, Christine
Bähler, Jürg
Broad functional profiling of fission yeast proteins using phenomics and machine learning
title Broad functional profiling of fission yeast proteins using phenomics and machine learning
title_full Broad functional profiling of fission yeast proteins using phenomics and machine learning
title_fullStr Broad functional profiling of fission yeast proteins using phenomics and machine learning
title_full_unstemmed Broad functional profiling of fission yeast proteins using phenomics and machine learning
title_short Broad functional profiling of fission yeast proteins using phenomics and machine learning
title_sort broad functional profiling of fission yeast proteins using phenomics and machine learning
topic Computational and Systems Biology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10547477/
https://www.ncbi.nlm.nih.gov/pubmed/37787768
http://dx.doi.org/10.7554/eLife.88229
work_keys_str_mv AT rodriguezlopezmaria broadfunctionalprofilingoffissionyeastproteinsusingphenomicsandmachinelearning
AT bordinnicola broadfunctionalprofilingoffissionyeastproteinsusingphenomicsandmachinelearning
AT leesjon broadfunctionalprofilingoffissionyeastproteinsusingphenomicsandmachinelearning
AT scholesharry broadfunctionalprofilingoffissionyeastproteinsusingphenomicsandmachinelearning
AT hassanshaimaa broadfunctionalprofilingoffissionyeastproteinsusingphenomicsandmachinelearning
AT saintainquentin broadfunctionalprofilingoffissionyeastproteinsusingphenomicsandmachinelearning
AT kamradstephan broadfunctionalprofilingoffissionyeastproteinsusingphenomicsandmachinelearning
AT orengochristine broadfunctionalprofilingoffissionyeastproteinsusingphenomicsandmachinelearning
AT bahlerjurg broadfunctionalprofilingoffissionyeastproteinsusingphenomicsandmachinelearning