Cargando…
Broad functional profiling of fission yeast proteins using phenomics and machine learning
Many proteins remain poorly characterized even in well-studied organisms, presenting a bottleneck for research. We applied phenomics and machine-learning approaches with Schizosaccharomyces pombe for broad cues on protein functions. We assayed colony-growth phenotypes to measure the fitness of delet...
Autores principales: | , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
eLife Sciences Publications, Ltd
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10547477/ https://www.ncbi.nlm.nih.gov/pubmed/37787768 http://dx.doi.org/10.7554/eLife.88229 |
_version_ | 1785115066814693376 |
---|---|
author | Rodríguez-López, María Bordin, Nicola Lees, Jon Scholes, Harry Hassan, Shaimaa Saintain, Quentin Kamrad, Stephan Orengo, Christine Bähler, Jürg |
author_facet | Rodríguez-López, María Bordin, Nicola Lees, Jon Scholes, Harry Hassan, Shaimaa Saintain, Quentin Kamrad, Stephan Orengo, Christine Bähler, Jürg |
author_sort | Rodríguez-López, María |
collection | PubMed |
description | Many proteins remain poorly characterized even in well-studied organisms, presenting a bottleneck for research. We applied phenomics and machine-learning approaches with Schizosaccharomyces pombe for broad cues on protein functions. We assayed colony-growth phenotypes to measure the fitness of deletion mutants for 3509 non-essential genes in 131 conditions with different nutrients, drugs, and stresses. These analyses exposed phenotypes for 3492 mutants, including 124 mutants of ‘priority unstudied’ proteins conserved in humans, providing varied functional clues. For example, over 900 proteins were newly implicated in the resistance to oxidative stress. Phenotype-correlation networks suggested roles for poorly characterized proteins through ‘guilt by association’ with known proteins. For complementary functional insights, we predicted Gene Ontology (GO) terms using machine learning methods exploiting protein-network and protein-homology data (NET-FF). We obtained 56,594 high-scoring GO predictions, of which 22,060 also featured high information content. Our phenotype-correlation data and NET-FF predictions showed a strong concordance with existing PomBase GO annotations and protein networks, with integrated analyses revealing 1675 novel GO predictions for 783 genes, including 47 predictions for 23 priority unstudied proteins. Experimental validation identified new proteins involved in cellular aging, showing that these predictions and phenomics data provide a rich resource to uncover new protein functions. |
format | Online Article Text |
id | pubmed-10547477 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | eLife Sciences Publications, Ltd |
record_format | MEDLINE/PubMed |
spelling | pubmed-105474772023-10-04 Broad functional profiling of fission yeast proteins using phenomics and machine learning Rodríguez-López, María Bordin, Nicola Lees, Jon Scholes, Harry Hassan, Shaimaa Saintain, Quentin Kamrad, Stephan Orengo, Christine Bähler, Jürg eLife Computational and Systems Biology Many proteins remain poorly characterized even in well-studied organisms, presenting a bottleneck for research. We applied phenomics and machine-learning approaches with Schizosaccharomyces pombe for broad cues on protein functions. We assayed colony-growth phenotypes to measure the fitness of deletion mutants for 3509 non-essential genes in 131 conditions with different nutrients, drugs, and stresses. These analyses exposed phenotypes for 3492 mutants, including 124 mutants of ‘priority unstudied’ proteins conserved in humans, providing varied functional clues. For example, over 900 proteins were newly implicated in the resistance to oxidative stress. Phenotype-correlation networks suggested roles for poorly characterized proteins through ‘guilt by association’ with known proteins. For complementary functional insights, we predicted Gene Ontology (GO) terms using machine learning methods exploiting protein-network and protein-homology data (NET-FF). We obtained 56,594 high-scoring GO predictions, of which 22,060 also featured high information content. Our phenotype-correlation data and NET-FF predictions showed a strong concordance with existing PomBase GO annotations and protein networks, with integrated analyses revealing 1675 novel GO predictions for 783 genes, including 47 predictions for 23 priority unstudied proteins. Experimental validation identified new proteins involved in cellular aging, showing that these predictions and phenomics data provide a rich resource to uncover new protein functions. eLife Sciences Publications, Ltd 2023-10-03 /pmc/articles/PMC10547477/ /pubmed/37787768 http://dx.doi.org/10.7554/eLife.88229 Text en © 2023, Rodríguez-López, Bordin et al https://creativecommons.org/licenses/by/4.0/This article is distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use and redistribution provided that the original author and source are credited. |
spellingShingle | Computational and Systems Biology Rodríguez-López, María Bordin, Nicola Lees, Jon Scholes, Harry Hassan, Shaimaa Saintain, Quentin Kamrad, Stephan Orengo, Christine Bähler, Jürg Broad functional profiling of fission yeast proteins using phenomics and machine learning |
title | Broad functional profiling of fission yeast proteins using phenomics and machine learning |
title_full | Broad functional profiling of fission yeast proteins using phenomics and machine learning |
title_fullStr | Broad functional profiling of fission yeast proteins using phenomics and machine learning |
title_full_unstemmed | Broad functional profiling of fission yeast proteins using phenomics and machine learning |
title_short | Broad functional profiling of fission yeast proteins using phenomics and machine learning |
title_sort | broad functional profiling of fission yeast proteins using phenomics and machine learning |
topic | Computational and Systems Biology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10547477/ https://www.ncbi.nlm.nih.gov/pubmed/37787768 http://dx.doi.org/10.7554/eLife.88229 |
work_keys_str_mv | AT rodriguezlopezmaria broadfunctionalprofilingoffissionyeastproteinsusingphenomicsandmachinelearning AT bordinnicola broadfunctionalprofilingoffissionyeastproteinsusingphenomicsandmachinelearning AT leesjon broadfunctionalprofilingoffissionyeastproteinsusingphenomicsandmachinelearning AT scholesharry broadfunctionalprofilingoffissionyeastproteinsusingphenomicsandmachinelearning AT hassanshaimaa broadfunctionalprofilingoffissionyeastproteinsusingphenomicsandmachinelearning AT saintainquentin broadfunctionalprofilingoffissionyeastproteinsusingphenomicsandmachinelearning AT kamradstephan broadfunctionalprofilingoffissionyeastproteinsusingphenomicsandmachinelearning AT orengochristine broadfunctionalprofilingoffissionyeastproteinsusingphenomicsandmachinelearning AT bahlerjurg broadfunctionalprofilingoffissionyeastproteinsusingphenomicsandmachinelearning |