Cargando…
NetGO 3.0: Protein Language Model Improves Large-scale Functional Annotations
As one of the state-of-the-art automated function prediction (AFP) methods, NetGO 2.0 integrates multi-source information to improve the performance. However, it mainly utilizes the proteins with experimentally supported functional annotations without leveraging valuable information from a vast numb...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Elsevier
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10626176/ https://www.ncbi.nlm.nih.gov/pubmed/37075830 http://dx.doi.org/10.1016/j.gpb.2023.04.001 |
_version_ | 1785131288891490304 |
---|---|
author | Wang, Shaojun You, Ronghui Liu, Yunjia Xiong, Yi Zhu, Shanfeng |
author_facet | Wang, Shaojun You, Ronghui Liu, Yunjia Xiong, Yi Zhu, Shanfeng |
author_sort | Wang, Shaojun |
collection | PubMed |
description | As one of the state-of-the-art automated function prediction (AFP) methods, NetGO 2.0 integrates multi-source information to improve the performance. However, it mainly utilizes the proteins with experimentally supported functional annotations without leveraging valuable information from a vast number of unannotated proteins. Recently, protein language models have been proposed to learn informative representations [e.g., Evolutionary Scale Modeling (ESM)-1b embedding] from protein sequences based on self-supervision. Here, we represented each protein by ESM-1b and used logistic regression (LR) to train a new model, LR-ESM, for AFP. The experimental results showed that LR-ESM achieved comparable performance with the best-performing component of NetGO 2.0. Therefore, by incorporating LR-ESM into NetGO 2.0, we developed NetGO 3.0 to improve the performance of AFP extensively. NetGO 3.0 is freely accessible at https://dmiip.sjtu.edu.cn/ng3.0. |
format | Online Article Text |
id | pubmed-10626176 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Elsevier |
record_format | MEDLINE/PubMed |
spelling | pubmed-106261762023-11-07 NetGO 3.0: Protein Language Model Improves Large-scale Functional Annotations Wang, Shaojun You, Ronghui Liu, Yunjia Xiong, Yi Zhu, Shanfeng Genomics Proteomics Bioinformatics Web Server As one of the state-of-the-art automated function prediction (AFP) methods, NetGO 2.0 integrates multi-source information to improve the performance. However, it mainly utilizes the proteins with experimentally supported functional annotations without leveraging valuable information from a vast number of unannotated proteins. Recently, protein language models have been proposed to learn informative representations [e.g., Evolutionary Scale Modeling (ESM)-1b embedding] from protein sequences based on self-supervision. Here, we represented each protein by ESM-1b and used logistic regression (LR) to train a new model, LR-ESM, for AFP. The experimental results showed that LR-ESM achieved comparable performance with the best-performing component of NetGO 2.0. Therefore, by incorporating LR-ESM into NetGO 2.0, we developed NetGO 3.0 to improve the performance of AFP extensively. NetGO 3.0 is freely accessible at https://dmiip.sjtu.edu.cn/ng3.0. Elsevier 2023-04 2023-04-17 /pmc/articles/PMC10626176/ /pubmed/37075830 http://dx.doi.org/10.1016/j.gpb.2023.04.001 Text en © 2023 Beijing Institute of Genomics https://creativecommons.org/licenses/by/4.0/This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Web Server Wang, Shaojun You, Ronghui Liu, Yunjia Xiong, Yi Zhu, Shanfeng NetGO 3.0: Protein Language Model Improves Large-scale Functional Annotations |
title | NetGO 3.0: Protein Language Model Improves Large-scale Functional Annotations |
title_full | NetGO 3.0: Protein Language Model Improves Large-scale Functional Annotations |
title_fullStr | NetGO 3.0: Protein Language Model Improves Large-scale Functional Annotations |
title_full_unstemmed | NetGO 3.0: Protein Language Model Improves Large-scale Functional Annotations |
title_short | NetGO 3.0: Protein Language Model Improves Large-scale Functional Annotations |
title_sort | netgo 3.0: protein language model improves large-scale functional annotations |
topic | Web Server |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10626176/ https://www.ncbi.nlm.nih.gov/pubmed/37075830 http://dx.doi.org/10.1016/j.gpb.2023.04.001 |
work_keys_str_mv | AT wangshaojun netgo30proteinlanguagemodelimproveslargescalefunctionalannotations AT youronghui netgo30proteinlanguagemodelimproveslargescalefunctionalannotations AT liuyunjia netgo30proteinlanguagemodelimproveslargescalefunctionalannotations AT xiongyi netgo30proteinlanguagemodelimproveslargescalefunctionalannotations AT zhushanfeng netgo30proteinlanguagemodelimproveslargescalefunctionalannotations |