Cargando…
Conserved unique peptide patterns (CUPP) online platform 2.0: implementation of +1000 JGI fungal genomes
Carbohydrate-processing enzymes, CAZymes, are classified into families based on sequence and three-dimensional fold. Because many CAZyme families contain members of diverse molecular function (different EC-numbers), sophisticated tools are required to further delineate these enzymes. Such delineatio...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10320097/ https://www.ncbi.nlm.nih.gov/pubmed/37216585 http://dx.doi.org/10.1093/nar/gkad385 |
_version_ | 1785068376912035840 |
---|---|
author | Barrett, Kristian Hunt, Cameron J Lange, Lene Grigoriev, Igor V Meyer, Anne S |
author_facet | Barrett, Kristian Hunt, Cameron J Lange, Lene Grigoriev, Igor V Meyer, Anne S |
author_sort | Barrett, Kristian |
collection | PubMed |
description | Carbohydrate-processing enzymes, CAZymes, are classified into families based on sequence and three-dimensional fold. Because many CAZyme families contain members of diverse molecular function (different EC-numbers), sophisticated tools are required to further delineate these enzymes. Such delineation is provided by the peptide-based clustering method CUPP, Conserved Unique Peptide Patterns. CUPP operates synergistically with the CAZy family/subfamily categorizations to allow systematic exploration of CAZymes by defining small protein groups with shared sequence motifs. The updated CUPP library contains 21,930 of such motif groups including 3,842,628 proteins. The new implementation of the CUPP-webserver, https://cupp.info/, now includes all published fungal and algal genomes from the Joint Genome Institute (JGI), genome resources MycoCosm and PhycoCosm, dynamically subdivided into motif groups of CAZymes. This allows users to browse the JGI portals for specific predicted functions or specific protein families from genome sequences. Thus, a genome can be searched for proteins having specific characteristics. All JGI proteins have a hyperlink to a summary page which links to the predicted gene splicing including which regions have RNA support. The new CUPP implementation also includes an update of the annotation algorithm that uses only a fourth of the RAM while enabling multi-threading, providing an annotation speed below 1 ms/protein. |
format | Online Article Text |
id | pubmed-10320097 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-103200972023-07-06 Conserved unique peptide patterns (CUPP) online platform 2.0: implementation of +1000 JGI fungal genomes Barrett, Kristian Hunt, Cameron J Lange, Lene Grigoriev, Igor V Meyer, Anne S Nucleic Acids Res Web Server Issue Carbohydrate-processing enzymes, CAZymes, are classified into families based on sequence and three-dimensional fold. Because many CAZyme families contain members of diverse molecular function (different EC-numbers), sophisticated tools are required to further delineate these enzymes. Such delineation is provided by the peptide-based clustering method CUPP, Conserved Unique Peptide Patterns. CUPP operates synergistically with the CAZy family/subfamily categorizations to allow systematic exploration of CAZymes by defining small protein groups with shared sequence motifs. The updated CUPP library contains 21,930 of such motif groups including 3,842,628 proteins. The new implementation of the CUPP-webserver, https://cupp.info/, now includes all published fungal and algal genomes from the Joint Genome Institute (JGI), genome resources MycoCosm and PhycoCosm, dynamically subdivided into motif groups of CAZymes. This allows users to browse the JGI portals for specific predicted functions or specific protein families from genome sequences. Thus, a genome can be searched for proteins having specific characteristics. All JGI proteins have a hyperlink to a summary page which links to the predicted gene splicing including which regions have RNA support. The new CUPP implementation also includes an update of the annotation algorithm that uses only a fourth of the RAM while enabling multi-threading, providing an annotation speed below 1 ms/protein. Oxford University Press 2023-05-22 /pmc/articles/PMC10320097/ /pubmed/37216585 http://dx.doi.org/10.1093/nar/gkad385 Text en © The Author(s) 2023. Published by Oxford University Press on behalf of Nucleic Acids Research. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Web Server Issue Barrett, Kristian Hunt, Cameron J Lange, Lene Grigoriev, Igor V Meyer, Anne S Conserved unique peptide patterns (CUPP) online platform 2.0: implementation of +1000 JGI fungal genomes |
title | Conserved unique peptide patterns (CUPP) online platform 2.0: implementation of +1000 JGI fungal genomes |
title_full | Conserved unique peptide patterns (CUPP) online platform 2.0: implementation of +1000 JGI fungal genomes |
title_fullStr | Conserved unique peptide patterns (CUPP) online platform 2.0: implementation of +1000 JGI fungal genomes |
title_full_unstemmed | Conserved unique peptide patterns (CUPP) online platform 2.0: implementation of +1000 JGI fungal genomes |
title_short | Conserved unique peptide patterns (CUPP) online platform 2.0: implementation of +1000 JGI fungal genomes |
title_sort | conserved unique peptide patterns (cupp) online platform 2.0: implementation of +1000 jgi fungal genomes |
topic | Web Server Issue |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10320097/ https://www.ncbi.nlm.nih.gov/pubmed/37216585 http://dx.doi.org/10.1093/nar/gkad385 |
work_keys_str_mv | AT barrettkristian conserveduniquepeptidepatternscupponlineplatform20implementationof1000jgifungalgenomes AT huntcameronj conserveduniquepeptidepatternscupponlineplatform20implementationof1000jgifungalgenomes AT langelene conserveduniquepeptidepatternscupponlineplatform20implementationof1000jgifungalgenomes AT grigorievigorv conserveduniquepeptidepatternscupponlineplatform20implementationof1000jgifungalgenomes AT meyerannes conserveduniquepeptidepatternscupponlineplatform20implementationof1000jgifungalgenomes |