Cargando…

Conserved unique peptide patterns (CUPP) online platform 2.0: implementation of +1000 JGI fungal genomes

Carbohydrate-processing enzymes, CAZymes, are classified into families based on sequence and three-dimensional fold. Because many CAZyme families contain members of diverse molecular function (different EC-numbers), sophisticated tools are required to further delineate these enzymes. Such delineatio...

Descripción completa

Detalles Bibliográficos
Autores principales: Barrett, Kristian, Hunt, Cameron J, Lange, Lene, Grigoriev, Igor V, Meyer, Anne S
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10320097/
https://www.ncbi.nlm.nih.gov/pubmed/37216585
http://dx.doi.org/10.1093/nar/gkad385
_version_ 1785068376912035840
author Barrett, Kristian
Hunt, Cameron J
Lange, Lene
Grigoriev, Igor V
Meyer, Anne S
author_facet Barrett, Kristian
Hunt, Cameron J
Lange, Lene
Grigoriev, Igor V
Meyer, Anne S
author_sort Barrett, Kristian
collection PubMed
description Carbohydrate-processing enzymes, CAZymes, are classified into families based on sequence and three-dimensional fold. Because many CAZyme families contain members of diverse molecular function (different EC-numbers), sophisticated tools are required to further delineate these enzymes. Such delineation is provided by the peptide-based clustering method CUPP, Conserved Unique Peptide Patterns. CUPP operates synergistically with the CAZy family/subfamily categorizations to allow systematic exploration of CAZymes by defining small protein groups with shared sequence motifs. The updated CUPP library contains 21,930 of such motif groups including 3,842,628 proteins. The new implementation of the CUPP-webserver, https://cupp.info/, now includes all published fungal and algal genomes from the Joint Genome Institute (JGI), genome resources MycoCosm and PhycoCosm, dynamically subdivided into motif groups of CAZymes. This allows users to browse the JGI portals for specific predicted functions or specific protein families from genome sequences. Thus, a genome can be searched for proteins having specific characteristics. All JGI proteins have a hyperlink to a summary page which links to the predicted gene splicing including which regions have RNA support. The new CUPP implementation also includes an update of the annotation algorithm that uses only a fourth of the RAM while enabling multi-threading, providing an annotation speed below 1 ms/protein.
format Online
Article
Text
id pubmed-10320097
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-103200972023-07-06 Conserved unique peptide patterns (CUPP) online platform 2.0: implementation of +1000 JGI fungal genomes Barrett, Kristian Hunt, Cameron J Lange, Lene Grigoriev, Igor V Meyer, Anne S Nucleic Acids Res Web Server Issue Carbohydrate-processing enzymes, CAZymes, are classified into families based on sequence and three-dimensional fold. Because many CAZyme families contain members of diverse molecular function (different EC-numbers), sophisticated tools are required to further delineate these enzymes. Such delineation is provided by the peptide-based clustering method CUPP, Conserved Unique Peptide Patterns. CUPP operates synergistically with the CAZy family/subfamily categorizations to allow systematic exploration of CAZymes by defining small protein groups with shared sequence motifs. The updated CUPP library contains 21,930 of such motif groups including 3,842,628 proteins. The new implementation of the CUPP-webserver, https://cupp.info/, now includes all published fungal and algal genomes from the Joint Genome Institute (JGI), genome resources MycoCosm and PhycoCosm, dynamically subdivided into motif groups of CAZymes. This allows users to browse the JGI portals for specific predicted functions or specific protein families from genome sequences. Thus, a genome can be searched for proteins having specific characteristics. All JGI proteins have a hyperlink to a summary page which links to the predicted gene splicing including which regions have RNA support. The new CUPP implementation also includes an update of the annotation algorithm that uses only a fourth of the RAM while enabling multi-threading, providing an annotation speed below 1 ms/protein. Oxford University Press 2023-05-22 /pmc/articles/PMC10320097/ /pubmed/37216585 http://dx.doi.org/10.1093/nar/gkad385 Text en © The Author(s) 2023. Published by Oxford University Press on behalf of Nucleic Acids Research. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Web Server Issue
Barrett, Kristian
Hunt, Cameron J
Lange, Lene
Grigoriev, Igor V
Meyer, Anne S
Conserved unique peptide patterns (CUPP) online platform 2.0: implementation of +1000 JGI fungal genomes
title Conserved unique peptide patterns (CUPP) online platform 2.0: implementation of +1000 JGI fungal genomes
title_full Conserved unique peptide patterns (CUPP) online platform 2.0: implementation of +1000 JGI fungal genomes
title_fullStr Conserved unique peptide patterns (CUPP) online platform 2.0: implementation of +1000 JGI fungal genomes
title_full_unstemmed Conserved unique peptide patterns (CUPP) online platform 2.0: implementation of +1000 JGI fungal genomes
title_short Conserved unique peptide patterns (CUPP) online platform 2.0: implementation of +1000 JGI fungal genomes
title_sort conserved unique peptide patterns (cupp) online platform 2.0: implementation of +1000 jgi fungal genomes
topic Web Server Issue
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10320097/
https://www.ncbi.nlm.nih.gov/pubmed/37216585
http://dx.doi.org/10.1093/nar/gkad385
work_keys_str_mv AT barrettkristian conserveduniquepeptidepatternscupponlineplatform20implementationof1000jgifungalgenomes
AT huntcameronj conserveduniquepeptidepatternscupponlineplatform20implementationof1000jgifungalgenomes
AT langelene conserveduniquepeptidepatternscupponlineplatform20implementationof1000jgifungalgenomes
AT grigorievigorv conserveduniquepeptidepatternscupponlineplatform20implementationof1000jgifungalgenomes
AT meyerannes conserveduniquepeptidepatternscupponlineplatform20implementationof1000jgifungalgenomes