Cargando…

SulfAtlas, the sulfatase database: state of the art and new developments

SulfAtlas (https://sulfatlas.sb-roscoff.fr/) is a knowledge-based resource dedicated to a sequence-based classification of sulfatases. Currently four sulfatase families exist (S1–S4) and the largest family (S1, formylglycine-dependent sulfatases) is divided into subfamilies by a phylogenetic approac...

Descripción completa

Detalles Bibliográficos
Autores principales: Stam, Mark, Lelièvre, Pernelle, Hoebeke, Mark, Corre, Erwan, Barbeyron, Tristan, Michel, Gurvan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9825549/
https://www.ncbi.nlm.nih.gov/pubmed/36318251
http://dx.doi.org/10.1093/nar/gkac977
_version_ 1784866656586039296
author Stam, Mark
Lelièvre, Pernelle
Hoebeke, Mark
Corre, Erwan
Barbeyron, Tristan
Michel, Gurvan
author_facet Stam, Mark
Lelièvre, Pernelle
Hoebeke, Mark
Corre, Erwan
Barbeyron, Tristan
Michel, Gurvan
author_sort Stam, Mark
collection PubMed
description SulfAtlas (https://sulfatlas.sb-roscoff.fr/) is a knowledge-based resource dedicated to a sequence-based classification of sulfatases. Currently four sulfatase families exist (S1–S4) and the largest family (S1, formylglycine-dependent sulfatases) is divided into subfamilies by a phylogenetic approach, each subfamily corresponding to either a single characterized specificity (or few specificities in some cases) or to unknown substrates. Sequences are linked to their biochemical and structural information according to an expert scrutiny of the available literature. Database browsing was initially made possible both through a keyword search engine and a specific sequence similarity (BLAST) server. In this article, we will briefly summarize the experimental progresses in the sulfatase field in the last 6 years. To improve and speed up the (sub)family assignment of sulfatases in (meta)genomic data, we have developed a new, freely-accessible search engine using Hidden Markov model (HMM) for each (sub)family. This new tool (SulfAtlas HMM) is also a key part of the internal pipeline used to regularly update the database. SulfAtlas resource has indeed significantly grown since its creation in 2016, from 4550 sequences to 162 430 sequences in August 2022.
format Online
Article
Text
id pubmed-9825549
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-98255492023-01-10 SulfAtlas, the sulfatase database: state of the art and new developments Stam, Mark Lelièvre, Pernelle Hoebeke, Mark Corre, Erwan Barbeyron, Tristan Michel, Gurvan Nucleic Acids Res Database Issue SulfAtlas (https://sulfatlas.sb-roscoff.fr/) is a knowledge-based resource dedicated to a sequence-based classification of sulfatases. Currently four sulfatase families exist (S1–S4) and the largest family (S1, formylglycine-dependent sulfatases) is divided into subfamilies by a phylogenetic approach, each subfamily corresponding to either a single characterized specificity (or few specificities in some cases) or to unknown substrates. Sequences are linked to their biochemical and structural information according to an expert scrutiny of the available literature. Database browsing was initially made possible both through a keyword search engine and a specific sequence similarity (BLAST) server. In this article, we will briefly summarize the experimental progresses in the sulfatase field in the last 6 years. To improve and speed up the (sub)family assignment of sulfatases in (meta)genomic data, we have developed a new, freely-accessible search engine using Hidden Markov model (HMM) for each (sub)family. This new tool (SulfAtlas HMM) is also a key part of the internal pipeline used to regularly update the database. SulfAtlas resource has indeed significantly grown since its creation in 2016, from 4550 sequences to 162 430 sequences in August 2022. Oxford University Press 2022-11-01 /pmc/articles/PMC9825549/ /pubmed/36318251 http://dx.doi.org/10.1093/nar/gkac977 Text en © The Author(s) 2022. Published by Oxford University Press on behalf of Nucleic Acids Research. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Database Issue
Stam, Mark
Lelièvre, Pernelle
Hoebeke, Mark
Corre, Erwan
Barbeyron, Tristan
Michel, Gurvan
SulfAtlas, the sulfatase database: state of the art and new developments
title SulfAtlas, the sulfatase database: state of the art and new developments
title_full SulfAtlas, the sulfatase database: state of the art and new developments
title_fullStr SulfAtlas, the sulfatase database: state of the art and new developments
title_full_unstemmed SulfAtlas, the sulfatase database: state of the art and new developments
title_short SulfAtlas, the sulfatase database: state of the art and new developments
title_sort sulfatlas, the sulfatase database: state of the art and new developments
topic Database Issue
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9825549/
https://www.ncbi.nlm.nih.gov/pubmed/36318251
http://dx.doi.org/10.1093/nar/gkac977
work_keys_str_mv AT stammark sulfatlasthesulfatasedatabasestateoftheartandnewdevelopments
AT lelievrepernelle sulfatlasthesulfatasedatabasestateoftheartandnewdevelopments
AT hoebekemark sulfatlasthesulfatasedatabasestateoftheartandnewdevelopments
AT correerwan sulfatlasthesulfatasedatabasestateoftheartandnewdevelopments
AT barbeyrontristan sulfatlasthesulfatasedatabasestateoftheartandnewdevelopments
AT michelgurvan sulfatlasthesulfatasedatabasestateoftheartandnewdevelopments