Cargando…
The Pfam protein families database in 2019
The last few years have witnessed significant changes in Pfam (https://pfam.xfam.org). The number of families has grown substantially to a total of 17,929 in release 32.0. New additions have been coupled with efforts to improve existing families, including refinement of domain boundaries, their clas...
Autores principales: | , , , , , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6324024/ https://www.ncbi.nlm.nih.gov/pubmed/30357350 http://dx.doi.org/10.1093/nar/gky995 |
_version_ | 1783385893903532032 |
---|---|
author | El-Gebali, Sara Mistry, Jaina Bateman, Alex Eddy, Sean R Luciani, Aurélien Potter, Simon C Qureshi, Matloob Richardson, Lorna J Salazar, Gustavo A Smart, Alfredo Sonnhammer, Erik L L Hirsh, Layla Paladin, Lisanna Piovesan, Damiano Tosatto, Silvio C E Finn, Robert D |
author_facet | El-Gebali, Sara Mistry, Jaina Bateman, Alex Eddy, Sean R Luciani, Aurélien Potter, Simon C Qureshi, Matloob Richardson, Lorna J Salazar, Gustavo A Smart, Alfredo Sonnhammer, Erik L L Hirsh, Layla Paladin, Lisanna Piovesan, Damiano Tosatto, Silvio C E Finn, Robert D |
author_sort | El-Gebali, Sara |
collection | PubMed |
description | The last few years have witnessed significant changes in Pfam (https://pfam.xfam.org). The number of families has grown substantially to a total of 17,929 in release 32.0. New additions have been coupled with efforts to improve existing families, including refinement of domain boundaries, their classification into Pfam clans, as well as their functional annotation. We recently began to collaborate with the RepeatsDB resource to improve the definition of tandem repeat families within Pfam. We carried out a significant comparison to the structural classification database, namely the Evolutionary Classification of Protein Domains (ECOD) that led to the creation of 825 new families based on their set of uncharacterized families (EUFs). Furthermore, we also connected Pfam entries to the Sequence Ontology (SO) through mapping of the Pfam type definitions to SO terms. Since Pfam has many community contributors, we recently enabled the linking between authorship of all Pfam entries with the corresponding authors’ ORCID identifiers. This effectively permits authors to claim credit for their Pfam curation and link them to their ORCID record. |
format | Online Article Text |
id | pubmed-6324024 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-63240242019-01-10 The Pfam protein families database in 2019 El-Gebali, Sara Mistry, Jaina Bateman, Alex Eddy, Sean R Luciani, Aurélien Potter, Simon C Qureshi, Matloob Richardson, Lorna J Salazar, Gustavo A Smart, Alfredo Sonnhammer, Erik L L Hirsh, Layla Paladin, Lisanna Piovesan, Damiano Tosatto, Silvio C E Finn, Robert D Nucleic Acids Res Database Issue The last few years have witnessed significant changes in Pfam (https://pfam.xfam.org). The number of families has grown substantially to a total of 17,929 in release 32.0. New additions have been coupled with efforts to improve existing families, including refinement of domain boundaries, their classification into Pfam clans, as well as their functional annotation. We recently began to collaborate with the RepeatsDB resource to improve the definition of tandem repeat families within Pfam. We carried out a significant comparison to the structural classification database, namely the Evolutionary Classification of Protein Domains (ECOD) that led to the creation of 825 new families based on their set of uncharacterized families (EUFs). Furthermore, we also connected Pfam entries to the Sequence Ontology (SO) through mapping of the Pfam type definitions to SO terms. Since Pfam has many community contributors, we recently enabled the linking between authorship of all Pfam entries with the corresponding authors’ ORCID identifiers. This effectively permits authors to claim credit for their Pfam curation and link them to their ORCID record. Oxford University Press 2019-01-08 2018-10-24 /pmc/articles/PMC6324024/ /pubmed/30357350 http://dx.doi.org/10.1093/nar/gky995 Text en © The Author(s) 2018. Published by Oxford University Press on behalf of Nucleic Acids Research. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Database Issue El-Gebali, Sara Mistry, Jaina Bateman, Alex Eddy, Sean R Luciani, Aurélien Potter, Simon C Qureshi, Matloob Richardson, Lorna J Salazar, Gustavo A Smart, Alfredo Sonnhammer, Erik L L Hirsh, Layla Paladin, Lisanna Piovesan, Damiano Tosatto, Silvio C E Finn, Robert D The Pfam protein families database in 2019 |
title | The Pfam protein families database in 2019 |
title_full | The Pfam protein families database in 2019 |
title_fullStr | The Pfam protein families database in 2019 |
title_full_unstemmed | The Pfam protein families database in 2019 |
title_short | The Pfam protein families database in 2019 |
title_sort | pfam protein families database in 2019 |
topic | Database Issue |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6324024/ https://www.ncbi.nlm.nih.gov/pubmed/30357350 http://dx.doi.org/10.1093/nar/gky995 |
work_keys_str_mv | AT elgebalisara thepfamproteinfamiliesdatabasein2019 AT mistryjaina thepfamproteinfamiliesdatabasein2019 AT batemanalex thepfamproteinfamiliesdatabasein2019 AT eddyseanr thepfamproteinfamiliesdatabasein2019 AT lucianiaurelien thepfamproteinfamiliesdatabasein2019 AT pottersimonc thepfamproteinfamiliesdatabasein2019 AT qureshimatloob thepfamproteinfamiliesdatabasein2019 AT richardsonlornaj thepfamproteinfamiliesdatabasein2019 AT salazargustavoa thepfamproteinfamiliesdatabasein2019 AT smartalfredo thepfamproteinfamiliesdatabasein2019 AT sonnhammererikll thepfamproteinfamiliesdatabasein2019 AT hirshlayla thepfamproteinfamiliesdatabasein2019 AT paladinlisanna thepfamproteinfamiliesdatabasein2019 AT piovesandamiano thepfamproteinfamiliesdatabasein2019 AT tosattosilvioce thepfamproteinfamiliesdatabasein2019 AT finnrobertd thepfamproteinfamiliesdatabasein2019 AT elgebalisara pfamproteinfamiliesdatabasein2019 AT mistryjaina pfamproteinfamiliesdatabasein2019 AT batemanalex pfamproteinfamiliesdatabasein2019 AT eddyseanr pfamproteinfamiliesdatabasein2019 AT lucianiaurelien pfamproteinfamiliesdatabasein2019 AT pottersimonc pfamproteinfamiliesdatabasein2019 AT qureshimatloob pfamproteinfamiliesdatabasein2019 AT richardsonlornaj pfamproteinfamiliesdatabasein2019 AT salazargustavoa pfamproteinfamiliesdatabasein2019 AT smartalfredo pfamproteinfamiliesdatabasein2019 AT sonnhammererikll pfamproteinfamiliesdatabasein2019 AT hirshlayla pfamproteinfamiliesdatabasein2019 AT paladinlisanna pfamproteinfamiliesdatabasein2019 AT piovesandamiano pfamproteinfamiliesdatabasein2019 AT tosattosilvioce pfamproteinfamiliesdatabasein2019 AT finnrobertd pfamproteinfamiliesdatabasein2019 |