Cargando…
Genome3D: integrating a collaborative data pipeline to expand the depth and breadth of consensus protein structure annotation
Genome3D (https://www.genome3d.eu) is a freely available resource that provides consensus structural annotations for representative protein sequences taken from a selection of model organisms. Since the last NAR update in 2015, the method of data submission has been overhauled, with annotations now...
Autores principales: | , , , , , , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7139969/ https://www.ncbi.nlm.nih.gov/pubmed/31733063 http://dx.doi.org/10.1093/nar/gkz967 |
_version_ | 1783518888715091968 |
---|---|
author | Sillitoe, Ian Andreeva, Antonina Blundell, Tom L Buchan, Daniel W A Finn, Robert D Gough, Julian Jones, David Kelley, Lawrence A Paysan-Lafosse, Typhaine Lam, Su Datt Murzin, Alexey G Pandurangan, Arun Prasad Salazar, Gustavo A Skwark, Marcin J Sternberg, Michael J E Velankar, Sameer Orengo, Christine |
author_facet | Sillitoe, Ian Andreeva, Antonina Blundell, Tom L Buchan, Daniel W A Finn, Robert D Gough, Julian Jones, David Kelley, Lawrence A Paysan-Lafosse, Typhaine Lam, Su Datt Murzin, Alexey G Pandurangan, Arun Prasad Salazar, Gustavo A Skwark, Marcin J Sternberg, Michael J E Velankar, Sameer Orengo, Christine |
author_sort | Sillitoe, Ian |
collection | PubMed |
description | Genome3D (https://www.genome3d.eu) is a freely available resource that provides consensus structural annotations for representative protein sequences taken from a selection of model organisms. Since the last NAR update in 2015, the method of data submission has been overhauled, with annotations now being ‘pushed’ to the database via an API. As a result, contributing groups are now able to manage their own structural annotations, making the resource more flexible and maintainable. The new submission protocol brings a number of additional benefits including: providing instant validation of data and avoiding the requirement to synchronise releases between resources. It also makes it possible to implement the submission of these structural annotations as an automated part of existing internal workflows. In turn, these improvements facilitate Genome3D being opened up to new prediction algorithms and groups. For the latest release of Genome3D (v2.1), the underlying dataset of sequences used as prediction targets has been updated using the latest reference proteomes available in UniProtKB. A number of new reference proteomes have also been added of particular interest to the wider scientific community: cow, pig, wheat and mycobacterium tuberculosis. These additions, along with improvements to the underlying predictions from contributing resources, has ensured that the number of annotations in Genome3D has nearly doubled since the last NAR update article. The new API has also been used to facilitate the dissemination of Genome3D data into InterPro, thereby widening the visibility of both the annotation data and annotation algorithms. |
format | Online Article Text |
id | pubmed-7139969 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-71399692020-04-13 Genome3D: integrating a collaborative data pipeline to expand the depth and breadth of consensus protein structure annotation Sillitoe, Ian Andreeva, Antonina Blundell, Tom L Buchan, Daniel W A Finn, Robert D Gough, Julian Jones, David Kelley, Lawrence A Paysan-Lafosse, Typhaine Lam, Su Datt Murzin, Alexey G Pandurangan, Arun Prasad Salazar, Gustavo A Skwark, Marcin J Sternberg, Michael J E Velankar, Sameer Orengo, Christine Nucleic Acids Res Database Issue Genome3D (https://www.genome3d.eu) is a freely available resource that provides consensus structural annotations for representative protein sequences taken from a selection of model organisms. Since the last NAR update in 2015, the method of data submission has been overhauled, with annotations now being ‘pushed’ to the database via an API. As a result, contributing groups are now able to manage their own structural annotations, making the resource more flexible and maintainable. The new submission protocol brings a number of additional benefits including: providing instant validation of data and avoiding the requirement to synchronise releases between resources. It also makes it possible to implement the submission of these structural annotations as an automated part of existing internal workflows. In turn, these improvements facilitate Genome3D being opened up to new prediction algorithms and groups. For the latest release of Genome3D (v2.1), the underlying dataset of sequences used as prediction targets has been updated using the latest reference proteomes available in UniProtKB. A number of new reference proteomes have also been added of particular interest to the wider scientific community: cow, pig, wheat and mycobacterium tuberculosis. These additions, along with improvements to the underlying predictions from contributing resources, has ensured that the number of annotations in Genome3D has nearly doubled since the last NAR update article. The new API has also been used to facilitate the dissemination of Genome3D data into InterPro, thereby widening the visibility of both the annotation data and annotation algorithms. Oxford University Press 2020-01-08 2019-11-16 /pmc/articles/PMC7139969/ /pubmed/31733063 http://dx.doi.org/10.1093/nar/gkz967 Text en © The Author(s) 2019. Published by Oxford University Press on behalf of Nucleic Acids Research. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Database Issue Sillitoe, Ian Andreeva, Antonina Blundell, Tom L Buchan, Daniel W A Finn, Robert D Gough, Julian Jones, David Kelley, Lawrence A Paysan-Lafosse, Typhaine Lam, Su Datt Murzin, Alexey G Pandurangan, Arun Prasad Salazar, Gustavo A Skwark, Marcin J Sternberg, Michael J E Velankar, Sameer Orengo, Christine Genome3D: integrating a collaborative data pipeline to expand the depth and breadth of consensus protein structure annotation |
title | Genome3D: integrating a collaborative data pipeline to expand the depth and breadth of consensus protein structure annotation |
title_full | Genome3D: integrating a collaborative data pipeline to expand the depth and breadth of consensus protein structure annotation |
title_fullStr | Genome3D: integrating a collaborative data pipeline to expand the depth and breadth of consensus protein structure annotation |
title_full_unstemmed | Genome3D: integrating a collaborative data pipeline to expand the depth and breadth of consensus protein structure annotation |
title_short | Genome3D: integrating a collaborative data pipeline to expand the depth and breadth of consensus protein structure annotation |
title_sort | genome3d: integrating a collaborative data pipeline to expand the depth and breadth of consensus protein structure annotation |
topic | Database Issue |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7139969/ https://www.ncbi.nlm.nih.gov/pubmed/31733063 http://dx.doi.org/10.1093/nar/gkz967 |
work_keys_str_mv | AT sillitoeian genome3dintegratingacollaborativedatapipelinetoexpandthedepthandbreadthofconsensusproteinstructureannotation AT andreevaantonina genome3dintegratingacollaborativedatapipelinetoexpandthedepthandbreadthofconsensusproteinstructureannotation AT blundelltoml genome3dintegratingacollaborativedatapipelinetoexpandthedepthandbreadthofconsensusproteinstructureannotation AT buchandanielwa genome3dintegratingacollaborativedatapipelinetoexpandthedepthandbreadthofconsensusproteinstructureannotation AT finnrobertd genome3dintegratingacollaborativedatapipelinetoexpandthedepthandbreadthofconsensusproteinstructureannotation AT goughjulian genome3dintegratingacollaborativedatapipelinetoexpandthedepthandbreadthofconsensusproteinstructureannotation AT jonesdavid genome3dintegratingacollaborativedatapipelinetoexpandthedepthandbreadthofconsensusproteinstructureannotation AT kelleylawrencea genome3dintegratingacollaborativedatapipelinetoexpandthedepthandbreadthofconsensusproteinstructureannotation AT paysanlafossetyphaine genome3dintegratingacollaborativedatapipelinetoexpandthedepthandbreadthofconsensusproteinstructureannotation AT lamsudatt genome3dintegratingacollaborativedatapipelinetoexpandthedepthandbreadthofconsensusproteinstructureannotation AT murzinalexeyg genome3dintegratingacollaborativedatapipelinetoexpandthedepthandbreadthofconsensusproteinstructureannotation AT panduranganarunprasad genome3dintegratingacollaborativedatapipelinetoexpandthedepthandbreadthofconsensusproteinstructureannotation AT salazargustavoa genome3dintegratingacollaborativedatapipelinetoexpandthedepthandbreadthofconsensusproteinstructureannotation AT skwarkmarcinj genome3dintegratingacollaborativedatapipelinetoexpandthedepthandbreadthofconsensusproteinstructureannotation AT sternbergmichaelje genome3dintegratingacollaborativedatapipelinetoexpandthedepthandbreadthofconsensusproteinstructureannotation AT velankarsameer genome3dintegratingacollaborativedatapipelinetoexpandthedepthandbreadthofconsensusproteinstructureannotation AT orengochristine genome3dintegratingacollaborativedatapipelinetoexpandthedepthandbreadthofconsensusproteinstructureannotation |