Cargando…

Genome3D: integrating a collaborative data pipeline to expand the depth and breadth of consensus protein structure annotation

Genome3D (https://www.genome3d.eu) is a freely available resource that provides consensus structural annotations for representative protein sequences taken from a selection of model organisms. Since the last NAR update in 2015, the method of data submission has been overhauled, with annotations now...

Descripción completa

Detalles Bibliográficos
Autores principales: Sillitoe, Ian, Andreeva, Antonina, Blundell, Tom L, Buchan, Daniel W A, Finn, Robert D, Gough, Julian, Jones, David, Kelley, Lawrence A, Paysan-Lafosse, Typhaine, Lam, Su Datt, Murzin, Alexey G, Pandurangan, Arun Prasad, Salazar, Gustavo A, Skwark, Marcin J, Sternberg, Michael J E, Velankar, Sameer, Orengo, Christine
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7139969/
https://www.ncbi.nlm.nih.gov/pubmed/31733063
http://dx.doi.org/10.1093/nar/gkz967
_version_ 1783518888715091968
author Sillitoe, Ian
Andreeva, Antonina
Blundell, Tom L
Buchan, Daniel W A
Finn, Robert D
Gough, Julian
Jones, David
Kelley, Lawrence A
Paysan-Lafosse, Typhaine
Lam, Su Datt
Murzin, Alexey G
Pandurangan, Arun Prasad
Salazar, Gustavo A
Skwark, Marcin J
Sternberg, Michael J E
Velankar, Sameer
Orengo, Christine
author_facet Sillitoe, Ian
Andreeva, Antonina
Blundell, Tom L
Buchan, Daniel W A
Finn, Robert D
Gough, Julian
Jones, David
Kelley, Lawrence A
Paysan-Lafosse, Typhaine
Lam, Su Datt
Murzin, Alexey G
Pandurangan, Arun Prasad
Salazar, Gustavo A
Skwark, Marcin J
Sternberg, Michael J E
Velankar, Sameer
Orengo, Christine
author_sort Sillitoe, Ian
collection PubMed
description Genome3D (https://www.genome3d.eu) is a freely available resource that provides consensus structural annotations for representative protein sequences taken from a selection of model organisms. Since the last NAR update in 2015, the method of data submission has been overhauled, with annotations now being ‘pushed’ to the database via an API. As a result, contributing groups are now able to manage their own structural annotations, making the resource more flexible and maintainable. The new submission protocol brings a number of additional benefits including: providing instant validation of data and avoiding the requirement to synchronise releases between resources. It also makes it possible to implement the submission of these structural annotations as an automated part of existing internal workflows. In turn, these improvements facilitate Genome3D being opened up to new prediction algorithms and groups. For the latest release of Genome3D (v2.1), the underlying dataset of sequences used as prediction targets has been updated using the latest reference proteomes available in UniProtKB. A number of new reference proteomes have also been added of particular interest to the wider scientific community: cow, pig, wheat and mycobacterium tuberculosis. These additions, along with improvements to the underlying predictions from contributing resources, has ensured that the number of annotations in Genome3D has nearly doubled since the last NAR update article. The new API has also been used to facilitate the dissemination of Genome3D data into InterPro, thereby widening the visibility of both the annotation data and annotation algorithms.
format Online
Article
Text
id pubmed-7139969
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-71399692020-04-13 Genome3D: integrating a collaborative data pipeline to expand the depth and breadth of consensus protein structure annotation Sillitoe, Ian Andreeva, Antonina Blundell, Tom L Buchan, Daniel W A Finn, Robert D Gough, Julian Jones, David Kelley, Lawrence A Paysan-Lafosse, Typhaine Lam, Su Datt Murzin, Alexey G Pandurangan, Arun Prasad Salazar, Gustavo A Skwark, Marcin J Sternberg, Michael J E Velankar, Sameer Orengo, Christine Nucleic Acids Res Database Issue Genome3D (https://www.genome3d.eu) is a freely available resource that provides consensus structural annotations for representative protein sequences taken from a selection of model organisms. Since the last NAR update in 2015, the method of data submission has been overhauled, with annotations now being ‘pushed’ to the database via an API. As a result, contributing groups are now able to manage their own structural annotations, making the resource more flexible and maintainable. The new submission protocol brings a number of additional benefits including: providing instant validation of data and avoiding the requirement to synchronise releases between resources. It also makes it possible to implement the submission of these structural annotations as an automated part of existing internal workflows. In turn, these improvements facilitate Genome3D being opened up to new prediction algorithms and groups. For the latest release of Genome3D (v2.1), the underlying dataset of sequences used as prediction targets has been updated using the latest reference proteomes available in UniProtKB. A number of new reference proteomes have also been added of particular interest to the wider scientific community: cow, pig, wheat and mycobacterium tuberculosis. These additions, along with improvements to the underlying predictions from contributing resources, has ensured that the number of annotations in Genome3D has nearly doubled since the last NAR update article. The new API has also been used to facilitate the dissemination of Genome3D data into InterPro, thereby widening the visibility of both the annotation data and annotation algorithms. Oxford University Press 2020-01-08 2019-11-16 /pmc/articles/PMC7139969/ /pubmed/31733063 http://dx.doi.org/10.1093/nar/gkz967 Text en © The Author(s) 2019. Published by Oxford University Press on behalf of Nucleic Acids Research. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Database Issue
Sillitoe, Ian
Andreeva, Antonina
Blundell, Tom L
Buchan, Daniel W A
Finn, Robert D
Gough, Julian
Jones, David
Kelley, Lawrence A
Paysan-Lafosse, Typhaine
Lam, Su Datt
Murzin, Alexey G
Pandurangan, Arun Prasad
Salazar, Gustavo A
Skwark, Marcin J
Sternberg, Michael J E
Velankar, Sameer
Orengo, Christine
Genome3D: integrating a collaborative data pipeline to expand the depth and breadth of consensus protein structure annotation
title Genome3D: integrating a collaborative data pipeline to expand the depth and breadth of consensus protein structure annotation
title_full Genome3D: integrating a collaborative data pipeline to expand the depth and breadth of consensus protein structure annotation
title_fullStr Genome3D: integrating a collaborative data pipeline to expand the depth and breadth of consensus protein structure annotation
title_full_unstemmed Genome3D: integrating a collaborative data pipeline to expand the depth and breadth of consensus protein structure annotation
title_short Genome3D: integrating a collaborative data pipeline to expand the depth and breadth of consensus protein structure annotation
title_sort genome3d: integrating a collaborative data pipeline to expand the depth and breadth of consensus protein structure annotation
topic Database Issue
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7139969/
https://www.ncbi.nlm.nih.gov/pubmed/31733063
http://dx.doi.org/10.1093/nar/gkz967
work_keys_str_mv AT sillitoeian genome3dintegratingacollaborativedatapipelinetoexpandthedepthandbreadthofconsensusproteinstructureannotation
AT andreevaantonina genome3dintegratingacollaborativedatapipelinetoexpandthedepthandbreadthofconsensusproteinstructureannotation
AT blundelltoml genome3dintegratingacollaborativedatapipelinetoexpandthedepthandbreadthofconsensusproteinstructureannotation
AT buchandanielwa genome3dintegratingacollaborativedatapipelinetoexpandthedepthandbreadthofconsensusproteinstructureannotation
AT finnrobertd genome3dintegratingacollaborativedatapipelinetoexpandthedepthandbreadthofconsensusproteinstructureannotation
AT goughjulian genome3dintegratingacollaborativedatapipelinetoexpandthedepthandbreadthofconsensusproteinstructureannotation
AT jonesdavid genome3dintegratingacollaborativedatapipelinetoexpandthedepthandbreadthofconsensusproteinstructureannotation
AT kelleylawrencea genome3dintegratingacollaborativedatapipelinetoexpandthedepthandbreadthofconsensusproteinstructureannotation
AT paysanlafossetyphaine genome3dintegratingacollaborativedatapipelinetoexpandthedepthandbreadthofconsensusproteinstructureannotation
AT lamsudatt genome3dintegratingacollaborativedatapipelinetoexpandthedepthandbreadthofconsensusproteinstructureannotation
AT murzinalexeyg genome3dintegratingacollaborativedatapipelinetoexpandthedepthandbreadthofconsensusproteinstructureannotation
AT panduranganarunprasad genome3dintegratingacollaborativedatapipelinetoexpandthedepthandbreadthofconsensusproteinstructureannotation
AT salazargustavoa genome3dintegratingacollaborativedatapipelinetoexpandthedepthandbreadthofconsensusproteinstructureannotation
AT skwarkmarcinj genome3dintegratingacollaborativedatapipelinetoexpandthedepthandbreadthofconsensusproteinstructureannotation
AT sternbergmichaelje genome3dintegratingacollaborativedatapipelinetoexpandthedepthandbreadthofconsensusproteinstructureannotation
AT velankarsameer genome3dintegratingacollaborativedatapipelinetoexpandthedepthandbreadthofconsensusproteinstructureannotation
AT orengochristine genome3dintegratingacollaborativedatapipelinetoexpandthedepthandbreadthofconsensusproteinstructureannotation