Cargando…

Protein Ontology (PRO): enhancing and scaling up the representation of protein entities

The Protein Ontology (PRO; http://purl.obolibrary.org/obo/pr) formally defines and describes taxon-specific and taxon-neutral protein-related entities in three major areas: proteins related by evolution; proteins produced from a given gene; and protein-containing complexes. PRO thus serves as a tool...

Descripción completa

Detalles Bibliográficos
Autores principales: Natale, Darren A., Arighi, Cecilia N., Blake, Judith A., Bona, Jonathan, Chen, Chuming, Chen, Sheng-Chih, Christie, Karen R., Cowart, Julie, D'Eustachio, Peter, Diehl, Alexander D., Drabkin, Harold J., Duncan, William D., Huang, Hongzhan, Ren, Jia, Ross, Karen, Ruttenberg, Alan, Shamovsky, Veronica, Smith, Barry, Wang, Qinghua, Zhang, Jian, El-Sayed, Abdelrahman, Wu, Cathy H.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5210558/
https://www.ncbi.nlm.nih.gov/pubmed/27899649
http://dx.doi.org/10.1093/nar/gkw1075
_version_ 1782490907549892608
author Natale, Darren A.
Arighi, Cecilia N.
Blake, Judith A.
Bona, Jonathan
Chen, Chuming
Chen, Sheng-Chih
Christie, Karen R.
Cowart, Julie
D'Eustachio, Peter
Diehl, Alexander D.
Drabkin, Harold J.
Duncan, William D.
Huang, Hongzhan
Ren, Jia
Ross, Karen
Ruttenberg, Alan
Shamovsky, Veronica
Smith, Barry
Wang, Qinghua
Zhang, Jian
El-Sayed, Abdelrahman
Wu, Cathy H.
author_facet Natale, Darren A.
Arighi, Cecilia N.
Blake, Judith A.
Bona, Jonathan
Chen, Chuming
Chen, Sheng-Chih
Christie, Karen R.
Cowart, Julie
D'Eustachio, Peter
Diehl, Alexander D.
Drabkin, Harold J.
Duncan, William D.
Huang, Hongzhan
Ren, Jia
Ross, Karen
Ruttenberg, Alan
Shamovsky, Veronica
Smith, Barry
Wang, Qinghua
Zhang, Jian
El-Sayed, Abdelrahman
Wu, Cathy H.
author_sort Natale, Darren A.
collection PubMed
description The Protein Ontology (PRO; http://purl.obolibrary.org/obo/pr) formally defines and describes taxon-specific and taxon-neutral protein-related entities in three major areas: proteins related by evolution; proteins produced from a given gene; and protein-containing complexes. PRO thus serves as a tool for referencing protein entities at any level of specificity. To enhance this ability, and to facilitate the comparison of such entities described in different resources, we developed a standardized representation of proteoforms using UniProtKB as a sequence reference and PSI-MOD as a post-translational modification reference. We illustrate its use in facilitating an alignment between PRO and Reactome protein entities. We also address issues of scalability, describing our first steps into the use of text mining to identify protein-related entities, the large-scale import of proteoform information from expert curated resources, and our ability to dynamically generate PRO terms. Web views for individual terms are now more informative about closely-related terms, including for example an interactive multiple sequence alignment. Finally, we describe recent improvement in semantic utility, with PRO now represented in OWL and as a SPARQL endpoint. These developments will further support the anticipated growth of PRO and facilitate discoverability of and allow aggregation of data relating to protein entities.
format Online
Article
Text
id pubmed-5210558
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-52105582017-01-05 Protein Ontology (PRO): enhancing and scaling up the representation of protein entities Natale, Darren A. Arighi, Cecilia N. Blake, Judith A. Bona, Jonathan Chen, Chuming Chen, Sheng-Chih Christie, Karen R. Cowart, Julie D'Eustachio, Peter Diehl, Alexander D. Drabkin, Harold J. Duncan, William D. Huang, Hongzhan Ren, Jia Ross, Karen Ruttenberg, Alan Shamovsky, Veronica Smith, Barry Wang, Qinghua Zhang, Jian El-Sayed, Abdelrahman Wu, Cathy H. Nucleic Acids Res Database Issue The Protein Ontology (PRO; http://purl.obolibrary.org/obo/pr) formally defines and describes taxon-specific and taxon-neutral protein-related entities in three major areas: proteins related by evolution; proteins produced from a given gene; and protein-containing complexes. PRO thus serves as a tool for referencing protein entities at any level of specificity. To enhance this ability, and to facilitate the comparison of such entities described in different resources, we developed a standardized representation of proteoforms using UniProtKB as a sequence reference and PSI-MOD as a post-translational modification reference. We illustrate its use in facilitating an alignment between PRO and Reactome protein entities. We also address issues of scalability, describing our first steps into the use of text mining to identify protein-related entities, the large-scale import of proteoform information from expert curated resources, and our ability to dynamically generate PRO terms. Web views for individual terms are now more informative about closely-related terms, including for example an interactive multiple sequence alignment. Finally, we describe recent improvement in semantic utility, with PRO now represented in OWL and as a SPARQL endpoint. These developments will further support the anticipated growth of PRO and facilitate discoverability of and allow aggregation of data relating to protein entities. Oxford University Press 2017-01-04 2016-11-28 /pmc/articles/PMC5210558/ /pubmed/27899649 http://dx.doi.org/10.1093/nar/gkw1075 Text en © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Database Issue
Natale, Darren A.
Arighi, Cecilia N.
Blake, Judith A.
Bona, Jonathan
Chen, Chuming
Chen, Sheng-Chih
Christie, Karen R.
Cowart, Julie
D'Eustachio, Peter
Diehl, Alexander D.
Drabkin, Harold J.
Duncan, William D.
Huang, Hongzhan
Ren, Jia
Ross, Karen
Ruttenberg, Alan
Shamovsky, Veronica
Smith, Barry
Wang, Qinghua
Zhang, Jian
El-Sayed, Abdelrahman
Wu, Cathy H.
Protein Ontology (PRO): enhancing and scaling up the representation of protein entities
title Protein Ontology (PRO): enhancing and scaling up the representation of protein entities
title_full Protein Ontology (PRO): enhancing and scaling up the representation of protein entities
title_fullStr Protein Ontology (PRO): enhancing and scaling up the representation of protein entities
title_full_unstemmed Protein Ontology (PRO): enhancing and scaling up the representation of protein entities
title_short Protein Ontology (PRO): enhancing and scaling up the representation of protein entities
title_sort protein ontology (pro): enhancing and scaling up the representation of protein entities
topic Database Issue
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5210558/
https://www.ncbi.nlm.nih.gov/pubmed/27899649
http://dx.doi.org/10.1093/nar/gkw1075
work_keys_str_mv AT nataledarrena proteinontologyproenhancingandscalinguptherepresentationofproteinentities
AT arighicecilian proteinontologyproenhancingandscalinguptherepresentationofproteinentities
AT blakejuditha proteinontologyproenhancingandscalinguptherepresentationofproteinentities
AT bonajonathan proteinontologyproenhancingandscalinguptherepresentationofproteinentities
AT chenchuming proteinontologyproenhancingandscalinguptherepresentationofproteinentities
AT chenshengchih proteinontologyproenhancingandscalinguptherepresentationofproteinentities
AT christiekarenr proteinontologyproenhancingandscalinguptherepresentationofproteinentities
AT cowartjulie proteinontologyproenhancingandscalinguptherepresentationofproteinentities
AT deustachiopeter proteinontologyproenhancingandscalinguptherepresentationofproteinentities
AT diehlalexanderd proteinontologyproenhancingandscalinguptherepresentationofproteinentities
AT drabkinharoldj proteinontologyproenhancingandscalinguptherepresentationofproteinentities
AT duncanwilliamd proteinontologyproenhancingandscalinguptherepresentationofproteinentities
AT huanghongzhan proteinontologyproenhancingandscalinguptherepresentationofproteinentities
AT renjia proteinontologyproenhancingandscalinguptherepresentationofproteinentities
AT rosskaren proteinontologyproenhancingandscalinguptherepresentationofproteinentities
AT ruttenbergalan proteinontologyproenhancingandscalinguptherepresentationofproteinentities
AT shamovskyveronica proteinontologyproenhancingandscalinguptherepresentationofproteinentities
AT smithbarry proteinontologyproenhancingandscalinguptherepresentationofproteinentities
AT wangqinghua proteinontologyproenhancingandscalinguptherepresentationofproteinentities
AT zhangjian proteinontologyproenhancingandscalinguptherepresentationofproteinentities
AT elsayedabdelrahman proteinontologyproenhancingandscalinguptherepresentationofproteinentities
AT wucathyh proteinontologyproenhancingandscalinguptherepresentationofproteinentities