Cargando…

TGF-beta signaling proteins and the Protein Ontology

BACKGROUND: The Protein Ontology (PRO) is designed as a formal and principled Open Biomedical Ontologies (OBO) Foundry ontology for proteins. The components of PRO extend from a classification of proteins on the basis of evolutionary relationships at the homeomorphic level to the representation of t...

Descripción completa

Detalles Bibliográficos
Autores principales: Arighi, Cecilia N, Liu, Hongfang, Natale, Darren A, Barker, Winona C, Drabkin, Harold, Blake, Judith A, Smith, Barry, Wu, Cathy H
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2679403/
https://www.ncbi.nlm.nih.gov/pubmed/19426460
http://dx.doi.org/10.1186/1471-2105-10-S5-S3
_version_ 1782166891890999296
author Arighi, Cecilia N
Liu, Hongfang
Natale, Darren A
Barker, Winona C
Drabkin, Harold
Blake, Judith A
Smith, Barry
Wu, Cathy H
author_facet Arighi, Cecilia N
Liu, Hongfang
Natale, Darren A
Barker, Winona C
Drabkin, Harold
Blake, Judith A
Smith, Barry
Wu, Cathy H
author_sort Arighi, Cecilia N
collection PubMed
description BACKGROUND: The Protein Ontology (PRO) is designed as a formal and principled Open Biomedical Ontologies (OBO) Foundry ontology for proteins. The components of PRO extend from a classification of proteins on the basis of evolutionary relationships at the homeomorphic level to the representation of the multiple protein forms of a gene, including those resulting from alternative splicing, cleavage and/or post-translational modifications. Focusing specifically on the TGF-beta signaling proteins, we describe the building, curation, usage and dissemination of PRO. RESULTS: PRO is manually curated on the basis of PrePRO, an automatically generated file with content derived from standard protein data sources. Manual curation ensures that the treatment of the protein classes and the internal and external relationships conform to the PRO framework. The current release of PRO is based upon experimental data from mouse and human proteins wherein equivalent protein forms are represented by single terms. In addition to the PRO ontology, the annotation of PRO terms is released as a separate PRO association file, which contains, for each given PRO term, an annotation from the experimentally characterized sub-types as well as the corresponding database identifiers and sequence coordinates. The annotations are added in the form of relationship to other ontologies. Whenever possible, equivalent forms in other species are listed to facilitate cross-species comparison. Splice and allelic variants, gene fusion products and modified protein forms are all represented as entities in the ontology. Therefore, PRO provides for the representation of protein entities and a resource for describing the associated data. This makes PRO useful both for proteomics studies where isoforms and modified forms must be differentiated, and for studies of biological pathways, where representations need to take account of the different ways in which the cascade of events may depend on specific protein modifications. CONCLUSION: PRO provides a framework for the formal representation of protein classes and protein forms in the OBO Foundry. It is designed to enable data retrieval and integration and machine reasoning at the molecular level of proteins, thereby facilitating cross-species comparisons, pathway analysis, disease modeling and the generation of new hypotheses.
format Text
id pubmed-2679403
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-26794032009-05-09 TGF-beta signaling proteins and the Protein Ontology Arighi, Cecilia N Liu, Hongfang Natale, Darren A Barker, Winona C Drabkin, Harold Blake, Judith A Smith, Barry Wu, Cathy H BMC Bioinformatics Proceedings BACKGROUND: The Protein Ontology (PRO) is designed as a formal and principled Open Biomedical Ontologies (OBO) Foundry ontology for proteins. The components of PRO extend from a classification of proteins on the basis of evolutionary relationships at the homeomorphic level to the representation of the multiple protein forms of a gene, including those resulting from alternative splicing, cleavage and/or post-translational modifications. Focusing specifically on the TGF-beta signaling proteins, we describe the building, curation, usage and dissemination of PRO. RESULTS: PRO is manually curated on the basis of PrePRO, an automatically generated file with content derived from standard protein data sources. Manual curation ensures that the treatment of the protein classes and the internal and external relationships conform to the PRO framework. The current release of PRO is based upon experimental data from mouse and human proteins wherein equivalent protein forms are represented by single terms. In addition to the PRO ontology, the annotation of PRO terms is released as a separate PRO association file, which contains, for each given PRO term, an annotation from the experimentally characterized sub-types as well as the corresponding database identifiers and sequence coordinates. The annotations are added in the form of relationship to other ontologies. Whenever possible, equivalent forms in other species are listed to facilitate cross-species comparison. Splice and allelic variants, gene fusion products and modified protein forms are all represented as entities in the ontology. Therefore, PRO provides for the representation of protein entities and a resource for describing the associated data. This makes PRO useful both for proteomics studies where isoforms and modified forms must be differentiated, and for studies of biological pathways, where representations need to take account of the different ways in which the cascade of events may depend on specific protein modifications. CONCLUSION: PRO provides a framework for the formal representation of protein classes and protein forms in the OBO Foundry. It is designed to enable data retrieval and integration and machine reasoning at the molecular level of proteins, thereby facilitating cross-species comparisons, pathway analysis, disease modeling and the generation of new hypotheses. BioMed Central 2009-05-06 /pmc/articles/PMC2679403/ /pubmed/19426460 http://dx.doi.org/10.1186/1471-2105-10-S5-S3 Text en Copyright © 2009 Arighi et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Proceedings
Arighi, Cecilia N
Liu, Hongfang
Natale, Darren A
Barker, Winona C
Drabkin, Harold
Blake, Judith A
Smith, Barry
Wu, Cathy H
TGF-beta signaling proteins and the Protein Ontology
title TGF-beta signaling proteins and the Protein Ontology
title_full TGF-beta signaling proteins and the Protein Ontology
title_fullStr TGF-beta signaling proteins and the Protein Ontology
title_full_unstemmed TGF-beta signaling proteins and the Protein Ontology
title_short TGF-beta signaling proteins and the Protein Ontology
title_sort tgf-beta signaling proteins and the protein ontology
topic Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2679403/
https://www.ncbi.nlm.nih.gov/pubmed/19426460
http://dx.doi.org/10.1186/1471-2105-10-S5-S3
work_keys_str_mv AT arighicecilian tgfbetasignalingproteinsandtheproteinontology
AT liuhongfang tgfbetasignalingproteinsandtheproteinontology
AT nataledarrena tgfbetasignalingproteinsandtheproteinontology
AT barkerwinonac tgfbetasignalingproteinsandtheproteinontology
AT drabkinharold tgfbetasignalingproteinsandtheproteinontology
AT blakejuditha tgfbetasignalingproteinsandtheproteinontology
AT smithbarry tgfbetasignalingproteinsandtheproteinontology
AT wucathyh tgfbetasignalingproteinsandtheproteinontology