Cargando…

dcGOR: An R Package for Analysing Ontologies and Protein Domain Annotations

I introduce an open-source R package ‘dcGOR’ to provide the bioinformatics community with the ease to analyse ontologies and protein domain annotations, particularly those in the dcGO database. The dcGO is a comprehensive resource for protein domain annotations using a panel of ontologies including...

Descripción completa

Detalles Bibliográficos
Autor principal: Fang, Hai
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4214615/
https://www.ncbi.nlm.nih.gov/pubmed/25356683
http://dx.doi.org/10.1371/journal.pcbi.1003929
_version_ 1782341983880085504
author Fang, Hai
author_facet Fang, Hai
author_sort Fang, Hai
collection PubMed
description I introduce an open-source R package ‘dcGOR’ to provide the bioinformatics community with the ease to analyse ontologies and protein domain annotations, particularly those in the dcGO database. The dcGO is a comprehensive resource for protein domain annotations using a panel of ontologies including Gene Ontology. Although increasing in popularity, this database needs statistical and graphical support to meet its full potential. Moreover, there are no bioinformatics tools specifically designed for domain ontology analysis. As an add-on package built in the R software environment, dcGOR offers a basic infrastructure with great flexibility and functionality. It implements new data structure to represent domains, ontologies, annotations, and all analytical outputs as well. For each ontology, it provides various mining facilities, including: (i) domain-based enrichment analysis and visualisation; (ii) construction of a domain (semantic similarity) network according to ontology annotations; and (iii) significance analysis for estimating a contact (statistical significance) network. To reduce runtime, most analyses support high-performance parallel computing. Taking as inputs a list of protein domains of interest, the package is able to easily carry out in-depth analyses in terms of functional, phenotypic and diseased relevance, and network-level understanding. More importantly, dcGOR is designed to allow users to import and analyse their own ontologies and annotations on domains (taken from SCOP, Pfam and InterPro) and RNAs (from Rfam) as well. The package is freely available at CRAN for easy installation, and also at GitHub for version control. The dedicated website with reproducible demos can be found at http://supfam.org/dcGOR.
format Online
Article
Text
id pubmed-4214615
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-42146152014-11-05 dcGOR: An R Package for Analysing Ontologies and Protein Domain Annotations Fang, Hai PLoS Comput Biol Research Article I introduce an open-source R package ‘dcGOR’ to provide the bioinformatics community with the ease to analyse ontologies and protein domain annotations, particularly those in the dcGO database. The dcGO is a comprehensive resource for protein domain annotations using a panel of ontologies including Gene Ontology. Although increasing in popularity, this database needs statistical and graphical support to meet its full potential. Moreover, there are no bioinformatics tools specifically designed for domain ontology analysis. As an add-on package built in the R software environment, dcGOR offers a basic infrastructure with great flexibility and functionality. It implements new data structure to represent domains, ontologies, annotations, and all analytical outputs as well. For each ontology, it provides various mining facilities, including: (i) domain-based enrichment analysis and visualisation; (ii) construction of a domain (semantic similarity) network according to ontology annotations; and (iii) significance analysis for estimating a contact (statistical significance) network. To reduce runtime, most analyses support high-performance parallel computing. Taking as inputs a list of protein domains of interest, the package is able to easily carry out in-depth analyses in terms of functional, phenotypic and diseased relevance, and network-level understanding. More importantly, dcGOR is designed to allow users to import and analyse their own ontologies and annotations on domains (taken from SCOP, Pfam and InterPro) and RNAs (from Rfam) as well. The package is freely available at CRAN for easy installation, and also at GitHub for version control. The dedicated website with reproducible demos can be found at http://supfam.org/dcGOR. Public Library of Science 2014-10-30 /pmc/articles/PMC4214615/ /pubmed/25356683 http://dx.doi.org/10.1371/journal.pcbi.1003929 Text en © 2014 Hai Fang http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Fang, Hai
dcGOR: An R Package for Analysing Ontologies and Protein Domain Annotations
title dcGOR: An R Package for Analysing Ontologies and Protein Domain Annotations
title_full dcGOR: An R Package for Analysing Ontologies and Protein Domain Annotations
title_fullStr dcGOR: An R Package for Analysing Ontologies and Protein Domain Annotations
title_full_unstemmed dcGOR: An R Package for Analysing Ontologies and Protein Domain Annotations
title_short dcGOR: An R Package for Analysing Ontologies and Protein Domain Annotations
title_sort dcgor: an r package for analysing ontologies and protein domain annotations
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4214615/
https://www.ncbi.nlm.nih.gov/pubmed/25356683
http://dx.doi.org/10.1371/journal.pcbi.1003929
work_keys_str_mv AT fanghai dcgoranrpackageforanalysingontologiesandproteindomainannotations