Cargando…
dcGOR: An R Package for Analysing Ontologies and Protein Domain Annotations
I introduce an open-source R package ‘dcGOR’ to provide the bioinformatics community with the ease to analyse ontologies and protein domain annotations, particularly those in the dcGO database. The dcGO is a comprehensive resource for protein domain annotations using a panel of ontologies including...
Autor principal: | |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4214615/ https://www.ncbi.nlm.nih.gov/pubmed/25356683 http://dx.doi.org/10.1371/journal.pcbi.1003929 |
_version_ | 1782341983880085504 |
---|---|
author | Fang, Hai |
author_facet | Fang, Hai |
author_sort | Fang, Hai |
collection | PubMed |
description | I introduce an open-source R package ‘dcGOR’ to provide the bioinformatics community with the ease to analyse ontologies and protein domain annotations, particularly those in the dcGO database. The dcGO is a comprehensive resource for protein domain annotations using a panel of ontologies including Gene Ontology. Although increasing in popularity, this database needs statistical and graphical support to meet its full potential. Moreover, there are no bioinformatics tools specifically designed for domain ontology analysis. As an add-on package built in the R software environment, dcGOR offers a basic infrastructure with great flexibility and functionality. It implements new data structure to represent domains, ontologies, annotations, and all analytical outputs as well. For each ontology, it provides various mining facilities, including: (i) domain-based enrichment analysis and visualisation; (ii) construction of a domain (semantic similarity) network according to ontology annotations; and (iii) significance analysis for estimating a contact (statistical significance) network. To reduce runtime, most analyses support high-performance parallel computing. Taking as inputs a list of protein domains of interest, the package is able to easily carry out in-depth analyses in terms of functional, phenotypic and diseased relevance, and network-level understanding. More importantly, dcGOR is designed to allow users to import and analyse their own ontologies and annotations on domains (taken from SCOP, Pfam and InterPro) and RNAs (from Rfam) as well. The package is freely available at CRAN for easy installation, and also at GitHub for version control. The dedicated website with reproducible demos can be found at http://supfam.org/dcGOR. |
format | Online Article Text |
id | pubmed-4214615 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-42146152014-11-05 dcGOR: An R Package for Analysing Ontologies and Protein Domain Annotations Fang, Hai PLoS Comput Biol Research Article I introduce an open-source R package ‘dcGOR’ to provide the bioinformatics community with the ease to analyse ontologies and protein domain annotations, particularly those in the dcGO database. The dcGO is a comprehensive resource for protein domain annotations using a panel of ontologies including Gene Ontology. Although increasing in popularity, this database needs statistical and graphical support to meet its full potential. Moreover, there are no bioinformatics tools specifically designed for domain ontology analysis. As an add-on package built in the R software environment, dcGOR offers a basic infrastructure with great flexibility and functionality. It implements new data structure to represent domains, ontologies, annotations, and all analytical outputs as well. For each ontology, it provides various mining facilities, including: (i) domain-based enrichment analysis and visualisation; (ii) construction of a domain (semantic similarity) network according to ontology annotations; and (iii) significance analysis for estimating a contact (statistical significance) network. To reduce runtime, most analyses support high-performance parallel computing. Taking as inputs a list of protein domains of interest, the package is able to easily carry out in-depth analyses in terms of functional, phenotypic and diseased relevance, and network-level understanding. More importantly, dcGOR is designed to allow users to import and analyse their own ontologies and annotations on domains (taken from SCOP, Pfam and InterPro) and RNAs (from Rfam) as well. The package is freely available at CRAN for easy installation, and also at GitHub for version control. The dedicated website with reproducible demos can be found at http://supfam.org/dcGOR. Public Library of Science 2014-10-30 /pmc/articles/PMC4214615/ /pubmed/25356683 http://dx.doi.org/10.1371/journal.pcbi.1003929 Text en © 2014 Hai Fang http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited. |
spellingShingle | Research Article Fang, Hai dcGOR: An R Package for Analysing Ontologies and Protein Domain Annotations |
title | dcGOR: An R Package for Analysing Ontologies and Protein Domain Annotations |
title_full | dcGOR: An R Package for Analysing Ontologies and Protein Domain Annotations |
title_fullStr | dcGOR: An R Package for Analysing Ontologies and Protein Domain Annotations |
title_full_unstemmed | dcGOR: An R Package for Analysing Ontologies and Protein Domain Annotations |
title_short | dcGOR: An R Package for Analysing Ontologies and Protein Domain Annotations |
title_sort | dcgor: an r package for analysing ontologies and protein domain annotations |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4214615/ https://www.ncbi.nlm.nih.gov/pubmed/25356683 http://dx.doi.org/10.1371/journal.pcbi.1003929 |
work_keys_str_mv | AT fanghai dcgoranrpackageforanalysingontologiesandproteindomainannotations |