Cargando…
Dintor: functional annotation of genomic and proteomic data
BACKGROUND: During the last decade, a great number of extremely valuable large-scale genomics and proteomics datasets have become available to the research community. In addition, dropping costs for conducting high-throughput sequencing experiments and the option to outsource them considerably contr...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4687148/ https://www.ncbi.nlm.nih.gov/pubmed/26691694 http://dx.doi.org/10.1186/s12864-015-2279-5 |
_version_ | 1782406572695093248 |
---|---|
author | Weichenberger, Christian X. Blankenburg, Hagen Palermo, Antonia D’Elia, Yuri König, Eva Bernstein, Erik Domingues, Francisco S. |
author_facet | Weichenberger, Christian X. Blankenburg, Hagen Palermo, Antonia D’Elia, Yuri König, Eva Bernstein, Erik Domingues, Francisco S. |
author_sort | Weichenberger, Christian X. |
collection | PubMed |
description | BACKGROUND: During the last decade, a great number of extremely valuable large-scale genomics and proteomics datasets have become available to the research community. In addition, dropping costs for conducting high-throughput sequencing experiments and the option to outsource them considerably contribute to an increasing number of researchers becoming active in this field. Even though various computational approaches have been developed to analyze these data, it is still a laborious task involving prudent integration of many heterogeneous and frequently updated data sources, creating a barrier for interested scientists to accomplish their own analysis. RESULTS: We have implemented Dintor, a data integration framework that provides a set of over 30 tools to assist researchers in the exploration of genomics and proteomics datasets. Each of the tools solves a particular task and several tools can be combined into data processing pipelines. Dintor covers a wide range of frequently required functionalities, from gene identifier conversions and orthology mappings to functional annotation of proteins and genetic variants up to candidate gene prioritization and Gene Ontology-based gene set enrichment analysis. Since the tools operate on constantly changing datasets, we provide a mechanism to unambiguously link tools with different versions of archived datasets, which guarantees reproducible results for future tool invocations. We demonstrate a selection of Dintor’s capabilities by analyzing datasets from four representative publications. The open source software can be downloaded and installed on a local Unix machine. For reasons of data privacy it can be configured to retrieve local data only. In addition, the Dintor tools are available on our public Galaxy web service at http://dintor.eurac.edu. CONCLUSIONS: Dintor is a computational annotation framework for the analysis of genomic and proteomic datasets, providing a rich set of tools that cover the most frequently encountered tasks. A major advantage is its capability to consistently handle multiple versions of tool-associated datasets, supporting the researcher in delivering reproducible results. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-015-2279-5) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-4687148 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-46871482015-12-23 Dintor: functional annotation of genomic and proteomic data Weichenberger, Christian X. Blankenburg, Hagen Palermo, Antonia D’Elia, Yuri König, Eva Bernstein, Erik Domingues, Francisco S. BMC Genomics Software BACKGROUND: During the last decade, a great number of extremely valuable large-scale genomics and proteomics datasets have become available to the research community. In addition, dropping costs for conducting high-throughput sequencing experiments and the option to outsource them considerably contribute to an increasing number of researchers becoming active in this field. Even though various computational approaches have been developed to analyze these data, it is still a laborious task involving prudent integration of many heterogeneous and frequently updated data sources, creating a barrier for interested scientists to accomplish their own analysis. RESULTS: We have implemented Dintor, a data integration framework that provides a set of over 30 tools to assist researchers in the exploration of genomics and proteomics datasets. Each of the tools solves a particular task and several tools can be combined into data processing pipelines. Dintor covers a wide range of frequently required functionalities, from gene identifier conversions and orthology mappings to functional annotation of proteins and genetic variants up to candidate gene prioritization and Gene Ontology-based gene set enrichment analysis. Since the tools operate on constantly changing datasets, we provide a mechanism to unambiguously link tools with different versions of archived datasets, which guarantees reproducible results for future tool invocations. We demonstrate a selection of Dintor’s capabilities by analyzing datasets from four representative publications. The open source software can be downloaded and installed on a local Unix machine. For reasons of data privacy it can be configured to retrieve local data only. In addition, the Dintor tools are available on our public Galaxy web service at http://dintor.eurac.edu. CONCLUSIONS: Dintor is a computational annotation framework for the analysis of genomic and proteomic datasets, providing a rich set of tools that cover the most frequently encountered tasks. A major advantage is its capability to consistently handle multiple versions of tool-associated datasets, supporting the researcher in delivering reproducible results. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-015-2279-5) contains supplementary material, which is available to authorized users. BioMed Central 2015-12-21 /pmc/articles/PMC4687148/ /pubmed/26691694 http://dx.doi.org/10.1186/s12864-015-2279-5 Text en © Weichenberger et al. 2015 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Software Weichenberger, Christian X. Blankenburg, Hagen Palermo, Antonia D’Elia, Yuri König, Eva Bernstein, Erik Domingues, Francisco S. Dintor: functional annotation of genomic and proteomic data |
title | Dintor: functional annotation of genomic and proteomic data |
title_full | Dintor: functional annotation of genomic and proteomic data |
title_fullStr | Dintor: functional annotation of genomic and proteomic data |
title_full_unstemmed | Dintor: functional annotation of genomic and proteomic data |
title_short | Dintor: functional annotation of genomic and proteomic data |
title_sort | dintor: functional annotation of genomic and proteomic data |
topic | Software |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4687148/ https://www.ncbi.nlm.nih.gov/pubmed/26691694 http://dx.doi.org/10.1186/s12864-015-2279-5 |
work_keys_str_mv | AT weichenbergerchristianx dintorfunctionalannotationofgenomicandproteomicdata AT blankenburghagen dintorfunctionalannotationofgenomicandproteomicdata AT palermoantonia dintorfunctionalannotationofgenomicandproteomicdata AT deliayuri dintorfunctionalannotationofgenomicandproteomicdata AT konigeva dintorfunctionalannotationofgenomicandproteomicdata AT bernsteinerik dintorfunctionalannotationofgenomicandproteomicdata AT dominguesfranciscos dintorfunctionalannotationofgenomicandproteomicdata |