Cargando…

Dintor: functional annotation of genomic and proteomic data

BACKGROUND: During the last decade, a great number of extremely valuable large-scale genomics and proteomics datasets have become available to the research community. In addition, dropping costs for conducting high-throughput sequencing experiments and the option to outsource them considerably contr...

Descripción completa

Detalles Bibliográficos
Autores principales: Weichenberger, Christian X., Blankenburg, Hagen, Palermo, Antonia, D’Elia, Yuri, König, Eva, Bernstein, Erik, Domingues, Francisco S.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4687148/
https://www.ncbi.nlm.nih.gov/pubmed/26691694
http://dx.doi.org/10.1186/s12864-015-2279-5
_version_ 1782406572695093248
author Weichenberger, Christian X.
Blankenburg, Hagen
Palermo, Antonia
D’Elia, Yuri
König, Eva
Bernstein, Erik
Domingues, Francisco S.
author_facet Weichenberger, Christian X.
Blankenburg, Hagen
Palermo, Antonia
D’Elia, Yuri
König, Eva
Bernstein, Erik
Domingues, Francisco S.
author_sort Weichenberger, Christian X.
collection PubMed
description BACKGROUND: During the last decade, a great number of extremely valuable large-scale genomics and proteomics datasets have become available to the research community. In addition, dropping costs for conducting high-throughput sequencing experiments and the option to outsource them considerably contribute to an increasing number of researchers becoming active in this field. Even though various computational approaches have been developed to analyze these data, it is still a laborious task involving prudent integration of many heterogeneous and frequently updated data sources, creating a barrier for interested scientists to accomplish their own analysis. RESULTS: We have implemented Dintor, a data integration framework that provides a set of over 30 tools to assist researchers in the exploration of genomics and proteomics datasets. Each of the tools solves a particular task and several tools can be combined into data processing pipelines. Dintor covers a wide range of frequently required functionalities, from gene identifier conversions and orthology mappings to functional annotation of proteins and genetic variants up to candidate gene prioritization and Gene Ontology-based gene set enrichment analysis. Since the tools operate on constantly changing datasets, we provide a mechanism to unambiguously link tools with different versions of archived datasets, which guarantees reproducible results for future tool invocations. We demonstrate a selection of Dintor’s capabilities by analyzing datasets from four representative publications. The open source software can be downloaded and installed on a local Unix machine. For reasons of data privacy it can be configured to retrieve local data only. In addition, the Dintor tools are available on our public Galaxy web service at http://dintor.eurac.edu. CONCLUSIONS: Dintor is a computational annotation framework for the analysis of genomic and proteomic datasets, providing a rich set of tools that cover the most frequently encountered tasks. A major advantage is its capability to consistently handle multiple versions of tool-associated datasets, supporting the researcher in delivering reproducible results. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-015-2279-5) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4687148
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-46871482015-12-23 Dintor: functional annotation of genomic and proteomic data Weichenberger, Christian X. Blankenburg, Hagen Palermo, Antonia D’Elia, Yuri König, Eva Bernstein, Erik Domingues, Francisco S. BMC Genomics Software BACKGROUND: During the last decade, a great number of extremely valuable large-scale genomics and proteomics datasets have become available to the research community. In addition, dropping costs for conducting high-throughput sequencing experiments and the option to outsource them considerably contribute to an increasing number of researchers becoming active in this field. Even though various computational approaches have been developed to analyze these data, it is still a laborious task involving prudent integration of many heterogeneous and frequently updated data sources, creating a barrier for interested scientists to accomplish their own analysis. RESULTS: We have implemented Dintor, a data integration framework that provides a set of over 30 tools to assist researchers in the exploration of genomics and proteomics datasets. Each of the tools solves a particular task and several tools can be combined into data processing pipelines. Dintor covers a wide range of frequently required functionalities, from gene identifier conversions and orthology mappings to functional annotation of proteins and genetic variants up to candidate gene prioritization and Gene Ontology-based gene set enrichment analysis. Since the tools operate on constantly changing datasets, we provide a mechanism to unambiguously link tools with different versions of archived datasets, which guarantees reproducible results for future tool invocations. We demonstrate a selection of Dintor’s capabilities by analyzing datasets from four representative publications. The open source software can be downloaded and installed on a local Unix machine. For reasons of data privacy it can be configured to retrieve local data only. In addition, the Dintor tools are available on our public Galaxy web service at http://dintor.eurac.edu. CONCLUSIONS: Dintor is a computational annotation framework for the analysis of genomic and proteomic datasets, providing a rich set of tools that cover the most frequently encountered tasks. A major advantage is its capability to consistently handle multiple versions of tool-associated datasets, supporting the researcher in delivering reproducible results. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-015-2279-5) contains supplementary material, which is available to authorized users. BioMed Central 2015-12-21 /pmc/articles/PMC4687148/ /pubmed/26691694 http://dx.doi.org/10.1186/s12864-015-2279-5 Text en © Weichenberger et al. 2015 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Software
Weichenberger, Christian X.
Blankenburg, Hagen
Palermo, Antonia
D’Elia, Yuri
König, Eva
Bernstein, Erik
Domingues, Francisco S.
Dintor: functional annotation of genomic and proteomic data
title Dintor: functional annotation of genomic and proteomic data
title_full Dintor: functional annotation of genomic and proteomic data
title_fullStr Dintor: functional annotation of genomic and proteomic data
title_full_unstemmed Dintor: functional annotation of genomic and proteomic data
title_short Dintor: functional annotation of genomic and proteomic data
title_sort dintor: functional annotation of genomic and proteomic data
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4687148/
https://www.ncbi.nlm.nih.gov/pubmed/26691694
http://dx.doi.org/10.1186/s12864-015-2279-5
work_keys_str_mv AT weichenbergerchristianx dintorfunctionalannotationofgenomicandproteomicdata
AT blankenburghagen dintorfunctionalannotationofgenomicandproteomicdata
AT palermoantonia dintorfunctionalannotationofgenomicandproteomicdata
AT deliayuri dintorfunctionalannotationofgenomicandproteomicdata
AT konigeva dintorfunctionalannotationofgenomicandproteomicdata
AT bernsteinerik dintorfunctionalannotationofgenomicandproteomicdata
AT dominguesfranciscos dintorfunctionalannotationofgenomicandproteomicdata