CDinFusion – Submission-Ready, On-Line Integration of Sequence and Contextual Data
State of the art (DNA) sequencing methods applied in “Omics” studies grant insight into the ‘blueprints’ of organisms from all domains of life. Sequencing is carried out around the globe and the data is submitted to the public repositories of the International Nucleotide Sequence Database Collaborat...
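Main authors: | Hankeln, Wolfgang; Wendel, Norma Johanna; Gerken, Jan; Waldmann, Jost; Buttigieg, Pier Luigi; Kostadinov, Ivaylo; Kottmann, Renzo; Yilmaz, Pelin; Glöckner, Frank Oliver |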
---|---|
Format: | Online Article Text |
Language: | English |
Published: | Public Library of Science, 2011 |
Subjects: | Research Article |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3172294/ https://www.ncbi.nlm.nih.gov/pubmed/21935468 http://dx.doi.org/10.1371/journal.pone.0024797 |
_version_ | 1782211856854679552 |
---|---|
author | Hankeln, Wolfgang; Wendel, Norma Johanna; Gerken, Jan; Waldmann, Jost; Buttigieg, Pier Luigi; Kostadinov, Ivaylo; Kottmann, Renzo; Yilmaz, Pelin; Glöckner, Frank Oliver |
author_facet | Hankeln, Wolfgang; Wendel, Norma Johanna; Gerken, Jan; Waldmann, Jost; Buttigieg, Pier Luigi; Kostadinov, Ivaylo; Kottmann, Renzo; Yilmaz, Pelin; Glöckner, Frank Oliver |
author_sort | Hankeln, Wolfgang |
collection | PubMed |
description | State of the art (DNA) sequencing methods applied in “Omics” studies grant insight into the ‘blueprints’ of organisms from all domains of life. Sequencing is carried out around the globe and the data is submitted to the public repositories of the International Nucleotide Sequence Database Collaboration. However, the context in which these studies are conducted often gets lost, because experimental data, as well as information about the environment are rarely submitted along with the sequence data. If these contextual or metadata are missing, key opportunities of comparison and analysis across studies and habitats are hampered or even impossible. To address this problem, the Genomic Standards Consortium (GSC) promotes checklists and standards to better describe our sequence data collection and to promote the capturing, exchange and integration of sequence data with contextual data. In a recent community effort the GSC has developed a series of recommendations for contextual data that should be submitted along with sequence data. To support the scientific community to significantly enhance the quality and quantity of contextual data in the public sequence data repositories, specialized software tools are needed. In this work we present CDinFusion, a web-based tool to integrate contextual and sequence data in (Multi)FASTA format prior to submission. The tool is open source and available under the Lesser GNU Public License 3. A public installation is hosted and maintained at the Max Planck Institute for Marine Microbiology at http://www.megx.net/cdinfusion. The tool may also be installed locally using the open source code available at http://code.google.com/p/cdinfusion. |
format | Online Article Text |
id | pubmed-3172294 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2011 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-3172294 2011-09-20 CDinFusion – Submission-Ready, On-Line Integration of Sequence and Contextual Data Hankeln, Wolfgang Wendel, Norma Johanna Gerken, Jan Waldmann, Jost Buttigieg, Pier Luigi Kostadinov, Ivaylo Kottmann, Renzo Yilmaz, Pelin Glöckner, Frank Oliver PLoS One Research Article State of the art (DNA) sequencing methods applied in “Omics” studies grant insight into the ‘blueprints’ of organisms from all domains of life. Sequencing is carried out around the globe and the data is submitted to the public repositories of the International Nucleotide Sequence Database Collaboration. However, the context in which these studies are conducted often gets lost, because experimental data, as well as information about the environment are rarely submitted along with the sequence data. If these contextual or metadata are missing, key opportunities of comparison and analysis across studies and habitats are hampered or even impossible. To address this problem, the Genomic Standards Consortium (GSC) promotes checklists and standards to better describe our sequence data collection and to promote the capturing, exchange and integration of sequence data with contextual data. In a recent community effort the GSC has developed a series of recommendations for contextual data that should be submitted along with sequence data. To support the scientific community to significantly enhance the quality and quantity of contextual data in the public sequence data repositories, specialized software tools are needed. In this work we present CDinFusion, a web-based tool to integrate contextual and sequence data in (Multi)FASTA format prior to submission. The tool is open source and available under the Lesser GNU Public License 3. A public installation is hosted and maintained at the Max Planck Institute for Marine Microbiology at http://www.megx.net/cdinfusion. The tool may also be installed locally using the open source code available at http://code.google.com/p/cdinfusion. Public Library of Science 2011-09-13 /pmc/articles/PMC3172294/ /pubmed/21935468 http://dx.doi.org/10.1371/journal.pone.0024797 Text en Hankeln et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited. |
spellingShingle | Research Article; Hankeln, Wolfgang; Wendel, Norma Johanna; Gerken, Jan; Waldmann, Jost; Buttigieg, Pier Luigi; Kostadinov, Ivaylo; Kottmann, Renzo; Yilmaz, Pelin; Glöckner, Frank Oliver; CDinFusion – Submission-Ready, On-Line Integration of Sequence and Contextual Data |
title | CDinFusion – Submission-Ready, On-Line Integration of Sequence and Contextual Data |
title_full | CDinFusion – Submission-Ready, On-Line Integration of Sequence and Contextual Data |
title_fullStr | CDinFusion – Submission-Ready, On-Line Integration of Sequence and Contextual Data |
title_full_unstemmed | CDinFusion – Submission-Ready, On-Line Integration of Sequence and Contextual Data |
title_short | CDinFusion – Submission-Ready, On-Line Integration of Sequence and Contextual Data |
title_sort | cdinfusion – submission-ready, on-line integration of sequence and contextual data |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3172294/ https://www.ncbi.nlm.nih.gov/pubmed/21935468 http://dx.doi.org/10.1371/journal.pone.0024797 |
work_keys_str_mv | AT hankelnwolfgang cdinfusionsubmissionreadyonlineintegrationofsequenceandcontextualdata AT wendelnormajohanna cdinfusionsubmissionreadyonlineintegrationofsequenceandcontextualdata AT gerkenjan cdinfusionsubmissionreadyonlineintegrationofsequenceandcontextualdata AT waldmannjost cdinfusionsubmissionreadyonlineintegrationofsequenceandcontextualdata AT buttigiegpierluigi cdinfusionsubmissionreadyonlineintegrationofsequenceandcontextualdata AT kostadinovivaylo cdinfusionsubmissionreadyonlineintegrationofsequenceandcontextualdata AT kottmannrenzo cdinfusionsubmissionreadyonlineintegrationofsequenceandcontextualdata AT yilmazpelin cdinfusionsubmissionreadyonlineintegrationofsequenceandcontextualdata AT glocknerfrankoliver cdinfusionsubmissionreadyonlineintegrationofsequenceandcontextualdata |