Cargando…
OMeta: an ontology-based, data-driven metadata tracking system
BACKGROUND: The development of high-throughput sequencing and analysis has accelerated multi-omics studies of thousands of microbial species, metagenomes, and infectious disease pathogens. Omics studies are enabling genotype-phenotype association studies which identify genetic determinants of pathog...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6322262/ https://www.ncbi.nlm.nih.gov/pubmed/30612540 http://dx.doi.org/10.1186/s12859-018-2580-9 |
_version_ | 1783385584979410944 |
---|---|
author | Singh, Indresh Kuscuoglu, Mehmet Harkins, Derek M. Sutton, Granger Fouts, Derrick E. Nelson, Karen E. |
author_facet | Singh, Indresh Kuscuoglu, Mehmet Harkins, Derek M. Sutton, Granger Fouts, Derrick E. Nelson, Karen E. |
author_sort | Singh, Indresh |
collection | PubMed |
description | BACKGROUND: The development of high-throughput sequencing and analysis has accelerated multi-omics studies of thousands of microbial species, metagenomes, and infectious disease pathogens. Omics studies are enabling genotype-phenotype association studies which identify genetic determinants of pathogen virulence and drug resistance, as well as phylogenetic studies designed to track the origin and spread of disease outbreaks. These omics studies are complex and often employ multiple assay technologies including genomics, metagenomics, transcriptomics, proteomics, and metabolomics. To maximize the impact of omics studies, it is essential that data be accompanied by detailed contextual metadata (e.g., specimen, spatial-temporal, phenotypic characteristics) in clear, organized, and consistent formats. Over the years, many metadata standards developed by various metadata standards initiatives have arisen; the Genomic Standards Consortium’s minimal information standards (MIxS), the GSCID/BRC Project and Sample Application Standard. Some tools exist for tracking metadata, but they do not provide event based capabilities to configure, collect, validate, and distribute metadata. To address this gap in the scientific community, an event based data-driven application, OMeta, was created that allows users to quickly configure, collect, validate, distribute, and integrate metadata. RESULTS: A data-driven web application, OMeta, has been developed for use by researchers consisting of a browser-based interface, a command-line interface (CLI), and server-side components that provide an intuitive platform for configuring, capturing, viewing, and sharing metadata. Project and sample metadata can be set based on existing standards or based on projects goals. Recorded information includes details on the biological samples, procedures, protocols, and experimental technologies, etc. This information can be organized based on events, including sample collection, sample quantification, sequencing assay, and analysis results. OMeta enables configuration in various presentation types: checkbox, file, drop-box, ontology, and fields can be configured to use the National Center for Biomedical Ontology (NCBO), a biomedical ontology server. Furthermore, OMeta maintains a complete audit trail of all changes made by users and allows metadata export in comma separated value (CSV) format for convenient deposition of data into public databases. CONCLUSIONS: We present, OMeta, a web-based software application that is built on data-driven principles for configuring and customizing data standards, capturing, curating, and sharing metadata. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12859-018-2580-9) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-6322262 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-63222622019-01-09 OMeta: an ontology-based, data-driven metadata tracking system Singh, Indresh Kuscuoglu, Mehmet Harkins, Derek M. Sutton, Granger Fouts, Derrick E. Nelson, Karen E. BMC Bioinformatics Software BACKGROUND: The development of high-throughput sequencing and analysis has accelerated multi-omics studies of thousands of microbial species, metagenomes, and infectious disease pathogens. Omics studies are enabling genotype-phenotype association studies which identify genetic determinants of pathogen virulence and drug resistance, as well as phylogenetic studies designed to track the origin and spread of disease outbreaks. These omics studies are complex and often employ multiple assay technologies including genomics, metagenomics, transcriptomics, proteomics, and metabolomics. To maximize the impact of omics studies, it is essential that data be accompanied by detailed contextual metadata (e.g., specimen, spatial-temporal, phenotypic characteristics) in clear, organized, and consistent formats. Over the years, many metadata standards developed by various metadata standards initiatives have arisen; the Genomic Standards Consortium’s minimal information standards (MIxS), the GSCID/BRC Project and Sample Application Standard. Some tools exist for tracking metadata, but they do not provide event based capabilities to configure, collect, validate, and distribute metadata. To address this gap in the scientific community, an event based data-driven application, OMeta, was created that allows users to quickly configure, collect, validate, distribute, and integrate metadata. RESULTS: A data-driven web application, OMeta, has been developed for use by researchers consisting of a browser-based interface, a command-line interface (CLI), and server-side components that provide an intuitive platform for configuring, capturing, viewing, and sharing metadata. Project and sample metadata can be set based on existing standards or based on projects goals. Recorded information includes details on the biological samples, procedures, protocols, and experimental technologies, etc. This information can be organized based on events, including sample collection, sample quantification, sequencing assay, and analysis results. OMeta enables configuration in various presentation types: checkbox, file, drop-box, ontology, and fields can be configured to use the National Center for Biomedical Ontology (NCBO), a biomedical ontology server. Furthermore, OMeta maintains a complete audit trail of all changes made by users and allows metadata export in comma separated value (CSV) format for convenient deposition of data into public databases. CONCLUSIONS: We present, OMeta, a web-based software application that is built on data-driven principles for configuring and customizing data standards, capturing, curating, and sharing metadata. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12859-018-2580-9) contains supplementary material, which is available to authorized users. BioMed Central 2019-01-07 /pmc/articles/PMC6322262/ /pubmed/30612540 http://dx.doi.org/10.1186/s12859-018-2580-9 Text en © The Author(s). 2019 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Software Singh, Indresh Kuscuoglu, Mehmet Harkins, Derek M. Sutton, Granger Fouts, Derrick E. Nelson, Karen E. OMeta: an ontology-based, data-driven metadata tracking system |
title | OMeta: an ontology-based, data-driven metadata tracking system |
title_full | OMeta: an ontology-based, data-driven metadata tracking system |
title_fullStr | OMeta: an ontology-based, data-driven metadata tracking system |
title_full_unstemmed | OMeta: an ontology-based, data-driven metadata tracking system |
title_short | OMeta: an ontology-based, data-driven metadata tracking system |
title_sort | ometa: an ontology-based, data-driven metadata tracking system |
topic | Software |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6322262/ https://www.ncbi.nlm.nih.gov/pubmed/30612540 http://dx.doi.org/10.1186/s12859-018-2580-9 |
work_keys_str_mv | AT singhindresh ometaanontologybaseddatadrivenmetadatatrackingsystem AT kuscuoglumehmet ometaanontologybaseddatadrivenmetadatatrackingsystem AT harkinsderekm ometaanontologybaseddatadrivenmetadatatrackingsystem AT suttongranger ometaanontologybaseddatadrivenmetadatatrackingsystem AT foutsderricke ometaanontologybaseddatadrivenmetadatatrackingsystem AT nelsonkarene ometaanontologybaseddatadrivenmetadatatrackingsystem |