Cargando…

OMeta: an ontology-based, data-driven metadata tracking system

BACKGROUND: The development of high-throughput sequencing and analysis has accelerated multi-omics studies of thousands of microbial species, metagenomes, and infectious disease pathogens. Omics studies are enabling genotype-phenotype association studies which identify genetic determinants of pathog...

Descripción completa

Detalles Bibliográficos
Autores principales: Singh, Indresh, Kuscuoglu, Mehmet, Harkins, Derek M., Sutton, Granger, Fouts, Derrick E., Nelson, Karen E.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6322262/
https://www.ncbi.nlm.nih.gov/pubmed/30612540
http://dx.doi.org/10.1186/s12859-018-2580-9
_version_ 1783385584979410944
author Singh, Indresh
Kuscuoglu, Mehmet
Harkins, Derek M.
Sutton, Granger
Fouts, Derrick E.
Nelson, Karen E.
author_facet Singh, Indresh
Kuscuoglu, Mehmet
Harkins, Derek M.
Sutton, Granger
Fouts, Derrick E.
Nelson, Karen E.
author_sort Singh, Indresh
collection PubMed
description BACKGROUND: The development of high-throughput sequencing and analysis has accelerated multi-omics studies of thousands of microbial species, metagenomes, and infectious disease pathogens. Omics studies are enabling genotype-phenotype association studies which identify genetic determinants of pathogen virulence and drug resistance, as well as phylogenetic studies designed to track the origin and spread of disease outbreaks. These omics studies are complex and often employ multiple assay technologies including genomics, metagenomics, transcriptomics, proteomics, and metabolomics. To maximize the impact of omics studies, it is essential that data be accompanied by detailed contextual metadata (e.g., specimen, spatial-temporal, phenotypic characteristics) in clear, organized, and consistent formats. Over the years, many metadata standards developed by various metadata standards initiatives have arisen; the Genomic Standards Consortium’s minimal information standards (MIxS), the GSCID/BRC Project and Sample Application Standard. Some tools exist for tracking metadata, but they do not provide event based capabilities to configure, collect, validate, and distribute metadata. To address this gap in the scientific community, an event based data-driven application, OMeta, was created that allows users to quickly configure, collect, validate, distribute, and integrate metadata. RESULTS: A data-driven web application, OMeta, has been developed for use by researchers consisting of a browser-based interface, a command-line interface (CLI), and server-side components that provide an intuitive platform for configuring, capturing, viewing, and sharing metadata. Project and sample metadata can be set based on existing standards or based on projects goals. Recorded information includes details on the biological samples, procedures, protocols, and experimental technologies, etc. This information can be organized based on events, including sample collection, sample quantification, sequencing assay, and analysis results. OMeta enables configuration in various presentation types: checkbox, file, drop-box, ontology, and fields can be configured to use the National Center for Biomedical Ontology (NCBO), a biomedical ontology server. Furthermore, OMeta maintains a complete audit trail of all changes made by users and allows metadata export in comma separated value (CSV) format for convenient deposition of data into public databases. CONCLUSIONS: We present, OMeta, a web-based software application that is built on data-driven principles for configuring and customizing data standards, capturing, curating, and sharing metadata. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12859-018-2580-9) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-6322262
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-63222622019-01-09 OMeta: an ontology-based, data-driven metadata tracking system Singh, Indresh Kuscuoglu, Mehmet Harkins, Derek M. Sutton, Granger Fouts, Derrick E. Nelson, Karen E. BMC Bioinformatics Software BACKGROUND: The development of high-throughput sequencing and analysis has accelerated multi-omics studies of thousands of microbial species, metagenomes, and infectious disease pathogens. Omics studies are enabling genotype-phenotype association studies which identify genetic determinants of pathogen virulence and drug resistance, as well as phylogenetic studies designed to track the origin and spread of disease outbreaks. These omics studies are complex and often employ multiple assay technologies including genomics, metagenomics, transcriptomics, proteomics, and metabolomics. To maximize the impact of omics studies, it is essential that data be accompanied by detailed contextual metadata (e.g., specimen, spatial-temporal, phenotypic characteristics) in clear, organized, and consistent formats. Over the years, many metadata standards developed by various metadata standards initiatives have arisen; the Genomic Standards Consortium’s minimal information standards (MIxS), the GSCID/BRC Project and Sample Application Standard. Some tools exist for tracking metadata, but they do not provide event based capabilities to configure, collect, validate, and distribute metadata. To address this gap in the scientific community, an event based data-driven application, OMeta, was created that allows users to quickly configure, collect, validate, distribute, and integrate metadata. RESULTS: A data-driven web application, OMeta, has been developed for use by researchers consisting of a browser-based interface, a command-line interface (CLI), and server-side components that provide an intuitive platform for configuring, capturing, viewing, and sharing metadata. Project and sample metadata can be set based on existing standards or based on projects goals. Recorded information includes details on the biological samples, procedures, protocols, and experimental technologies, etc. This information can be organized based on events, including sample collection, sample quantification, sequencing assay, and analysis results. OMeta enables configuration in various presentation types: checkbox, file, drop-box, ontology, and fields can be configured to use the National Center for Biomedical Ontology (NCBO), a biomedical ontology server. Furthermore, OMeta maintains a complete audit trail of all changes made by users and allows metadata export in comma separated value (CSV) format for convenient deposition of data into public databases. CONCLUSIONS: We present, OMeta, a web-based software application that is built on data-driven principles for configuring and customizing data standards, capturing, curating, and sharing metadata. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12859-018-2580-9) contains supplementary material, which is available to authorized users. BioMed Central 2019-01-07 /pmc/articles/PMC6322262/ /pubmed/30612540 http://dx.doi.org/10.1186/s12859-018-2580-9 Text en © The Author(s). 2019 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Software
Singh, Indresh
Kuscuoglu, Mehmet
Harkins, Derek M.
Sutton, Granger
Fouts, Derrick E.
Nelson, Karen E.
OMeta: an ontology-based, data-driven metadata tracking system
title OMeta: an ontology-based, data-driven metadata tracking system
title_full OMeta: an ontology-based, data-driven metadata tracking system
title_fullStr OMeta: an ontology-based, data-driven metadata tracking system
title_full_unstemmed OMeta: an ontology-based, data-driven metadata tracking system
title_short OMeta: an ontology-based, data-driven metadata tracking system
title_sort ometa: an ontology-based, data-driven metadata tracking system
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6322262/
https://www.ncbi.nlm.nih.gov/pubmed/30612540
http://dx.doi.org/10.1186/s12859-018-2580-9
work_keys_str_mv AT singhindresh ometaanontologybaseddatadrivenmetadatatrackingsystem
AT kuscuoglumehmet ometaanontologybaseddatadrivenmetadatatrackingsystem
AT harkinsderekm ometaanontologybaseddatadrivenmetadatatrackingsystem
AT suttongranger ometaanontologybaseddatadrivenmetadatatrackingsystem
AT foutsderricke ometaanontologybaseddatadrivenmetadatatrackingsystem
AT nelsonkarene ometaanontologybaseddatadrivenmetadatatrackingsystem