Cargando…

Crowd-Sourced Chemistry: Considerations for Building a Standardized Database to Improve Omic Analyses

[Image: see text] Mass spectrometry (MS) is used in multiple omics disciplines to generate large collections of data. This data enables advancements in biomedical research by providing global profiles of a given system. One of the main barriers to generating these profiles is the inability to accura...

Descripción completa

Detalles Bibliográficos
Autores principales: Picache, Jaqueline A., May, Jody C., McLean, John A.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Chemical Society 2020
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6977078/
https://www.ncbi.nlm.nih.gov/pubmed/31984253
http://dx.doi.org/10.1021/acsomega.9b03708
_version_ 1783490433356136448
author Picache, Jaqueline A.
May, Jody C.
McLean, John A.
author_facet Picache, Jaqueline A.
May, Jody C.
McLean, John A.
author_sort Picache, Jaqueline A.
collection PubMed
description [Image: see text] Mass spectrometry (MS) is used in multiple omics disciplines to generate large collections of data. This data enables advancements in biomedical research by providing global profiles of a given system. One of the main barriers to generating these profiles is the inability to accurately annotate omics data, especially small molecules. To complement pre-existing large databases that are not quite complete, research groups devote efforts to generating personal libraries to annotate their data. Scientific progress is impeded during the generation of these personal libraries because the data contained within them is often redundant and/or incompatible with other databases. To overcome these redundancies and incompatibilities, we propose that communal, crowd-sourced databases be curated in a standardized fashion. A small number of groups have shown this model is feasible and successful. While the needs of a specific field will dictate the functionality of a communal database, we discuss some features to consider during database development. Special emphasis is made on standardization of terminology, documentation, format, reference materials, and quality assurance practices. These standardization procedures enable a field to have higher confidence in the quality of the data within a given database. We also discuss the three conceptual pillars of database design as well as how crowd-sourcing is practiced. Generating open-source databases requires front-end effort, but the result is a well curated, high quality data set that all can use. Having a resource such as this fosters collaboration and scientific advancement.
format Online
Article
Text
id pubmed-6977078
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher American Chemical Society
record_format MEDLINE/PubMed
spelling pubmed-69770782020-01-24 Crowd-Sourced Chemistry: Considerations for Building a Standardized Database to Improve Omic Analyses Picache, Jaqueline A. May, Jody C. McLean, John A. ACS Omega [Image: see text] Mass spectrometry (MS) is used in multiple omics disciplines to generate large collections of data. This data enables advancements in biomedical research by providing global profiles of a given system. One of the main barriers to generating these profiles is the inability to accurately annotate omics data, especially small molecules. To complement pre-existing large databases that are not quite complete, research groups devote efforts to generating personal libraries to annotate their data. Scientific progress is impeded during the generation of these personal libraries because the data contained within them is often redundant and/or incompatible with other databases. To overcome these redundancies and incompatibilities, we propose that communal, crowd-sourced databases be curated in a standardized fashion. A small number of groups have shown this model is feasible and successful. While the needs of a specific field will dictate the functionality of a communal database, we discuss some features to consider during database development. Special emphasis is made on standardization of terminology, documentation, format, reference materials, and quality assurance practices. These standardization procedures enable a field to have higher confidence in the quality of the data within a given database. We also discuss the three conceptual pillars of database design as well as how crowd-sourcing is practiced. Generating open-source databases requires front-end effort, but the result is a well curated, high quality data set that all can use. Having a resource such as this fosters collaboration and scientific advancement. American Chemical Society 2020-01-09 /pmc/articles/PMC6977078/ /pubmed/31984253 http://dx.doi.org/10.1021/acsomega.9b03708 Text en Copyright © 2020 American Chemical Society This is an open access article published under an ACS AuthorChoice License (http://pubs.acs.org/page/policy/authorchoice_termsofuse.html) , which permits copying and redistribution of the article or any adaptations for non-commercial purposes.
spellingShingle Picache, Jaqueline A.
May, Jody C.
McLean, John A.
Crowd-Sourced Chemistry: Considerations for Building a Standardized Database to Improve Omic Analyses
title Crowd-Sourced Chemistry: Considerations for Building a Standardized Database to Improve Omic Analyses
title_full Crowd-Sourced Chemistry: Considerations for Building a Standardized Database to Improve Omic Analyses
title_fullStr Crowd-Sourced Chemistry: Considerations for Building a Standardized Database to Improve Omic Analyses
title_full_unstemmed Crowd-Sourced Chemistry: Considerations for Building a Standardized Database to Improve Omic Analyses
title_short Crowd-Sourced Chemistry: Considerations for Building a Standardized Database to Improve Omic Analyses
title_sort crowd-sourced chemistry: considerations for building a standardized database to improve omic analyses
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6977078/
https://www.ncbi.nlm.nih.gov/pubmed/31984253
http://dx.doi.org/10.1021/acsomega.9b03708
work_keys_str_mv AT picachejaquelinea crowdsourcedchemistryconsiderationsforbuildingastandardizeddatabasetoimproveomicanalyses
AT mayjodyc crowdsourcedchemistryconsiderationsforbuildingastandardizeddatabasetoimproveomicanalyses
AT mcleanjohna crowdsourcedchemistryconsiderationsforbuildingastandardizeddatabasetoimproveomicanalyses