
CEDAR OnDemand: a browser extension to generate ontology-based scientific metadata

BACKGROUND: Public biomedical data repositories often provide web-based interfaces to collect experimental metadata. However, these interfaces typically reflect the ad hoc metadata specification practices of the associated repositories, leading to a lack of standardization in the collected metadata....


Bibliographic Details
Main Authors: Bukhari, Syed Ahmad Chan, Martínez-Romero, Marcos, O’ Connor, Martin J., Egyedi, Attila L., Willrett, Debra, Graybeal, John, Musen, Mark A., Cheung, Kei-Hoi, Kleinstein, Steven H.
Format: Online Article Text
Language: English
Published: BioMed Central 2018
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6048706/
https://www.ncbi.nlm.nih.gov/pubmed/30012108
http://dx.doi.org/10.1186/s12859-018-2247-6
_version_ 1783340144363831296
author Bukhari, Syed Ahmad Chan
Martínez-Romero, Marcos
O’ Connor, Martin J.
Egyedi, Attila L.
Willrett, Debra
Graybeal, John
Musen, Mark A.
Cheung, Kei-Hoi
Kleinstein, Steven H.
author_facet Bukhari, Syed Ahmad Chan
Martínez-Romero, Marcos
O’ Connor, Martin J.
Egyedi, Attila L.
Willrett, Debra
Graybeal, John
Musen, Mark A.
Cheung, Kei-Hoi
Kleinstein, Steven H.
author_sort Bukhari, Syed Ahmad Chan
collection PubMed
description BACKGROUND: Public biomedical data repositories often provide web-based interfaces to collect experimental metadata. However, these interfaces typically reflect the ad hoc metadata specification practices of the associated repositories, leading to a lack of standardization in the collected metadata. This lack of standardization limits the ability of the source datasets to be broadly discovered, reused, and integrated with other datasets. To increase reuse, discoverability, and reproducibility of the described experiments, datasets should be appropriately annotated by using agreed-upon terms, ideally from ontologies or other controlled term sources. RESULTS: This work presents “CEDAR OnDemand”, a browser extension powered by the NCBO (National Center for Biomedical Ontology) BioPortal that enables users to seamlessly enter ontology-based metadata through existing web forms native to individual repositories. CEDAR OnDemand analyzes the web page contents to identify the text input fields and associate them with relevant ontologies which are recommended automatically based upon input fields’ labels (using the NCBO ontology recommender) and a pre-defined list of ontologies. These field-specific ontologies are used for controlling metadata entry. CEDAR OnDemand works for any web form designed in the HTML format. We demonstrate how CEDAR OnDemand works through the NCBI (National Center for Biotechnology Information) BioSample web-based metadata entry. CONCLUSION: CEDAR OnDemand helps lower the barrier of incorporating ontologies into standardized metadata entry for public data repositories. CEDAR OnDemand is available freely on the Google Chrome store https://chrome.google.com/webstore/search/CEDAROnDemand
format Online
Article
Text
id pubmed-6048706
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-60487062018-07-19 CEDAR OnDemand: a browser extension to generate ontology-based scientific metadata Bukhari, Syed Ahmad Chan Martínez-Romero, Marcos O’ Connor, Martin J. Egyedi, Attila L. Willrett, Debra Graybeal, John Musen, Mark A. Cheung, Kei-Hoi Kleinstein, Steven H. BMC Bioinformatics Software BACKGROUND: Public biomedical data repositories often provide web-based interfaces to collect experimental metadata. However, these interfaces typically reflect the ad hoc metadata specification practices of the associated repositories, leading to a lack of standardization in the collected metadata. This lack of standardization limits the ability of the source datasets to be broadly discovered, reused, and integrated with other datasets. To increase reuse, discoverability, and reproducibility of the described experiments, datasets should be appropriately annotated by using agreed-upon terms, ideally from ontologies or other controlled term sources. RESULTS: This work presents “CEDAR OnDemand”, a browser extension powered by the NCBO (National Center for Biomedical Ontology) BioPortal that enables users to seamlessly enter ontology-based metadata through existing web forms native to individual repositories. CEDAR OnDemand analyzes the web page contents to identify the text input fields and associate them with relevant ontologies which are recommended automatically based upon input fields’ labels (using the NCBO ontology recommender) and a pre-defined list of ontologies. These field-specific ontologies are used for controlling metadata entry. CEDAR OnDemand works for any web form designed in the HTML format. We demonstrate how CEDAR OnDemand works through the NCBI (National Center for Biotechnology Information) BioSample web-based metadata entry. CONCLUSION: CEDAR OnDemand helps lower the barrier of incorporating ontologies into standardized metadata entry for public data repositories. 
CEDAR OnDemand is available freely on the Google Chrome store https://chrome.google.com/webstore/search/CEDAROnDemand BioMed Central 2018-07-16 /pmc/articles/PMC6048706/ /pubmed/30012108 http://dx.doi.org/10.1186/s12859-018-2247-6 Text en © The Author(s). 2018 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Software
Bukhari, Syed Ahmad Chan
Martínez-Romero, Marcos
O’ Connor, Martin J.
Egyedi, Attila L.
Willrett, Debra
Graybeal, John
Musen, Mark A.
Cheung, Kei-Hoi
Kleinstein, Steven H.
CEDAR OnDemand: a browser extension to generate ontology-based scientific metadata
title CEDAR OnDemand: a browser extension to generate ontology-based scientific metadata
title_full CEDAR OnDemand: a browser extension to generate ontology-based scientific metadata
title_fullStr CEDAR OnDemand: a browser extension to generate ontology-based scientific metadata
title_full_unstemmed CEDAR OnDemand: a browser extension to generate ontology-based scientific metadata
title_short CEDAR OnDemand: a browser extension to generate ontology-based scientific metadata
title_sort cedar ondemand: a browser extension to generate ontology-based scientific metadata
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6048706/
https://www.ncbi.nlm.nih.gov/pubmed/30012108
http://dx.doi.org/10.1186/s12859-018-2247-6
work_keys_str_mv AT bukharisyedahmadchan cedarondemandabrowserextensiontogenerateontologybasedscientificmetadata
AT martinezromeromarcos cedarondemandabrowserextensiontogenerateontologybasedscientificmetadata
AT oconnormartinj cedarondemandabrowserextensiontogenerateontologybasedscientificmetadata
AT egyediattilal cedarondemandabrowserextensiontogenerateontologybasedscientificmetadata
AT willrettdebra cedarondemandabrowserextensiontogenerateontologybasedscientificmetadata
AT graybealjohn cedarondemandabrowserextensiontogenerateontologybasedscientificmetadata
AT musenmarka cedarondemandabrowserextensiontogenerateontologybasedscientificmetadata
AT cheungkeihoi cedarondemandabrowserextensiontogenerateontologybasedscientificmetadata
AT kleinsteinstevenh cedarondemandabrowserextensiontogenerateontologybasedscientificmetadata
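
The RESULTS part of the description field says that the extension scans a web page for text input fields and associates each one with ontologies chosen from the field's label and a pre-defined list. The sketch below illustrates that idea only and is not the authors' implementation: it uses Python's standard `html.parser` rather than browser extension code, it does not call the NCBO ontology recommender, and the names `FormFieldCollector`, `PREDEFINED_ONTOLOGIES`, and `suggest_ontologies`, along with the keyword-to-ontology mapping, are assumptions made for illustration.

```python
from html.parser import HTMLParser


class FormFieldCollector(HTMLParser):
    """Collect text inputs and the <label for=...> text that names them."""

    def __init__(self):
        super().__init__()
        self.text_input_ids = []   # ids of text inputs, in document order
        self.labels = {}           # input id -> accumulated label text
        self._current_for = None   # id targeted by the <label> being read

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "label" and attrs.get("for"):
            self._current_for = attrs["for"]
            self.labels[self._current_for] = ""
        elif tag == "input" and attrs.get("id"):
            # An <input> without a type attribute defaults to a text field.
            if attrs.get("type", "text") == "text":
                self.text_input_ids.append(attrs["id"])

    def handle_endtag(self, tag):
        if tag == "label":
            self._current_for = None

    def handle_data(self, data):
        if self._current_for is not None:
            self.labels[self._current_for] += data


# Hypothetical pre-defined list: label keyword -> ontology acronym.
# The mapping is illustrative and is not taken from the article.
PREDEFINED_ONTOLOGIES = {
    "organism": "NCBITAXON",
    "tissue": "UBERON",
    "disease": "DOID",
}


def suggest_ontologies(html):
    """Return {input id: [ontology acronyms]} for each text input in html."""
    collector = FormFieldCollector()
    collector.feed(html)
    suggestions = {}
    for field_id in collector.text_input_ids:
        # Normalize whitespace and case before keyword matching.
        label = " ".join(collector.labels.get(field_id, "").split()).lower()
        suggestions[field_id] = [
            acronym
            for keyword, acronym in PREDEFINED_ONTOLOGIES.items()
            if keyword in label
        ]
    return suggestions
```

In a real extension, the keyword lookup would presumably be replaced or supplemented by a request to the ontology recommender service, and the suggested ontologies would then constrain the values offered while the user types into each field.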