Cargando…

Data discovery with DATS: exemplar adoptions and lessons learned

The DAta Tag Suite (DATS) is a model supporting dataset description, indexing, and discovery. It is available as an annotated serialization with schema.org, a vocabulary used by major search engines, thus making the datasets discoverable on the web. DATS underlies DataMed, the National Institutes of...

Descripción completa

Detalles Bibliográficos
Autores principales: Gonzalez-Beltran, Alejandra N, Campbell, John, Dunn, Patrick, Guijarro, Diana, Ionescu, Sanda, Kim, Hyeoneui, Lyle, Jared, Wiser, Jeffrey, Sansone, Susanna-Assunta, Rocca-Serra, Philippe
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6481379/
https://www.ncbi.nlm.nih.gov/pubmed/29228196
http://dx.doi.org/10.1093/jamia/ocx119
_version_ 1783413758371037184
author Gonzalez-Beltran, Alejandra N
Campbell, John
Dunn, Patrick
Guijarro, Diana
Ionescu, Sanda
Kim, Hyeoneui
Lyle, Jared
Wiser, Jeffrey
Sansone, Susanna-Assunta
Rocca-Serra, Philippe
author_facet Gonzalez-Beltran, Alejandra N
Campbell, John
Dunn, Patrick
Guijarro, Diana
Ionescu, Sanda
Kim, Hyeoneui
Lyle, Jared
Wiser, Jeffrey
Sansone, Susanna-Assunta
Rocca-Serra, Philippe
author_sort Gonzalez-Beltran, Alejandra N
collection PubMed
description The DAta Tag Suite (DATS) is a model supporting dataset description, indexing, and discovery. It is available as an annotated serialization with schema.org, a vocabulary used by major search engines, thus making the datasets discoverable on the web. DATS underlies DataMed, the National Institutes of Health Big Data to Knowledge Data Discovery Index prototype, which aims to provide a “PubMed for datasets.” The experience gained while indexing a heterogeneous range of >60 repositories in DataMed helped in evaluating DATS’s entities, attributes, and scope. In this work, 3 additional exemplary and diverse data sources were mapped to DATS by their representatives or experts, offering a deep scan of DATS fitness against a new set of existing data. The procedure, including feedback from users and implementers, resulted in DATS implementation guidelines and best practices, and identification of a path for evolving and optimizing the model. Finally, the work exposed additional needs when defining datasets for indexing, especially in the context of clinical and observational information.
format Online
Article
Text
id pubmed-6481379
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-64813792019-04-26 Data discovery with DATS: exemplar adoptions and lessons learned Gonzalez-Beltran, Alejandra N Campbell, John Dunn, Patrick Guijarro, Diana Ionescu, Sanda Kim, Hyeoneui Lyle, Jared Wiser, Jeffrey Sansone, Susanna-Assunta Rocca-Serra, Philippe J Am Med Inform Assoc Brief Communication The DAta Tag Suite (DATS) is a model supporting dataset description, indexing, and discovery. It is available as an annotated serialization with schema.org, a vocabulary used by major search engines, thus making the datasets discoverable on the web. DATS underlies DataMed, the National Institutes of Health Big Data to Knowledge Data Discovery Index prototype, which aims to provide a “PubMed for datasets.” The experience gained while indexing a heterogeneous range of >60 repositories in DataMed helped in evaluating DATS’s entities, attributes, and scope. In this work, 3 additional exemplary and diverse data sources were mapped to DATS by their representatives or experts, offering a deep scan of DATS fitness against a new set of existing data. The procedure, including feedback from users and implementers, resulted in DATS implementation guidelines and best practices, and identification of a path for evolving and optimizing the model. Finally, the work exposed additional needs when defining datasets for indexing, especially in the context of clinical and observational information. Oxford University Press 2017-12-08 /pmc/articles/PMC6481379/ /pubmed/29228196 http://dx.doi.org/10.1093/jamia/ocx119 Text en © The Author 2017. Published by Oxford University Press on behalf of the American Medical Informatics Association. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Brief Communication
Gonzalez-Beltran, Alejandra N
Campbell, John
Dunn, Patrick
Guijarro, Diana
Ionescu, Sanda
Kim, Hyeoneui
Lyle, Jared
Wiser, Jeffrey
Sansone, Susanna-Assunta
Rocca-Serra, Philippe
Data discovery with DATS: exemplar adoptions and lessons learned
title Data discovery with DATS: exemplar adoptions and lessons learned
title_full Data discovery with DATS: exemplar adoptions and lessons learned
title_fullStr Data discovery with DATS: exemplar adoptions and lessons learned
title_full_unstemmed Data discovery with DATS: exemplar adoptions and lessons learned
title_short Data discovery with DATS: exemplar adoptions and lessons learned
title_sort data discovery with dats: exemplar adoptions and lessons learned
topic Brief Communication
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6481379/
https://www.ncbi.nlm.nih.gov/pubmed/29228196
http://dx.doi.org/10.1093/jamia/ocx119
work_keys_str_mv AT gonzalezbeltranalejandran datadiscoverywithdatsexemplaradoptionsandlessonslearned
AT campbelljohn datadiscoverywithdatsexemplaradoptionsandlessonslearned
AT dunnpatrick datadiscoverywithdatsexemplaradoptionsandlessonslearned
AT guijarrodiana datadiscoverywithdatsexemplaradoptionsandlessonslearned
AT ionescusanda datadiscoverywithdatsexemplaradoptionsandlessonslearned
AT kimhyeoneui datadiscoverywithdatsexemplaradoptionsandlessonslearned
AT lylejared datadiscoverywithdatsexemplaradoptionsandlessonslearned
AT wiserjeffrey datadiscoverywithdatsexemplaradoptionsandlessonslearned
AT sansonesusannaassunta datadiscoverywithdatsexemplaradoptionsandlessonslearned
AT roccaserraphilippe datadiscoverywithdatsexemplaradoptionsandlessonslearned