Cargando…

The NYU Data Catalog: a modular, flexible infrastructure for data discovery

OBJECTIVE: Researchers at New York University (NYU) Grossman School of Medicine contacted the Health Sciences Library for help with locating large datasets for reuse. In response, the library developed and maintained the NYU Data Catalog, a public-facing data catalog that has supported not only facu...

Descripción completa

Detalles Bibliográficos
Autores principales: Yee, Michelle, Surkis, Alisa, Lamb, Ian, Contaxis, Nicole
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10531119/
https://www.ncbi.nlm.nih.gov/pubmed/37414539
http://dx.doi.org/10.1093/jamia/ocad125
_version_ 1785111643611463680
author Yee, Michelle
Surkis, Alisa
Lamb, Ian
Contaxis, Nicole
author_facet Yee, Michelle
Surkis, Alisa
Lamb, Ian
Contaxis, Nicole
author_sort Yee, Michelle
collection PubMed
description OBJECTIVE: Researchers at New York University (NYU) Grossman School of Medicine contacted the Health Sciences Library for help with locating large datasets for reuse. In response, the library developed and maintained the NYU Data Catalog, a public-facing data catalog that has supported not only faculty acquisition of data but also the dissemination of the products of their research in various ways. MATERIALS AND METHODS: The current NYU Data Catalog is built upon the Symfony framework with a tailored metadata schema reflecting the scope of faculty research areas. The project team curates new resources, including datasets and supporting software code, and conducts quarterly and annual evaluations to assess user interactions with the NYU Data Catalog and opportunities for growth. RESULTS: Since its launch in 2015, the NYU Data Catalog underwent a number of changes prompted by an increase in the disciplines represented by faculty contributors. The catalog has also utilized faculty feedback to enhance support of data reuse and researcher collaboration through alterations to its schema, layout, and visibility of records. DISCUSSION: These findings demonstrate the flexibility of data catalogs as a platform for enabling the discovery of disparate sources of data. While not a repository, the NYU Data Catalog is well-positioned to support mandates for data sharing from study sponsors and publishers. CONCLUSION: The NYU Data Catalog makes the most of the data that researchers share and can be harnessed as a modular and adaptable platform to promote data sharing as a cultural practice.
format Online
Article
Text
id pubmed-10531119
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-105311192023-09-28 The NYU Data Catalog: a modular, flexible infrastructure for data discovery Yee, Michelle Surkis, Alisa Lamb, Ian Contaxis, Nicole J Am Med Inform Assoc Research and Applications OBJECTIVE: Researchers at New York University (NYU) Grossman School of Medicine contacted the Health Sciences Library for help with locating large datasets for reuse. In response, the library developed and maintained the NYU Data Catalog, a public-facing data catalog that has supported not only faculty acquisition of data but also the dissemination of the products of their research in various ways. MATERIALS AND METHODS: The current NYU Data Catalog is built upon the Symfony framework with a tailored metadata schema reflecting the scope of faculty research areas. The project team curates new resources, including datasets and supporting software code, and conducts quarterly and annual evaluations to assess user interactions with the NYU Data Catalog and opportunities for growth. RESULTS: Since its launch in 2015, the NYU Data Catalog underwent a number of changes prompted by an increase in the disciplines represented by faculty contributors. The catalog has also utilized faculty feedback to enhance support of data reuse and researcher collaboration through alterations to its schema, layout, and visibility of records. DISCUSSION: These findings demonstrate the flexibility of data catalogs as a platform for enabling the discovery of disparate sources of data. While not a repository, the NYU Data Catalog is well-positioned to support mandates for data sharing from study sponsors and publishers. CONCLUSION: The NYU Data Catalog makes the most of the data that researchers share and can be harnessed as a modular and adaptable platform to promote data sharing as a cultural practice. Oxford University Press 2023-07-06 /pmc/articles/PMC10531119/ /pubmed/37414539 http://dx.doi.org/10.1093/jamia/ocad125 Text en © The Author(s) 2023. Published by Oxford University Press on behalf of the American Medical Informatics Association. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Research and Applications
Yee, Michelle
Surkis, Alisa
Lamb, Ian
Contaxis, Nicole
The NYU Data Catalog: a modular, flexible infrastructure for data discovery
title The NYU Data Catalog: a modular, flexible infrastructure for data discovery
title_full The NYU Data Catalog: a modular, flexible infrastructure for data discovery
title_fullStr The NYU Data Catalog: a modular, flexible infrastructure for data discovery
title_full_unstemmed The NYU Data Catalog: a modular, flexible infrastructure for data discovery
title_short The NYU Data Catalog: a modular, flexible infrastructure for data discovery
title_sort nyu data catalog: a modular, flexible infrastructure for data discovery
topic Research and Applications
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10531119/
https://www.ncbi.nlm.nih.gov/pubmed/37414539
http://dx.doi.org/10.1093/jamia/ocad125
work_keys_str_mv AT yeemichelle thenyudatacatalogamodularflexibleinfrastructurefordatadiscovery
AT surkisalisa thenyudatacatalogamodularflexibleinfrastructurefordatadiscovery
AT lambian thenyudatacatalogamodularflexibleinfrastructurefordatadiscovery
AT contaxisnicole thenyudatacatalogamodularflexibleinfrastructurefordatadiscovery
AT yeemichelle nyudatacatalogamodularflexibleinfrastructurefordatadiscovery
AT surkisalisa nyudatacatalogamodularflexibleinfrastructurefordatadiscovery
AT lambian nyudatacatalogamodularflexibleinfrastructurefordatadiscovery
AT contaxisnicole nyudatacatalogamodularflexibleinfrastructurefordatadiscovery