Cargando…

Findability of UK health datasets available for research: a mixed methods study

OBJECTIVE: How health researchers find secondary data to analyse is unclear. We sought to describe the approaches that UK organisations take to help researchers find data and to assess the findability of health data that are available for research. METHODS: We surveyed established organisations abou...

Descripción completa

Detalles Bibliográficos
Autores principales: Griffiths, Emily, Joseph, Rebecca M, Tilston, George, Thew, Sarah, Kapacee, Zoher, Dixon, William, Peek, Niels
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BMJ Publishing Group 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8867248/
https://www.ncbi.nlm.nih.gov/pubmed/35193857
http://dx.doi.org/10.1136/bmjhci-2021-100325
_version_ 1784656013777960960
author Griffiths, Emily
Joseph, Rebecca M
Tilston, George
Thew, Sarah
Kapacee, Zoher
Dixon, William
Peek, Niels
author_facet Griffiths, Emily
Joseph, Rebecca M
Tilston, George
Thew, Sarah
Kapacee, Zoher
Dixon, William
Peek, Niels
author_sort Griffiths, Emily
collection PubMed
description OBJECTIVE: How health researchers find secondary data to analyse is unclear. We sought to describe the approaches that UK organisations take to help researchers find data and to assess the findability of health data that are available for research. METHODS: We surveyed established organisations about how they make data findable. We derived measures of findability based on the first element of the FAIR principles (Findable, Accessible, Interoperable, Reproducible). We applied these to 13 UK health datasets and measured their findability via two major internet search engines in 2018 and repeated in 2021. RESULTS: Among 12 survey respondents, 11 indicated that they made metadata publicly available. Respondents said internet presence was important for findability, but that this needed improvement. In 2018, 8 out of 13 datasets were listed in the top 100 search results of 10 searches repeated on both search engines, while the remaining 5 were found one click away from those search results. In 2021, this had reduced to seven datasets directly listed and one dataset one click away. In 2021, Google Dataset Search had become available, which listed 3 of the 13 datasets within the top 100 search results. DISCUSSION: Measuring findability via online search engines is one method for evaluating efforts to improve findability. Findability could perhaps be improved with catalogues that have greater inclusion of datasets, field-level metadata and persistent identifiers. CONCLUSION: UK organisations recognised the importance of the internet for finding data for research. However, health datasets available for research were no more findable in 2021 than in 2018.
format Online
Article
Text
id pubmed-8867248
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher BMJ Publishing Group
record_format MEDLINE/PubMed
spelling pubmed-88672482022-03-15 Findability of UK health datasets available for research: a mixed methods study Griffiths, Emily Joseph, Rebecca M Tilston, George Thew, Sarah Kapacee, Zoher Dixon, William Peek, Niels BMJ Health Care Inform Original Research OBJECTIVE: How health researchers find secondary data to analyse is unclear. We sought to describe the approaches that UK organisations take to help researchers find data and to assess the findability of health data that are available for research. METHODS: We surveyed established organisations about how they make data findable. We derived measures of findability based on the first element of the FAIR principles (Findable, Accessible, Interoperable, Reproducible). We applied these to 13 UK health datasets and measured their findability via two major internet search engines in 2018 and repeated in 2021. RESULTS: Among 12 survey respondents, 11 indicated that they made metadata publicly available. Respondents said internet presence was important for findability, but that this needed improvement. In 2018, 8 out of 13 datasets were listed in the top 100 search results of 10 searches repeated on both search engines, while the remaining 5 were found one click away from those search results. In 2021, this had reduced to seven datasets directly listed and one dataset one click away. In 2021, Google Dataset Search had become available, which listed 3 of the 13 datasets within the top 100 search results. DISCUSSION: Measuring findability via online search engines is one method for evaluating efforts to improve findability. Findability could perhaps be improved with catalogues that have greater inclusion of datasets, field-level metadata and persistent identifiers. CONCLUSION: UK organisations recognised the importance of the internet for finding data for research. However, health datasets available for research were no more findable in 2021 than in 2018. BMJ Publishing Group 2022-02-22 /pmc/articles/PMC8867248/ /pubmed/35193857 http://dx.doi.org/10.1136/bmjhci-2021-100325 Text en © Author(s) (or their employer(s)) 2022. Re-use permitted under CC BY-NC. No commercial re-use. See rights and permissions. Published by BMJ. https://creativecommons.org/licenses/by-nc/4.0/This is an open access article distributed in accordance with the Creative Commons Attribution Non Commercial (CC BY-NC 4.0) license, which permits others to distribute, remix, adapt, build upon this work non-commercially, and license their derivative works on different terms, provided the original work is properly cited, appropriate credit is given, any changes made indicated, and the use is non-commercial. See: http://creativecommons.org/licenses/by-nc/4.0/ (https://creativecommons.org/licenses/by-nc/4.0/) .
spellingShingle Original Research
Griffiths, Emily
Joseph, Rebecca M
Tilston, George
Thew, Sarah
Kapacee, Zoher
Dixon, William
Peek, Niels
Findability of UK health datasets available for research: a mixed methods study
title Findability of UK health datasets available for research: a mixed methods study
title_full Findability of UK health datasets available for research: a mixed methods study
title_fullStr Findability of UK health datasets available for research: a mixed methods study
title_full_unstemmed Findability of UK health datasets available for research: a mixed methods study
title_short Findability of UK health datasets available for research: a mixed methods study
title_sort findability of uk health datasets available for research: a mixed methods study
topic Original Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8867248/
https://www.ncbi.nlm.nih.gov/pubmed/35193857
http://dx.doi.org/10.1136/bmjhci-2021-100325
work_keys_str_mv AT griffithsemily findabilityofukhealthdatasetsavailableforresearchamixedmethodsstudy
AT josephrebeccam findabilityofukhealthdatasetsavailableforresearchamixedmethodsstudy
AT tilstongeorge findabilityofukhealthdatasetsavailableforresearchamixedmethodsstudy
AT thewsarah findabilityofukhealthdatasetsavailableforresearchamixedmethodsstudy
AT kapaceezoher findabilityofukhealthdatasetsavailableforresearchamixedmethodsstudy
AT dixonwilliam findabilityofukhealthdatasetsavailableforresearchamixedmethodsstudy
AT peekniels findabilityofukhealthdatasetsavailableforresearchamixedmethodsstudy