Cargando…

Open data products-A framework for creating valuable analysis ready data

This paper develops the notion of “open data product”. We define an open data product as the open result of the processes through which a variety of data (open and not) are turned into accessible information through a service, infrastructure, analytics or a combination of all of them, where each ste...

Descripción completa

Detalles Bibliográficos
Autores principales: Arribas-Bel, Dani, Green, Mark, Rowe, Francisco, Singleton, Alex
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer Berlin Heidelberg 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8528182/
https://www.ncbi.nlm.nih.gov/pubmed/34697537
http://dx.doi.org/10.1007/s10109-021-00363-5
_version_ 1784586206006214656
author Arribas-Bel, Dani
Green, Mark
Rowe, Francisco
Singleton, Alex
author_facet Arribas-Bel, Dani
Green, Mark
Rowe, Francisco
Singleton, Alex
author_sort Arribas-Bel, Dani
collection PubMed
description This paper develops the notion of “open data product”. We define an open data product as the open result of the processes through which a variety of data (open and not) are turned into accessible information through a service, infrastructure, analytics or a combination of all of them, where each step of development is designed to promote open principles. Open data products are born out of a (data) need and add value beyond simply publishing existing datasets. We argue that the process of adding value should adhere to the principles of open (geographic) data science, ensuring openness, transparency and reproducibility. We also contend that outreach, in the form of active communication and dissemination through dashboards, software and publication are key to engage end-users and ensure societal impact. Open data products have major benefits. First, they enable insights from highly sensitive, controlled and/or secure data which may not be accessible otherwise. Second, they can expand the use of commercial and administrative data for the public good leveraging on their high temporal frequency and geographic granularity. We also contend that there is a compelling need for open data products as we experience the current data revolution. New, emerging data sources are unprecedented in temporal frequency and geographical resolution, but they are large, unstructured, fragmented and often hard to access due to privacy and confidentiality concerns. By transforming raw (open or “closed”) data into ready to use open data products, new dimensions of human geographical processes can be captured and analysed, as we illustrate with existing examples. We conclude by arguing that several parallels exist between the role that open source software played in enabling research on spatial analysis in the 90 s and early 2000s, and the opportunities that open data products offer to unlock the potential of new forms of (geo-)data.
format Online
Article
Text
id pubmed-8528182
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Springer Berlin Heidelberg
record_format MEDLINE/PubMed
spelling pubmed-85281822021-10-21 Open data products-A framework for creating valuable analysis ready data Arribas-Bel, Dani Green, Mark Rowe, Francisco Singleton, Alex J Geogr Syst Original Article This paper develops the notion of “open data product”. We define an open data product as the open result of the processes through which a variety of data (open and not) are turned into accessible information through a service, infrastructure, analytics or a combination of all of them, where each step of development is designed to promote open principles. Open data products are born out of a (data) need and add value beyond simply publishing existing datasets. We argue that the process of adding value should adhere to the principles of open (geographic) data science, ensuring openness, transparency and reproducibility. We also contend that outreach, in the form of active communication and dissemination through dashboards, software and publication are key to engage end-users and ensure societal impact. Open data products have major benefits. First, they enable insights from highly sensitive, controlled and/or secure data which may not be accessible otherwise. Second, they can expand the use of commercial and administrative data for the public good leveraging on their high temporal frequency and geographic granularity. We also contend that there is a compelling need for open data products as we experience the current data revolution. New, emerging data sources are unprecedented in temporal frequency and geographical resolution, but they are large, unstructured, fragmented and often hard to access due to privacy and confidentiality concerns. By transforming raw (open or “closed”) data into ready to use open data products, new dimensions of human geographical processes can be captured and analysed, as we illustrate with existing examples. We conclude by arguing that several parallels exist between the role that open source software played in enabling research on spatial analysis in the 90 s and early 2000s, and the opportunities that open data products offer to unlock the potential of new forms of (geo-)data. Springer Berlin Heidelberg 2021-10-20 2021 /pmc/articles/PMC8528182/ /pubmed/34697537 http://dx.doi.org/10.1007/s10109-021-00363-5 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Original Article
Arribas-Bel, Dani
Green, Mark
Rowe, Francisco
Singleton, Alex
Open data products-A framework for creating valuable analysis ready data
title Open data products-A framework for creating valuable analysis ready data
title_full Open data products-A framework for creating valuable analysis ready data
title_fullStr Open data products-A framework for creating valuable analysis ready data
title_full_unstemmed Open data products-A framework for creating valuable analysis ready data
title_short Open data products-A framework for creating valuable analysis ready data
title_sort open data products-a framework for creating valuable analysis ready data
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8528182/
https://www.ncbi.nlm.nih.gov/pubmed/34697537
http://dx.doi.org/10.1007/s10109-021-00363-5
work_keys_str_mv AT arribasbeldani opendataproductsaframeworkforcreatingvaluableanalysisreadydata
AT greenmark opendataproductsaframeworkforcreatingvaluableanalysisreadydata
AT rowefrancisco opendataproductsaframeworkforcreatingvaluableanalysisreadydata
AT singletonalex opendataproductsaframeworkforcreatingvaluableanalysisreadydata