Cargando…
Open data products-A framework for creating valuable analysis ready data
This paper develops the notion of “open data product”. We define an open data product as the open result of the processes through which a variety of data (open and not) are turned into accessible information through a service, infrastructure, analytics or a combination of all of them, where each ste...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Springer Berlin Heidelberg
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8528182/ https://www.ncbi.nlm.nih.gov/pubmed/34697537 http://dx.doi.org/10.1007/s10109-021-00363-5 |
_version_ | 1784586206006214656 |
---|---|
author | Arribas-Bel, Dani Green, Mark Rowe, Francisco Singleton, Alex |
author_facet | Arribas-Bel, Dani Green, Mark Rowe, Francisco Singleton, Alex |
author_sort | Arribas-Bel, Dani |
collection | PubMed |
description | This paper develops the notion of “open data product”. We define an open data product as the open result of the processes through which a variety of data (open and not) are turned into accessible information through a service, infrastructure, analytics or a combination of all of them, where each step of development is designed to promote open principles. Open data products are born out of a (data) need and add value beyond simply publishing existing datasets. We argue that the process of adding value should adhere to the principles of open (geographic) data science, ensuring openness, transparency and reproducibility. We also contend that outreach, in the form of active communication and dissemination through dashboards, software and publication are key to engage end-users and ensure societal impact. Open data products have major benefits. First, they enable insights from highly sensitive, controlled and/or secure data which may not be accessible otherwise. Second, they can expand the use of commercial and administrative data for the public good leveraging on their high temporal frequency and geographic granularity. We also contend that there is a compelling need for open data products as we experience the current data revolution. New, emerging data sources are unprecedented in temporal frequency and geographical resolution, but they are large, unstructured, fragmented and often hard to access due to privacy and confidentiality concerns. By transforming raw (open or “closed”) data into ready to use open data products, new dimensions of human geographical processes can be captured and analysed, as we illustrate with existing examples. We conclude by arguing that several parallels exist between the role that open source software played in enabling research on spatial analysis in the 90 s and early 2000s, and the opportunities that open data products offer to unlock the potential of new forms of (geo-)data. |
format | Online Article Text |
id | pubmed-8528182 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Springer Berlin Heidelberg |
record_format | MEDLINE/PubMed |
spelling | pubmed-85281822021-10-21 Open data products-A framework for creating valuable analysis ready data Arribas-Bel, Dani Green, Mark Rowe, Francisco Singleton, Alex J Geogr Syst Original Article This paper develops the notion of “open data product”. We define an open data product as the open result of the processes through which a variety of data (open and not) are turned into accessible information through a service, infrastructure, analytics or a combination of all of them, where each step of development is designed to promote open principles. Open data products are born out of a (data) need and add value beyond simply publishing existing datasets. We argue that the process of adding value should adhere to the principles of open (geographic) data science, ensuring openness, transparency and reproducibility. We also contend that outreach, in the form of active communication and dissemination through dashboards, software and publication are key to engage end-users and ensure societal impact. Open data products have major benefits. First, they enable insights from highly sensitive, controlled and/or secure data which may not be accessible otherwise. Second, they can expand the use of commercial and administrative data for the public good leveraging on their high temporal frequency and geographic granularity. We also contend that there is a compelling need for open data products as we experience the current data revolution. New, emerging data sources are unprecedented in temporal frequency and geographical resolution, but they are large, unstructured, fragmented and often hard to access due to privacy and confidentiality concerns. By transforming raw (open or “closed”) data into ready to use open data products, new dimensions of human geographical processes can be captured and analysed, as we illustrate with existing examples. We conclude by arguing that several parallels exist between the role that open source software played in enabling research on spatial analysis in the 90 s and early 2000s, and the opportunities that open data products offer to unlock the potential of new forms of (geo-)data. Springer Berlin Heidelberg 2021-10-20 2021 /pmc/articles/PMC8528182/ /pubmed/34697537 http://dx.doi.org/10.1007/s10109-021-00363-5 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Original Article Arribas-Bel, Dani Green, Mark Rowe, Francisco Singleton, Alex Open data products-A framework for creating valuable analysis ready data |
title | Open data products-A framework for creating valuable analysis ready data |
title_full | Open data products-A framework for creating valuable analysis ready data |
title_fullStr | Open data products-A framework for creating valuable analysis ready data |
title_full_unstemmed | Open data products-A framework for creating valuable analysis ready data |
title_short | Open data products-A framework for creating valuable analysis ready data |
title_sort | open data products-a framework for creating valuable analysis ready data |
topic | Original Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8528182/ https://www.ncbi.nlm.nih.gov/pubmed/34697537 http://dx.doi.org/10.1007/s10109-021-00363-5 |
work_keys_str_mv | AT arribasbeldani opendataproductsaframeworkforcreatingvaluableanalysisreadydata AT greenmark opendataproductsaframeworkforcreatingvaluableanalysisreadydata AT rowefrancisco opendataproductsaframeworkforcreatingvaluableanalysisreadydata AT singletonalex opendataproductsaframeworkforcreatingvaluableanalysisreadydata |