Cargando…

Data Governance as Success Factor for Data Science

More and more, asset management organizations are introducing data science initiatives to support predictive maintenance and anomaly detection. Asset management organizations are by nature data intensive to manage their assets like bridges, dykes, railways and roads. For this, they often implement d...

Descripción completa

Detalles Bibliográficos
Autores principales: Brous, Paul, Janssen, Marijn, Krans, Rutger
Formato: Online Artículo Texto
Lenguaje:English
Publicado: 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7134294/
http://dx.doi.org/10.1007/978-3-030-44999-5_36
_version_ 1783517810315493376
author Brous, Paul
Janssen, Marijn
Krans, Rutger
author_facet Brous, Paul
Janssen, Marijn
Krans, Rutger
author_sort Brous, Paul
collection PubMed
description More and more, asset management organizations are introducing data science initiatives to support predictive maintenance and anomaly detection. Asset management organizations are by nature data intensive to manage their assets like bridges, dykes, railways and roads. For this, they often implement data lakes using a variety of architectures and technologies to store big data and facilitate data science initiatives. However, the decision-outcomes of data science models are often highly reliant on the quality of the data. The data in the data lake therefore has to be of sufficient quality to develop trust by decision-makers. Not surprisingly, organizations are increasingly adopting data governance as a means to ensure that the quality of data entering the data lake is and remains of sufficient quality, and to ensure the organization remains legally compliant. The objective of the case study is to understand the role of data governance as success factor for data science. For this, a case study regarding the governance of data in a data lake in the asset management domain is analyzed to test three propositions contributing to the success of using data science. The results show that unambiguous ownership of the data, monitoring the quality of the data entering the data lake, and a controlled overview of standard and specific compliance requirements are important factors for maintaining data quality and compliance and building trust in data science products.
format Online
Article
Text
id pubmed-7134294
institution National Center for Biotechnology Information
language English
publishDate 2020
record_format MEDLINE/PubMed
spelling pubmed-71342942020-04-06 Data Governance as Success Factor for Data Science Brous, Paul Janssen, Marijn Krans, Rutger Responsible Design, Implementation and Use of Information and Communication Technology Article More and more, asset management organizations are introducing data science initiatives to support predictive maintenance and anomaly detection. Asset management organizations are by nature data intensive to manage their assets like bridges, dykes, railways and roads. For this, they often implement data lakes using a variety of architectures and technologies to store big data and facilitate data science initiatives. However, the decision-outcomes of data science models are often highly reliant on the quality of the data. The data in the data lake therefore has to be of sufficient quality to develop trust by decision-makers. Not surprisingly, organizations are increasingly adopting data governance as a means to ensure that the quality of data entering the data lake is and remains of sufficient quality, and to ensure the organization remains legally compliant. The objective of the case study is to understand the role of data governance as success factor for data science. For this, a case study regarding the governance of data in a data lake in the asset management domain is analyzed to test three propositions contributing to the success of using data science. The results show that unambiguous ownership of the data, monitoring the quality of the data entering the data lake, and a controlled overview of standard and specific compliance requirements are important factors for maintaining data quality and compliance and building trust in data science products. 2020-03-06 /pmc/articles/PMC7134294/ http://dx.doi.org/10.1007/978-3-030-44999-5_36 Text en © IFIP International Federation for Information Processing 2020 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic.
spellingShingle Article
Brous, Paul
Janssen, Marijn
Krans, Rutger
Data Governance as Success Factor for Data Science
title Data Governance as Success Factor for Data Science
title_full Data Governance as Success Factor for Data Science
title_fullStr Data Governance as Success Factor for Data Science
title_full_unstemmed Data Governance as Success Factor for Data Science
title_short Data Governance as Success Factor for Data Science
title_sort data governance as success factor for data science
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7134294/
http://dx.doi.org/10.1007/978-3-030-44999-5_36
work_keys_str_mv AT brouspaul datagovernanceassuccessfactorfordatascience
AT janssenmarijn datagovernanceassuccessfactorfordatascience
AT kransrutger datagovernanceassuccessfactorfordatascience