Cargando…

Leveraging Data Quality to Better Prepare for Process Mining: An Approach Illustrated Through Analysing Road Trauma Pre-Hospital Retrieval and Transport Processes in Queensland

While noting the importance of data quality, existing process mining methodologies (i) do not provide details on how to assess the quality of event data (ii) do not consider how the identification of data quality issues can be exploited in the planning, data extraction and log building phases of any...

Descripción completa

Detalles Bibliográficos
Autores principales: Andrews, Robert, Wynn, Moe T., Vallmuur, Kirsten, ter Hofstede, Arthur H. M., Bosley, Emma, Elcock, Mark, Rashford, Stephen
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6479847/
https://www.ncbi.nlm.nih.gov/pubmed/30934913
http://dx.doi.org/10.3390/ijerph16071138
_version_ 1783413439553601536
author Andrews, Robert
Wynn, Moe T.
Vallmuur, Kirsten
ter Hofstede, Arthur H. M.
Bosley, Emma
Elcock, Mark
Rashford, Stephen
author_facet Andrews, Robert
Wynn, Moe T.
Vallmuur, Kirsten
ter Hofstede, Arthur H. M.
Bosley, Emma
Elcock, Mark
Rashford, Stephen
author_sort Andrews, Robert
collection PubMed
description While noting the importance of data quality, existing process mining methodologies (i) do not provide details on how to assess the quality of event data (ii) do not consider how the identification of data quality issues can be exploited in the planning, data extraction and log building phases of any process mining analysis, (iii) do not highlight potential impacts of poor quality data on different types of process analyses. As our key contribution, we develop a process-centric, data quality-driven approach to preparing for a process mining analysis which can be applied to any existing process mining methodology. Our approach, adapted from elements of the well known CRISP-DM data mining methodology, includes conceptual data modeling, quality assessment at both attribute and event level, and trial discovery and conformance to develop understanding of system processes and data properties to inform data extraction. We illustrate our approach in a case study involving the Queensland Ambulance Service (QAS) and Retrieval Services Queensland (RSQ). We describe the detailed preparation for a process mining analysis of retrieval and transport processes (ground and aero-medical) for road-trauma patients in Queensland. Sample datasets obtained from QAS and RSQ are utilised to show how quality metrics, data models and exploratory process mining analyses can be used to (i) identify data quality issues, (ii) anticipate and explain certain observable features in process mining analyses, (iii) distinguish between systemic and occasional quality issues, and (iv) reason about the mechanisms by which identified quality issues may have arisen in the event log. We contend that this knowledge can be used to guide the data extraction and pre-processing stages of a process mining case study to properly align the data with the case study research questions.
format Online
Article
Text
id pubmed-6479847
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-64798472019-04-29 Leveraging Data Quality to Better Prepare for Process Mining: An Approach Illustrated Through Analysing Road Trauma Pre-Hospital Retrieval and Transport Processes in Queensland Andrews, Robert Wynn, Moe T. Vallmuur, Kirsten ter Hofstede, Arthur H. M. Bosley, Emma Elcock, Mark Rashford, Stephen Int J Environ Res Public Health Article While noting the importance of data quality, existing process mining methodologies (i) do not provide details on how to assess the quality of event data (ii) do not consider how the identification of data quality issues can be exploited in the planning, data extraction and log building phases of any process mining analysis, (iii) do not highlight potential impacts of poor quality data on different types of process analyses. As our key contribution, we develop a process-centric, data quality-driven approach to preparing for a process mining analysis which can be applied to any existing process mining methodology. Our approach, adapted from elements of the well known CRISP-DM data mining methodology, includes conceptual data modeling, quality assessment at both attribute and event level, and trial discovery and conformance to develop understanding of system processes and data properties to inform data extraction. We illustrate our approach in a case study involving the Queensland Ambulance Service (QAS) and Retrieval Services Queensland (RSQ). We describe the detailed preparation for a process mining analysis of retrieval and transport processes (ground and aero-medical) for road-trauma patients in Queensland. Sample datasets obtained from QAS and RSQ are utilised to show how quality metrics, data models and exploratory process mining analyses can be used to (i) identify data quality issues, (ii) anticipate and explain certain observable features in process mining analyses, (iii) distinguish between systemic and occasional quality issues, and (iv) reason about the mechanisms by which identified quality issues may have arisen in the event log. We contend that this knowledge can be used to guide the data extraction and pre-processing stages of a process mining case study to properly align the data with the case study research questions. MDPI 2019-03-29 2019-04 /pmc/articles/PMC6479847/ /pubmed/30934913 http://dx.doi.org/10.3390/ijerph16071138 Text en © 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Andrews, Robert
Wynn, Moe T.
Vallmuur, Kirsten
ter Hofstede, Arthur H. M.
Bosley, Emma
Elcock, Mark
Rashford, Stephen
Leveraging Data Quality to Better Prepare for Process Mining: An Approach Illustrated Through Analysing Road Trauma Pre-Hospital Retrieval and Transport Processes in Queensland
title Leveraging Data Quality to Better Prepare for Process Mining: An Approach Illustrated Through Analysing Road Trauma Pre-Hospital Retrieval and Transport Processes in Queensland
title_full Leveraging Data Quality to Better Prepare for Process Mining: An Approach Illustrated Through Analysing Road Trauma Pre-Hospital Retrieval and Transport Processes in Queensland
title_fullStr Leveraging Data Quality to Better Prepare for Process Mining: An Approach Illustrated Through Analysing Road Trauma Pre-Hospital Retrieval and Transport Processes in Queensland
title_full_unstemmed Leveraging Data Quality to Better Prepare for Process Mining: An Approach Illustrated Through Analysing Road Trauma Pre-Hospital Retrieval and Transport Processes in Queensland
title_short Leveraging Data Quality to Better Prepare for Process Mining: An Approach Illustrated Through Analysing Road Trauma Pre-Hospital Retrieval and Transport Processes in Queensland
title_sort leveraging data quality to better prepare for process mining: an approach illustrated through analysing road trauma pre-hospital retrieval and transport processes in queensland
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6479847/
https://www.ncbi.nlm.nih.gov/pubmed/30934913
http://dx.doi.org/10.3390/ijerph16071138
work_keys_str_mv AT andrewsrobert leveragingdataqualitytobetterprepareforprocessmininganapproachillustratedthroughanalysingroadtraumaprehospitalretrievalandtransportprocessesinqueensland
AT wynnmoet leveragingdataqualitytobetterprepareforprocessmininganapproachillustratedthroughanalysingroadtraumaprehospitalretrievalandtransportprocessesinqueensland
AT vallmuurkirsten leveragingdataqualitytobetterprepareforprocessmininganapproachillustratedthroughanalysingroadtraumaprehospitalretrievalandtransportprocessesinqueensland
AT terhofstedearthurhm leveragingdataqualitytobetterprepareforprocessmininganapproachillustratedthroughanalysingroadtraumaprehospitalretrievalandtransportprocessesinqueensland
AT bosleyemma leveragingdataqualitytobetterprepareforprocessmininganapproachillustratedthroughanalysingroadtraumaprehospitalretrievalandtransportprocessesinqueensland
AT elcockmark leveragingdataqualitytobetterprepareforprocessmininganapproachillustratedthroughanalysingroadtraumaprehospitalretrievalandtransportprocessesinqueensland
AT rashfordstephen leveragingdataqualitytobetterprepareforprocessmininganapproachillustratedthroughanalysingroadtraumaprehospitalretrievalandtransportprocessesinqueensland