Big Data Warehouse for Healthcare-Sensitive Data Applications
Obesity is a major public health problem worldwide, and the prevalence of childhood obesity is of particular concern. Effective interventions for preventing and treating childhood obesity aim to change behaviour and exposure at the individual, community, and societal levels. However, monitoring and...
Main authors: | Shahid, Arsalan; Nguyen, Thien-An Ngoc; Kechadi, M-Tahar |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | MDPI 2021 |
Subjects: | |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8037603/ https://www.ncbi.nlm.nih.gov/pubmed/33800574 http://dx.doi.org/10.3390/s21072353 |
_version_ | 1783677182396071936 |
---|---|
author | Shahid, Arsalan Nguyen, Thien-An Ngoc Kechadi, M-Tahar |
author_facet | Shahid, Arsalan Nguyen, Thien-An Ngoc Kechadi, M-Tahar |
author_sort | Shahid, Arsalan |
collection | PubMed |
description | Obesity is a major public health problem worldwide, and the prevalence of childhood obesity is of particular concern. Effective interventions for preventing and treating childhood obesity aim to change behaviour and exposure at the individual, community, and societal levels. However, monitoring and evaluating such changes is very challenging. The EU Horizon 2020 project “Big Data against Childhood Obesity (BigO)” aims at gathering large-scale data from a large number of children using different sensor technologies to create comprehensive obesity prevalence models for data-driven predictions about specific policies on a community. It further provides real-time monitoring of the population responses, supported by meaningful real-time data analysis and visualisations. Since BigO involves monitoring and storing of personal data related to the behaviours of a potentially vulnerable population, the data representation, security, and access control are crucial. In this paper, we briefly present the BigO system architecture and focus on the necessary components of the system that deal with data access control, storage, anonymisation, and the corresponding interfaces with the rest of the system. We propose a three-layered data warehouse architecture: The back-end layer consists of a database management system for data collection, de-identification, and anonymisation of the original datasets. The role-based permissions and secured views are implemented in the access control layer. Lastly, the controller layer regulates the data access protocols for any data access and data analysis. We further present the data representation methods and the storage models considering the privacy and security mechanisms. The data privacy and security plans are devised based on the types of collected personal data, the types of users, data storage, data transmission, and data analysis. We discuss in detail the challenges of privacy protection in this large distributed data-driven application and implement novel privacy-aware data analysis protocols to ensure that the proposed models guarantee the privacy and security of datasets. Finally, we present the BigO system architecture and its implementation that integrates privacy-aware protocols. |
format | Online Article Text |
id | pubmed-8037603 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-8037603 2021-04-12 Big Data Warehouse for Healthcare-Sensitive Data Applications Shahid, Arsalan Nguyen, Thien-An Ngoc Kechadi, M-Tahar Sensors (Basel) Article Obesity is a major public health problem worldwide, and the prevalence of childhood obesity is of particular concern. Effective interventions for preventing and treating childhood obesity aim to change behaviour and exposure at the individual, community, and societal levels. However, monitoring and evaluating such changes is very challenging. The EU Horizon 2020 project “Big Data against Childhood Obesity (BigO)” aims at gathering large-scale data from a large number of children using different sensor technologies to create comprehensive obesity prevalence models for data-driven predictions about specific policies on a community. It further provides real-time monitoring of the population responses, supported by meaningful real-time data analysis and visualisations. Since BigO involves monitoring and storing of personal data related to the behaviours of a potentially vulnerable population, the data representation, security, and access control are crucial. In this paper, we briefly present the BigO system architecture and focus on the necessary components of the system that deal with data access control, storage, anonymisation, and the corresponding interfaces with the rest of the system. We propose a three-layered data warehouse architecture: The back-end layer consists of a database management system for data collection, de-identification, and anonymisation of the original datasets. The role-based permissions and secured views are implemented in the access control layer. Lastly, the controller layer regulates the data access protocols for any data access and data analysis. We further present the data representation methods and the storage models considering the privacy and security mechanisms. The data privacy and security plans are devised based on the types of collected personal data, the types of users, data storage, data transmission, and data analysis. We discuss in detail the challenges of privacy protection in this large distributed data-driven application and implement novel privacy-aware data analysis protocols to ensure that the proposed models guarantee the privacy and security of datasets. Finally, we present the BigO system architecture and its implementation that integrates privacy-aware protocols. MDPI 2021-03-28 /pmc/articles/PMC8037603/ /pubmed/33800574 http://dx.doi.org/10.3390/s21072353 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/ Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Shahid, Arsalan Nguyen, Thien-An Ngoc Kechadi, M-Tahar Big Data Warehouse for Healthcare-Sensitive Data Applications |
title | Big Data Warehouse for Healthcare-Sensitive Data Applications |
title_full | Big Data Warehouse for Healthcare-Sensitive Data Applications |
title_fullStr | Big Data Warehouse for Healthcare-Sensitive Data Applications |
title_full_unstemmed | Big Data Warehouse for Healthcare-Sensitive Data Applications |
title_short | Big Data Warehouse for Healthcare-Sensitive Data Applications |
title_sort | big data warehouse for healthcare-sensitive data applications |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8037603/ https://www.ncbi.nlm.nih.gov/pubmed/33800574 http://dx.doi.org/10.3390/s21072353 |
work_keys_str_mv | AT shahidarsalan bigdatawarehouseforhealthcaresensitivedataapplications AT nguyenthienanngoc bigdatawarehouseforhealthcaresensitivedataapplications AT kechadimtahar bigdatawarehouseforhealthcaresensitivedataapplications |
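
The three-layered data warehouse summarised in the description field (back-end de-identification, role-based secured views, and a controller that mediates analysis) can be illustrated with a minimal sketch. This is not the BigO implementation: the class names, the `ROLE_VIEWS` table, the salt, the record fields, and the use of a truncated salted SHA-256 hash as a pseudonym are all assumptions made for illustration only.

```python
import hashlib
from dataclasses import dataclass, field

# Hypothetical values for illustration; not taken from the BigO system.
SALT = b"example-salt"
ROLE_VIEWS = {
    "researcher": {"pseudo_id", "region", "steps"},
    "policy_maker": {"region", "steps"},
}


@dataclass
class BackEnd:
    """Back-end layer: stores records only after de-identification."""
    rows: list = field(default_factory=list)

    def ingest(self, record: dict) -> None:
        # Replace the direct identifier with a salted hash; the name is not stored.
        digest = hashlib.sha256(SALT + record["user_id"].encode()).hexdigest()
        self.rows.append({
            "pseudo_id": digest[:12],
            "region": record["region"],
            "steps": record["steps"],
        })


class AccessControl:
    """Access control layer: each role sees only its permitted columns."""

    def __init__(self, backend: BackEnd) -> None:
        self.backend = backend

    def view(self, role: str) -> list:
        allowed = ROLE_VIEWS.get(role)
        if allowed is None:
            raise PermissionError(f"unknown role: {role}")
        return [{k: v for k, v in row.items() if k in allowed}
                for row in self.backend.rows]


class Controller:
    """Controller layer: the single entry point for any analysis request."""

    def __init__(self, acl: AccessControl) -> None:
        self.acl = acl

    def mean_steps(self, role: str) -> float:
        rows = self.acl.view(role)
        if not rows:
            raise ValueError("no data available")
        return sum(r["steps"] for r in rows) / len(rows)
```

In this sketch an analyst never touches `BackEnd.rows` directly; every query goes through `Controller`, which in turn can only read what `AccessControl.view` exposes for the caller's role.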