Big Data Health Care Platform With Multisource Heterogeneous Data Integration and Massive High-Dimensional Data Governance for Large Hospitals: Design, Development, and Application

BACKGROUND: With the advent of data-intensive science, the full integration of big data science and health care will bring a cross-field revolution to the medical community in China. The concept of big data represents not only a technology but also a resource and a method. Big data are regarded as an imp...

Bibliographic Details
Main authors: Wang, Miye; Li, Sheyu; Zheng, Tao; Li, Nan; Shi, Qingke; Zhuo, Xuejun; Ding, Renxin; Huang, Yong
Format: Online Article Text
Language: English
Published: JMIR Publications 2022
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9047713/
https://www.ncbi.nlm.nih.gov/pubmed/35416792
http://dx.doi.org/10.2196/36481
author Wang, Miye
Li, Sheyu
Zheng, Tao
Li, Nan
Shi, Qingke
Zhuo, Xuejun
Ding, Renxin
Huang, Yong
collection PubMed
description BACKGROUND: With the advent of data-intensive science, the full integration of big data science and health care will bring a cross-field revolution to the medical community in China. The concept of big data represents not only a technology but also a resource and a method. Big data are regarded as an important strategic resource at both the national level and the medical institutional level; thus, great importance has been attached to the construction of a big data platform for health care.
OBJECTIVE: We aimed to develop and implement a big data platform for a large hospital, to overcome difficulties in integrating, calculating, storing, and governing multisource heterogeneous data in a standardized way, as well as to ensure health care data security.
METHODS: The project to build a big data platform at West China Hospital of Sichuan University was launched in 2017. The platform has extracted, integrated, and governed data dating back to January 2008 from different departments and sections of the hospital. A master–slave mode was implemented to realize the real-time integration of massive multisource heterogeneous data, and an environment that separates the storage and calculation processes for data with heterogeneous characteristics was built. A business-based metadata model was improved for data quality control, and a standardized health care data governance system and a scientific closed-loop data security ecology were established.
RESULTS: After 3 years of design, development, and testing, the West China Hospital of Sichuan University big data platform was formally brought online in November 2020. It has formed a massive multidimensional data resource database, with more than 12.49 million patients, 75.67 million visits, and 8475 data variables. Along with hospital operations data, newly generated data are entered into the platform in real time. Since its launch, the platform has supported more than 20 major projects and provided data service, storage, and computing power support to many scientific teams, shifting the data support model from conventional manual extraction to self-service retrieval (which has reached 8561 retrievals per month).
CONCLUSIONS: The platform can combine operation systems data from all departments and sections in a hospital to form a massive, high-dimensional, high-quality health care database that allows electronic medical records to be used effectively and taps into the value of data to fully support clinical services, scientific research, and operations management. The platform can successfully provide storage and computing power for multisource heterogeneous data. By effectively governing massive multidimensional data gathered from multiple sources, it provides highly available data assets and thus has a high application value in the health care field. It also facilitates simpler and more efficient utilization of electronic medical record data for real-world research.
format Online
Article
Text
id pubmed-9047713
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher JMIR Publications
record_format MEDLINE/PubMed
spelling pubmed-9047713 2022-04-29 JMIR Med Inform Original Paper JMIR Publications 2022-04-13 /pmc/articles/PMC9047713/ /pubmed/35416792 http://dx.doi.org/10.2196/36481 Text en ©Miye Wang, Sheyu Li, Tao Zheng, Nan Li, Qingke Shi, Xuejun Zhuo, Renxin Ding, Yong Huang. Originally published in JMIR Medical Informatics (https://medinform.jmir.org), 13.04.2022.
This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Medical Informatics, is properly cited. The complete bibliographic information, a link to the original publication on https://medinform.jmir.org/, as well as this copyright and license information must be included.
title Big Data Health Care Platform With Multisource Heterogeneous Data Integration and Massive High-Dimensional Data Governance for Large Hospitals: Design, Development, and Application
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9047713/
https://www.ncbi.nlm.nih.gov/pubmed/35416792
http://dx.doi.org/10.2196/36481