Cargando…

CardioNet: a manually curated database for artificial intelligence-based research on cardiovascular diseases

BACKGROUND: Cardiovascular diseases (CVDs) are difficult to diagnose early and have risk factors that are easy to overlook. Early prediction and personalization of treatment through the use of artificial intelligence (AI) may help clinicians and patients manage CVDs more effectively. However, to app...

Descripción completa

Detalles Bibliográficos
Autores principales: Ahn, Imjin, Na, Wonjun, Kwon, Osung, Yang, Dong Hyun, Park, Gyung-Min, Gwon, Hansle, Kang, Hee Jun, Jeong, Yeon Uk, Yoo, Jungsun, Kim, Yunha, Jun, Tae Joon, Kim, Young-Hak
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7842077/
https://www.ncbi.nlm.nih.gov/pubmed/33509180
http://dx.doi.org/10.1186/s12911-021-01392-2
_version_ 1783643941190500352
author Ahn, Imjin
Na, Wonjun
Kwon, Osung
Yang, Dong Hyun
Park, Gyung-Min
Gwon, Hansle
Kang, Hee Jun
Jeong, Yeon Uk
Yoo, Jungsun
Kim, Yunha
Jun, Tae Joon
Kim, Young-Hak
author_facet Ahn, Imjin
Na, Wonjun
Kwon, Osung
Yang, Dong Hyun
Park, Gyung-Min
Gwon, Hansle
Kang, Hee Jun
Jeong, Yeon Uk
Yoo, Jungsun
Kim, Yunha
Jun, Tae Joon
Kim, Young-Hak
author_sort Ahn, Imjin
collection PubMed
description BACKGROUND: Cardiovascular diseases (CVDs) are difficult to diagnose early and have risk factors that are easy to overlook. Early prediction and personalization of treatment through the use of artificial intelligence (AI) may help clinicians and patients manage CVDs more effectively. However, to apply AI approaches to CVDs data, it is necessary to establish and curate a specialized database based on electronic health records (EHRs) and include pre-processed unstructured data. METHODS: To build a suitable database (CardioNet) for CVDs that can utilize AI technology, contributing to the overall care of patients with CVDs. First, we collected the anonymized records of 748,474 patients who had visited the Asan Medical Center (AMC) or Ulsan University Hospital (UUH) because of CVDs. Second, we set clinically plausible criteria to remove errors and duplication. Third, we integrated unstructured data such as readings of medical examinations with structured data sourced from EHRs to create the CardioNet. We subsequently performed natural language processing to structuralize the significant variables associated with CVDs because most results of the principal CVD-related medical examinations are free-text readings. Additionally, to ensure interoperability for convergent multi-center research, we standardized the data using several codes that correspond to the common data model. Finally, we created the descriptive table (i.e., dictionary of the CardioNet) to simplify access and utilization of data for clinicians and engineers and continuously validated the data to ensure reliability. RESULTS: CardioNet is a comprehensive database that can serve as a training set for AI models and assist in all aspects of clinical management of CVDs. It comprises information extracted from EHRs and results of readings of CVD-related digital tests. It consists of 27 tables, a code-master table, and a descriptive table. CONCLUSIONS: CardioNet database specialized in CVDs was established, with continuing data collection. We are actively supporting multi-center research, which may require further data processing, depending on the subject of the study. CardioNet will serve as the fundamental database for future CVD-related research projects.
format Online
Article
Text
id pubmed-7842077
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-78420772021-01-28 CardioNet: a manually curated database for artificial intelligence-based research on cardiovascular diseases Ahn, Imjin Na, Wonjun Kwon, Osung Yang, Dong Hyun Park, Gyung-Min Gwon, Hansle Kang, Hee Jun Jeong, Yeon Uk Yoo, Jungsun Kim, Yunha Jun, Tae Joon Kim, Young-Hak BMC Med Inform Decis Mak Research Article BACKGROUND: Cardiovascular diseases (CVDs) are difficult to diagnose early and have risk factors that are easy to overlook. Early prediction and personalization of treatment through the use of artificial intelligence (AI) may help clinicians and patients manage CVDs more effectively. However, to apply AI approaches to CVDs data, it is necessary to establish and curate a specialized database based on electronic health records (EHRs) and include pre-processed unstructured data. METHODS: To build a suitable database (CardioNet) for CVDs that can utilize AI technology, contributing to the overall care of patients with CVDs. First, we collected the anonymized records of 748,474 patients who had visited the Asan Medical Center (AMC) or Ulsan University Hospital (UUH) because of CVDs. Second, we set clinically plausible criteria to remove errors and duplication. Third, we integrated unstructured data such as readings of medical examinations with structured data sourced from EHRs to create the CardioNet. We subsequently performed natural language processing to structuralize the significant variables associated with CVDs because most results of the principal CVD-related medical examinations are free-text readings. Additionally, to ensure interoperability for convergent multi-center research, we standardized the data using several codes that correspond to the common data model. Finally, we created the descriptive table (i.e., dictionary of the CardioNet) to simplify access and utilization of data for clinicians and engineers and continuously validated the data to ensure reliability. RESULTS: CardioNet is a comprehensive database that can serve as a training set for AI models and assist in all aspects of clinical management of CVDs. It comprises information extracted from EHRs and results of readings of CVD-related digital tests. It consists of 27 tables, a code-master table, and a descriptive table. CONCLUSIONS: CardioNet database specialized in CVDs was established, with continuing data collection. We are actively supporting multi-center research, which may require further data processing, depending on the subject of the study. CardioNet will serve as the fundamental database for future CVD-related research projects. BioMed Central 2021-01-28 /pmc/articles/PMC7842077/ /pubmed/33509180 http://dx.doi.org/10.1186/s12911-021-01392-2 Text en © The Author(s) 2021 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research Article
Ahn, Imjin
Na, Wonjun
Kwon, Osung
Yang, Dong Hyun
Park, Gyung-Min
Gwon, Hansle
Kang, Hee Jun
Jeong, Yeon Uk
Yoo, Jungsun
Kim, Yunha
Jun, Tae Joon
Kim, Young-Hak
CardioNet: a manually curated database for artificial intelligence-based research on cardiovascular diseases
title CardioNet: a manually curated database for artificial intelligence-based research on cardiovascular diseases
title_full CardioNet: a manually curated database for artificial intelligence-based research on cardiovascular diseases
title_fullStr CardioNet: a manually curated database for artificial intelligence-based research on cardiovascular diseases
title_full_unstemmed CardioNet: a manually curated database for artificial intelligence-based research on cardiovascular diseases
title_short CardioNet: a manually curated database for artificial intelligence-based research on cardiovascular diseases
title_sort cardionet: a manually curated database for artificial intelligence-based research on cardiovascular diseases
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7842077/
https://www.ncbi.nlm.nih.gov/pubmed/33509180
http://dx.doi.org/10.1186/s12911-021-01392-2
work_keys_str_mv AT ahnimjin cardionetamanuallycurateddatabaseforartificialintelligencebasedresearchoncardiovasculardiseases
AT nawonjun cardionetamanuallycurateddatabaseforartificialintelligencebasedresearchoncardiovasculardiseases
AT kwonosung cardionetamanuallycurateddatabaseforartificialintelligencebasedresearchoncardiovasculardiseases
AT yangdonghyun cardionetamanuallycurateddatabaseforartificialintelligencebasedresearchoncardiovasculardiseases
AT parkgyungmin cardionetamanuallycurateddatabaseforartificialintelligencebasedresearchoncardiovasculardiseases
AT gwonhansle cardionetamanuallycurateddatabaseforartificialintelligencebasedresearchoncardiovasculardiseases
AT kangheejun cardionetamanuallycurateddatabaseforartificialintelligencebasedresearchoncardiovasculardiseases
AT jeongyeonuk cardionetamanuallycurateddatabaseforartificialintelligencebasedresearchoncardiovasculardiseases
AT yoojungsun cardionetamanuallycurateddatabaseforartificialintelligencebasedresearchoncardiovasculardiseases
AT kimyunha cardionetamanuallycurateddatabaseforartificialintelligencebasedresearchoncardiovasculardiseases
AT juntaejoon cardionetamanuallycurateddatabaseforartificialintelligencebasedresearchoncardiovasculardiseases
AT kimyounghak cardionetamanuallycurateddatabaseforartificialintelligencebasedresearchoncardiovasculardiseases