Cargando…

Cardiovascular Risk Prediction Using Machine Learning in a Large Japanese Cohort

Background: Cardiovascular disease (CVD) screening entails precise event prediction to orient risk stratification, resource allocation, and insurance policy. We used random survival forests (RSF) to identify markers of incident CVD among Japanese adults enrolled in an employer-mandated screening pro...

Descripción completa

Detalles Bibliográficos
Autores principales: Matheson, Matthew B., Kato, Yoko, Baba, Shinichi, Cox, Christopher, Lima, João A.C., Ambale-Venkatesh, Bharath
Formato: Online Artículo Texto
Lenguaje:English
Publicado: The Japanese Circulation Society 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9726526/
https://www.ncbi.nlm.nih.gov/pubmed/36530840
http://dx.doi.org/10.1253/circrep.CR-22-0101
Descripción
Sumario:Background: Cardiovascular disease (CVD) screening entails precise event prediction to orient risk stratification, resource allocation, and insurance policy. We used random survival forests (RSF) to identify markers of incident CVD among Japanese adults enrolled in an employer-mandated screening program. Methods and Results: We examined biomarker, health history, medication use, and lifestyle data from 155,108 adults aged ≥40 years. The occurrence of coronary artery disease (CAD) or atherosclerotic CVD (ASCVD) events was examined over 6 years of follow-up. The analysis used RSF to identify predictors, then investigated simplified RSF models with fewer predictors for individual-level risk prediction. Data were split into training (70%) and test (30%) datasets. At baseline, the median patient age was 47 years (interquartile range 41–56 years), with 65% males. In all, 1,642 CAD and 2,164 ASCVD events were observed. RSF identified history of heart disease, age, self-reported blood pressure medication, HbA1c, fasting blood sugar, and high-density lipoprotein as important markers of both endpoints. RSF analyses with only the top 20 predictors demonstrated good performance, with areas under the curve of >84% for CAD and >82% for ASCVD in test data across 6 years. Conclusions: We present a machine learning technique for accurate assessment of cardiovascular risk using employer-mandated annual health checkup information. The algorithm produces individual-level risk curves over time, empowering clinicians to efficiently implement prevention strategies in a low-risk population.