Cargando…

Lifelog Data-Based Prediction Model of Digital Health Care App Customer Churn: Retrospective Observational Study

BACKGROUND: Customer churn is the rate at which customers stop doing business with an entity. In the field of digital health care, user churn prediction is important not only in terms of company revenue but also for improving the health of users. Churn prediction has been previously studied, but mos...

Descripción completa

Detalles Bibliográficos
Autores principales: Kwon, Hongwook, Kim, Ho Heon, An, Jaeil, Lee, Jae-Ho, Park, Yu Rang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: JMIR Publications 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7817354/
https://www.ncbi.nlm.nih.gov/pubmed/33404511
http://dx.doi.org/10.2196/22184
_version_ 1783638618966851584
author Kwon, Hongwook
Kim, Ho Heon
An, Jaeil
Lee, Jae-Ho
Park, Yu Rang
author_facet Kwon, Hongwook
Kim, Ho Heon
An, Jaeil
Lee, Jae-Ho
Park, Yu Rang
author_sort Kwon, Hongwook
collection PubMed
description BACKGROUND: Customer churn is the rate at which customers stop doing business with an entity. In the field of digital health care, user churn prediction is important not only in terms of company revenue but also for improving the health of users. Churn prediction has been previously studied, but most studies applied time-invariant model structures and used structured data. However, additional unstructured data have become available; therefore, it has become essential to process daily time-series log data for churn predictions. OBJECTIVE: We aimed to apply a recurrent neural network structure to accept time-series patterns using lifelog data and text message data to predict the churn of digital health care users. METHODS: This study was based on the use data of a digital health care app that provides interactive messages with human coaches regarding food, exercise, and weight logs. Among the users in Korea who enrolled between January 1, 2017 and January 1, 2019, we defined churn users according to the following criteria: users who received a refund before the paid program ended and users who received a refund 7 days after the trial period. We used long short-term memory with a masking layer to receive sequence data with different lengths. We also performed topic modeling to vectorize text messages. To interpret the contributions of each variable to model predictions, we used integrated gradients, which is an attribution method. RESULTS: A total of 1868 eligible users were included in this study. The final performance of churn prediction was an F1 score of 0.89; that score decreased by 0.12 when the data of the final week were excluded (F1 score 0.77). Additionally, when text data were included, the mean predicted performance increased by approximately 0.085 at every time point. Steps per day had the largest contribution (0.1085). Among the topic variables, poor habits (eg, drinking alcohol, overeating, and late-night eating) showed the largest contribution (0.0875). CONCLUSIONS: The model with a recurrent neural network architecture that used log data and message data demonstrated high performance for churn classification. Additionally, the analysis of the contribution of the variables is expected to help identify signs of user churn in advance and improve the adherence in digital health care.
format Online
Article
Text
id pubmed-7817354
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher JMIR Publications
record_format MEDLINE/PubMed
spelling pubmed-78173542021-01-26 Lifelog Data-Based Prediction Model of Digital Health Care App Customer Churn: Retrospective Observational Study Kwon, Hongwook Kim, Ho Heon An, Jaeil Lee, Jae-Ho Park, Yu Rang J Med Internet Res Original Paper BACKGROUND: Customer churn is the rate at which customers stop doing business with an entity. In the field of digital health care, user churn prediction is important not only in terms of company revenue but also for improving the health of users. Churn prediction has been previously studied, but most studies applied time-invariant model structures and used structured data. However, additional unstructured data have become available; therefore, it has become essential to process daily time-series log data for churn predictions. OBJECTIVE: We aimed to apply a recurrent neural network structure to accept time-series patterns using lifelog data and text message data to predict the churn of digital health care users. METHODS: This study was based on the use data of a digital health care app that provides interactive messages with human coaches regarding food, exercise, and weight logs. Among the users in Korea who enrolled between January 1, 2017 and January 1, 2019, we defined churn users according to the following criteria: users who received a refund before the paid program ended and users who received a refund 7 days after the trial period. We used long short-term memory with a masking layer to receive sequence data with different lengths. We also performed topic modeling to vectorize text messages. To interpret the contributions of each variable to model predictions, we used integrated gradients, which is an attribution method. RESULTS: A total of 1868 eligible users were included in this study. The final performance of churn prediction was an F1 score of 0.89; that score decreased by 0.12 when the data of the final week were excluded (F1 score 0.77). Additionally, when text data were included, the mean predicted performance increased by approximately 0.085 at every time point. Steps per day had the largest contribution (0.1085). Among the topic variables, poor habits (eg, drinking alcohol, overeating, and late-night eating) showed the largest contribution (0.0875). CONCLUSIONS: The model with a recurrent neural network architecture that used log data and message data demonstrated high performance for churn classification. Additionally, the analysis of the contribution of the variables is expected to help identify signs of user churn in advance and improve the adherence in digital health care. JMIR Publications 2021-01-06 /pmc/articles/PMC7817354/ /pubmed/33404511 http://dx.doi.org/10.2196/22184 Text en ©Hongwook Kwon, Ho Heon Kim, Jaeil An, Jae-Ho Lee, Yu Rang Park. Originally published in the Journal of Medical Internet Research (http://www.jmir.org), 06.01.2021. https://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in the Journal of Medical Internet Research, is properly cited. The complete bibliographic information, a link to the original publication on http://www.jmir.org/, as well as this copyright and license information must be included.
spellingShingle Original Paper
Kwon, Hongwook
Kim, Ho Heon
An, Jaeil
Lee, Jae-Ho
Park, Yu Rang
Lifelog Data-Based Prediction Model of Digital Health Care App Customer Churn: Retrospective Observational Study
title Lifelog Data-Based Prediction Model of Digital Health Care App Customer Churn: Retrospective Observational Study
title_full Lifelog Data-Based Prediction Model of Digital Health Care App Customer Churn: Retrospective Observational Study
title_fullStr Lifelog Data-Based Prediction Model of Digital Health Care App Customer Churn: Retrospective Observational Study
title_full_unstemmed Lifelog Data-Based Prediction Model of Digital Health Care App Customer Churn: Retrospective Observational Study
title_short Lifelog Data-Based Prediction Model of Digital Health Care App Customer Churn: Retrospective Observational Study
title_sort lifelog data-based prediction model of digital health care app customer churn: retrospective observational study
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7817354/
https://www.ncbi.nlm.nih.gov/pubmed/33404511
http://dx.doi.org/10.2196/22184
work_keys_str_mv AT kwonhongwook lifelogdatabasedpredictionmodelofdigitalhealthcareappcustomerchurnretrospectiveobservationalstudy
AT kimhoheon lifelogdatabasedpredictionmodelofdigitalhealthcareappcustomerchurnretrospectiveobservationalstudy
AT anjaeil lifelogdatabasedpredictionmodelofdigitalhealthcareappcustomerchurnretrospectiveobservationalstudy
AT leejaeho lifelogdatabasedpredictionmodelofdigitalhealthcareappcustomerchurnretrospectiveobservationalstudy
AT parkyurang lifelogdatabasedpredictionmodelofdigitalhealthcareappcustomerchurnretrospectiveobservationalstudy