Cargando…

Early warning of telecom enterprise customer churn based on ensemble learning

Analyzing customers’ characteristics and giving the early warning of customer churn based on machine learning algorithms, can help enterprises provide targeted marketing strategies and personalized services, and save a lot of operating costs. Data cleaning, oversampling, data standardization and oth...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhou, Yancong, Chen, Wenyue, Sun, Xiaochen, Yang, Dandan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10566699/
https://www.ncbi.nlm.nih.gov/pubmed/37819986
http://dx.doi.org/10.1371/journal.pone.0292466
_version_ 1785118966995222528
author Zhou, Yancong
Chen, Wenyue
Sun, Xiaochen
Yang, Dandan
author_facet Zhou, Yancong
Chen, Wenyue
Sun, Xiaochen
Yang, Dandan
author_sort Zhou, Yancong
collection PubMed
description Analyzing customers’ characteristics and giving the early warning of customer churn based on machine learning algorithms, can help enterprises provide targeted marketing strategies and personalized services, and save a lot of operating costs. Data cleaning, oversampling, data standardization and other preprocessing operations are done on 900,000 telecom customer personal characteristics and historical behavior data set based on Python language. Appropriate model parameters were selected to build BPNN (Back Propagation Neural Network). Random Forest (RF) and Adaboost, the two classic ensemble learning models were introduced, and the Adaboost dual-ensemble learning model with RF as the base learner was put forward. The four models and the other four classical machine learning models-decision tree, naive Bayes, K-Nearest Neighbor (KNN), Support Vector Machine (SVM) were utilized respectively to analyze the customer churn data. The results show that the four models have better performance in terms of recall rate, precision rate, F1 score and other indicators, and the RF-Adaboost dual-ensemble model has the best performance. Among them, the recall rates of BPNN, RF, Adaboost and RF-Adaboost dual-ensemble model on positive samples are respectively 79%, 90%, 89%,93%, the precision rates are 97%, 99%, 98%, 99%, and the F1 scores are 87%, 95%, 94%, 96%. The RF-Adaboost dual-ensemble model has the best performance, and the three indicators are 10%, 1%, and 6% higher than the reference. The prediction results of customer churn provide strong data support for telecom companies to adopt appropriate retention strategies for pre-churn customers and reduce customer churn.
format Online
Article
Text
id pubmed-10566699
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-105666992023-10-12 Early warning of telecom enterprise customer churn based on ensemble learning Zhou, Yancong Chen, Wenyue Sun, Xiaochen Yang, Dandan PLoS One Research Article Analyzing customers’ characteristics and giving the early warning of customer churn based on machine learning algorithms, can help enterprises provide targeted marketing strategies and personalized services, and save a lot of operating costs. Data cleaning, oversampling, data standardization and other preprocessing operations are done on 900,000 telecom customer personal characteristics and historical behavior data set based on Python language. Appropriate model parameters were selected to build BPNN (Back Propagation Neural Network). Random Forest (RF) and Adaboost, the two classic ensemble learning models were introduced, and the Adaboost dual-ensemble learning model with RF as the base learner was put forward. The four models and the other four classical machine learning models-decision tree, naive Bayes, K-Nearest Neighbor (KNN), Support Vector Machine (SVM) were utilized respectively to analyze the customer churn data. The results show that the four models have better performance in terms of recall rate, precision rate, F1 score and other indicators, and the RF-Adaboost dual-ensemble model has the best performance. Among them, the recall rates of BPNN, RF, Adaboost and RF-Adaboost dual-ensemble model on positive samples are respectively 79%, 90%, 89%,93%, the precision rates are 97%, 99%, 98%, 99%, and the F1 scores are 87%, 95%, 94%, 96%. The RF-Adaboost dual-ensemble model has the best performance, and the three indicators are 10%, 1%, and 6% higher than the reference. The prediction results of customer churn provide strong data support for telecom companies to adopt appropriate retention strategies for pre-churn customers and reduce customer churn. Public Library of Science 2023-10-11 /pmc/articles/PMC10566699/ /pubmed/37819986 http://dx.doi.org/10.1371/journal.pone.0292466 Text en © 2023 Zhou et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Zhou, Yancong
Chen, Wenyue
Sun, Xiaochen
Yang, Dandan
Early warning of telecom enterprise customer churn based on ensemble learning
title Early warning of telecom enterprise customer churn based on ensemble learning
title_full Early warning of telecom enterprise customer churn based on ensemble learning
title_fullStr Early warning of telecom enterprise customer churn based on ensemble learning
title_full_unstemmed Early warning of telecom enterprise customer churn based on ensemble learning
title_short Early warning of telecom enterprise customer churn based on ensemble learning
title_sort early warning of telecom enterprise customer churn based on ensemble learning
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10566699/
https://www.ncbi.nlm.nih.gov/pubmed/37819986
http://dx.doi.org/10.1371/journal.pone.0292466
work_keys_str_mv AT zhouyancong earlywarningoftelecomenterprisecustomerchurnbasedonensemblelearning
AT chenwenyue earlywarningoftelecomenterprisecustomerchurnbasedonensemblelearning
AT sunxiaochen earlywarningoftelecomenterprisecustomerchurnbasedonensemblelearning
AT yangdandan earlywarningoftelecomenterprisecustomerchurnbasedonensemblelearning