Cargando…

Machine learning prediction models for different stages of non-small cell lung cancer based on tongue and tumor marker: a pilot study

OBJECTIVE: To analyze the tongue feature of NSCLC at different stages, as well as the correlation between tongue feature and tumor marker, and investigate the feasibility of establishing prediction models for NSCLC at different stages based on tongue feature and tumor marker. METHODS: Tongue images...

Descripción completa

Detalles Bibliográficos
Autores principales: Shi, Yulin, Wang, Hao, Yao, Xinghua, Li, Jun, Liu, Jiayi, Chen, Yuan, Liu, Lingshuang, Xu, Jiatuo
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10542664/
https://www.ncbi.nlm.nih.gov/pubmed/37773123
http://dx.doi.org/10.1186/s12911-023-02266-5
_version_ 1785114139468759040
author Shi, Yulin
Wang, Hao
Yao, Xinghua
Li, Jun
Liu, Jiayi
Chen, Yuan
Liu, Lingshuang
Xu, Jiatuo
author_facet Shi, Yulin
Wang, Hao
Yao, Xinghua
Li, Jun
Liu, Jiayi
Chen, Yuan
Liu, Lingshuang
Xu, Jiatuo
author_sort Shi, Yulin
collection PubMed
description OBJECTIVE: To analyze the tongue feature of NSCLC at different stages, as well as the correlation between tongue feature and tumor marker, and investigate the feasibility of establishing prediction models for NSCLC at different stages based on tongue feature and tumor marker. METHODS: Tongue images were collected from non-advanced NSCLC patients (n = 109) and advanced NSCLC patients (n = 110), analyzed the tongue images to obtain tongue feature, and analyzed the correlation between tongue feature and tumor marker in different stages of NSCLC. On this basis, six classifiers, decision tree, logistic regression, SVM, random forest, naive bayes, and neural network, were used to establish prediction models for different stages of NSCLC based on tongue feature and tumor marker. RESULTS: There were statistically significant differences in tongue feature between the non-advanced and advanced NSCLC groups. In the advanced NSCLC group, the number of indexes with statistically significant correlations between tongue feature and tumor marker was significantly higher than in the non-advanced NSCLC group, and the correlations were stronger. Support Vector Machine (SVM), decision tree, and logistic regression among the machine learning methods performed poorly in models with different stages of NSCLC. Neural network, random forest and naive bayes had better classification efficiency for the data set of tongue feature and tumor marker and baseline. The models’ classification accuracies were 0.767 ± 0.081, 0.718 ± 0.062, and 0.688 ± 0.070, respectively, and the AUCs were 0.793 ± 0.086, 0.779 ± 0.075, and 0.771 ± 0.072, respectively. CONCLUSIONS: There were statistically significant differences in tongue feature between different stages of NSCLC, with advanced NSCLC tongue feature being more closely correlated with tumor marker. Due to the limited information, single data sources including baseline, tongue feature, and tumor marker cannot be used to identify the different stages of NSCLC in this pilot study. In addition to the logistic regression method, other machine learning methods, based on tumor marker and baseline data sets, can effectively improve the differential diagnosis efficiency of different stages of NSCLC by adding tongue image data, which requires further verification based on large sample studies in the future. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12911-023-02266-5.
format Online
Article
Text
id pubmed-10542664
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-105426642023-10-03 Machine learning prediction models for different stages of non-small cell lung cancer based on tongue and tumor marker: a pilot study Shi, Yulin Wang, Hao Yao, Xinghua Li, Jun Liu, Jiayi Chen, Yuan Liu, Lingshuang Xu, Jiatuo BMC Med Inform Decis Mak Research OBJECTIVE: To analyze the tongue feature of NSCLC at different stages, as well as the correlation between tongue feature and tumor marker, and investigate the feasibility of establishing prediction models for NSCLC at different stages based on tongue feature and tumor marker. METHODS: Tongue images were collected from non-advanced NSCLC patients (n = 109) and advanced NSCLC patients (n = 110), analyzed the tongue images to obtain tongue feature, and analyzed the correlation between tongue feature and tumor marker in different stages of NSCLC. On this basis, six classifiers, decision tree, logistic regression, SVM, random forest, naive bayes, and neural network, were used to establish prediction models for different stages of NSCLC based on tongue feature and tumor marker. RESULTS: There were statistically significant differences in tongue feature between the non-advanced and advanced NSCLC groups. In the advanced NSCLC group, the number of indexes with statistically significant correlations between tongue feature and tumor marker was significantly higher than in the non-advanced NSCLC group, and the correlations were stronger. Support Vector Machine (SVM), decision tree, and logistic regression among the machine learning methods performed poorly in models with different stages of NSCLC. Neural network, random forest and naive bayes had better classification efficiency for the data set of tongue feature and tumor marker and baseline. The models’ classification accuracies were 0.767 ± 0.081, 0.718 ± 0.062, and 0.688 ± 0.070, respectively, and the AUCs were 0.793 ± 0.086, 0.779 ± 0.075, and 0.771 ± 0.072, respectively. CONCLUSIONS: There were statistically significant differences in tongue feature between different stages of NSCLC, with advanced NSCLC tongue feature being more closely correlated with tumor marker. Due to the limited information, single data sources including baseline, tongue feature, and tumor marker cannot be used to identify the different stages of NSCLC in this pilot study. In addition to the logistic regression method, other machine learning methods, based on tumor marker and baseline data sets, can effectively improve the differential diagnosis efficiency of different stages of NSCLC by adding tongue image data, which requires further verification based on large sample studies in the future. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12911-023-02266-5. BioMed Central 2023-09-29 /pmc/articles/PMC10542664/ /pubmed/37773123 http://dx.doi.org/10.1186/s12911-023-02266-5 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Shi, Yulin
Wang, Hao
Yao, Xinghua
Li, Jun
Liu, Jiayi
Chen, Yuan
Liu, Lingshuang
Xu, Jiatuo
Machine learning prediction models for different stages of non-small cell lung cancer based on tongue and tumor marker: a pilot study
title Machine learning prediction models for different stages of non-small cell lung cancer based on tongue and tumor marker: a pilot study
title_full Machine learning prediction models for different stages of non-small cell lung cancer based on tongue and tumor marker: a pilot study
title_fullStr Machine learning prediction models for different stages of non-small cell lung cancer based on tongue and tumor marker: a pilot study
title_full_unstemmed Machine learning prediction models for different stages of non-small cell lung cancer based on tongue and tumor marker: a pilot study
title_short Machine learning prediction models for different stages of non-small cell lung cancer based on tongue and tumor marker: a pilot study
title_sort machine learning prediction models for different stages of non-small cell lung cancer based on tongue and tumor marker: a pilot study
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10542664/
https://www.ncbi.nlm.nih.gov/pubmed/37773123
http://dx.doi.org/10.1186/s12911-023-02266-5
work_keys_str_mv AT shiyulin machinelearningpredictionmodelsfordifferentstagesofnonsmallcelllungcancerbasedontongueandtumormarkerapilotstudy
AT wanghao machinelearningpredictionmodelsfordifferentstagesofnonsmallcelllungcancerbasedontongueandtumormarkerapilotstudy
AT yaoxinghua machinelearningpredictionmodelsfordifferentstagesofnonsmallcelllungcancerbasedontongueandtumormarkerapilotstudy
AT lijun machinelearningpredictionmodelsfordifferentstagesofnonsmallcelllungcancerbasedontongueandtumormarkerapilotstudy
AT liujiayi machinelearningpredictionmodelsfordifferentstagesofnonsmallcelllungcancerbasedontongueandtumormarkerapilotstudy
AT chenyuan machinelearningpredictionmodelsfordifferentstagesofnonsmallcelllungcancerbasedontongueandtumormarkerapilotstudy
AT liulingshuang machinelearningpredictionmodelsfordifferentstagesofnonsmallcelllungcancerbasedontongueandtumormarkerapilotstudy
AT xujiatuo machinelearningpredictionmodelsfordifferentstagesofnonsmallcelllungcancerbasedontongueandtumormarkerapilotstudy