Cargando…

Serum levels of chemical elements in esophageal squamous cell carcinoma in Anyang, China: a case-control study based on machine learning methods

OBJECTIVES: Esophageal squamous cell carcinoma (ESCC) is the predominant form of esophageal carcinoma with extremely aggressive nature and low survival rate. The risk factors for ESCC in the high-incidence areas of China remain unclear. We used machine learning methods to investigate whether there w...

Descripción completa

Detalles Bibliográficos
Autores principales: Lin, Tong, Liu, Tiebing, Lin, Yucheng, Zhang, Chaoting, Yan, Lailai, Chen, Zhongxue, He, Zhonghu, Wang, Jingyu
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BMJ Publishing Group 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5623487/
https://www.ncbi.nlm.nih.gov/pubmed/28947442
http://dx.doi.org/10.1136/bmjopen-2016-015443
_version_ 1783268098796683264
author Lin, Tong
Liu, Tiebing
Lin, Yucheng
Zhang, Chaoting
Yan, Lailai
Chen, Zhongxue
He, Zhonghu
Wang, Jingyu
author_facet Lin, Tong
Liu, Tiebing
Lin, Yucheng
Zhang, Chaoting
Yan, Lailai
Chen, Zhongxue
He, Zhonghu
Wang, Jingyu
author_sort Lin, Tong
collection PubMed
description OBJECTIVES: Esophageal squamous cell carcinoma (ESCC) is the predominant form of esophageal carcinoma with extremely aggressive nature and low survival rate. The risk factors for ESCC in the high-incidence areas of China remain unclear. We used machine learning methods to investigate whether there was an association between the alterations of serum levels of certain chemical elements and ESCC. SETTINGS: Primary healthcare unit in Anyang city, Henan Province of China. PARTICIPANTS: 100 patients with ESCC and 100 healthy controls matched for age, sex and region were included. PRIMARY AND SECONDARY OUTCOME MEASURES: Primary outcome was the classification accuracy. Secondary outcome was the p Value of the t-test or rank-sum test. METHODS: Both traditional statistical methods of t-test and rank-sum test and fashionable machine learning approaches were employed. RESULTS: Random Forest achieves the best accuracy of 98.38% on the original feature vectors (without dimensionality reduction), and support vector machine outperforms other classifiers by yielding accuracy of 96.56% on embedding spaces (with dimensionality reduction). All six classifiers can achieve accuracies more than 90% based on the single most important element Sr. The other two elements with distinctive difference are S and P, providing accuracies around 80%. More than half of chemical elements were found to be significantly different between patients with ESCC and the controls. CONCLUSIONS: These results suggest clear differences between patients with ESCC and controls, implying some potential promising applications in diagnosis, prognosis, pharmacy and nutrition of ESCC. However, the results should be interpreted with caution due to the retrospective design nature, limited sample size and the lack of several potential confounding factors (including obesity, nutritional status, and fruit and vegetable consumption and potential regional carcinogen contacts).
format Online
Article
Text
id pubmed-5623487
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher BMJ Publishing Group
record_format MEDLINE/PubMed
spelling pubmed-56234872017-10-10 Serum levels of chemical elements in esophageal squamous cell carcinoma in Anyang, China: a case-control study based on machine learning methods Lin, Tong Liu, Tiebing Lin, Yucheng Zhang, Chaoting Yan, Lailai Chen, Zhongxue He, Zhonghu Wang, Jingyu BMJ Open Health Informatics OBJECTIVES: Esophageal squamous cell carcinoma (ESCC) is the predominant form of esophageal carcinoma with extremely aggressive nature and low survival rate. The risk factors for ESCC in the high-incidence areas of China remain unclear. We used machine learning methods to investigate whether there was an association between the alterations of serum levels of certain chemical elements and ESCC. SETTINGS: Primary healthcare unit in Anyang city, Henan Province of China. PARTICIPANTS: 100 patients with ESCC and 100 healthy controls matched for age, sex and region were included. PRIMARY AND SECONDARY OUTCOME MEASURES: Primary outcome was the classification accuracy. Secondary outcome was the p Value of the t-test or rank-sum test. METHODS: Both traditional statistical methods of t-test and rank-sum test and fashionable machine learning approaches were employed. RESULTS: Random Forest achieves the best accuracy of 98.38% on the original feature vectors (without dimensionality reduction), and support vector machine outperforms other classifiers by yielding accuracy of 96.56% on embedding spaces (with dimensionality reduction). All six classifiers can achieve accuracies more than 90% based on the single most important element Sr. The other two elements with distinctive difference are S and P, providing accuracies around 80%. More than half of chemical elements were found to be significantly different between patients with ESCC and the controls. CONCLUSIONS: These results suggest clear differences between patients with ESCC and controls, implying some potential promising applications in diagnosis, prognosis, pharmacy and nutrition of ESCC. However, the results should be interpreted with caution due to the retrospective design nature, limited sample size and the lack of several potential confounding factors (including obesity, nutritional status, and fruit and vegetable consumption and potential regional carcinogen contacts). BMJ Publishing Group 2017-09-24 /pmc/articles/PMC5623487/ /pubmed/28947442 http://dx.doi.org/10.1136/bmjopen-2016-015443 Text en © Article author(s) (or their employer(s) unless otherwise stated in the text of the article) 2017. All rights reserved. No commercial use is permitted unless otherwise expressly granted. This is an Open Access article distributed in accordance with the Creative Commons Attribution Non Commercial (CC BY-NC 4.0) license, which permits others to distribute, remix, adapt, build upon this work non-commercially, and license their derivative works on different terms, provided the original work is properly cited and the use is non-commercial. See: http://creativecommons.org/licenses/by-nc/4.0/
spellingShingle Health Informatics
Lin, Tong
Liu, Tiebing
Lin, Yucheng
Zhang, Chaoting
Yan, Lailai
Chen, Zhongxue
He, Zhonghu
Wang, Jingyu
Serum levels of chemical elements in esophageal squamous cell carcinoma in Anyang, China: a case-control study based on machine learning methods
title Serum levels of chemical elements in esophageal squamous cell carcinoma in Anyang, China: a case-control study based on machine learning methods
title_full Serum levels of chemical elements in esophageal squamous cell carcinoma in Anyang, China: a case-control study based on machine learning methods
title_fullStr Serum levels of chemical elements in esophageal squamous cell carcinoma in Anyang, China: a case-control study based on machine learning methods
title_full_unstemmed Serum levels of chemical elements in esophageal squamous cell carcinoma in Anyang, China: a case-control study based on machine learning methods
title_short Serum levels of chemical elements in esophageal squamous cell carcinoma in Anyang, China: a case-control study based on machine learning methods
title_sort serum levels of chemical elements in esophageal squamous cell carcinoma in anyang, china: a case-control study based on machine learning methods
topic Health Informatics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5623487/
https://www.ncbi.nlm.nih.gov/pubmed/28947442
http://dx.doi.org/10.1136/bmjopen-2016-015443
work_keys_str_mv AT lintong serumlevelsofchemicalelementsinesophagealsquamouscellcarcinomainanyangchinaacasecontrolstudybasedonmachinelearningmethods
AT liutiebing serumlevelsofchemicalelementsinesophagealsquamouscellcarcinomainanyangchinaacasecontrolstudybasedonmachinelearningmethods
AT linyucheng serumlevelsofchemicalelementsinesophagealsquamouscellcarcinomainanyangchinaacasecontrolstudybasedonmachinelearningmethods
AT zhangchaoting serumlevelsofchemicalelementsinesophagealsquamouscellcarcinomainanyangchinaacasecontrolstudybasedonmachinelearningmethods
AT yanlailai serumlevelsofchemicalelementsinesophagealsquamouscellcarcinomainanyangchinaacasecontrolstudybasedonmachinelearningmethods
AT chenzhongxue serumlevelsofchemicalelementsinesophagealsquamouscellcarcinomainanyangchinaacasecontrolstudybasedonmachinelearningmethods
AT hezhonghu serumlevelsofchemicalelementsinesophagealsquamouscellcarcinomainanyangchinaacasecontrolstudybasedonmachinelearningmethods
AT wangjingyu serumlevelsofchemicalelementsinesophagealsquamouscellcarcinomainanyangchinaacasecontrolstudybasedonmachinelearningmethods