Cargando…

Development and Evaluation of Novel Statistical Methods in Urine Biomarker-Based Hepatocellular Carcinoma Screening

Hepatocellular carcinoma is one of the fastest growing cancers in the US and has a low survival rate, partly due to difficulties in early detection. Because of HCC’s high heterogeneity, it has been suggested that multiple biomarkers would be needed to develop a sensitive HCC screening test. This stu...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Jeremy, Jain, Surbhi, Chen, Dion, Song, Wei, Hu, Chi-Tan, Su, Ying-Hsiu
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5830457/
https://www.ncbi.nlm.nih.gov/pubmed/29491388
http://dx.doi.org/10.1038/s41598-018-21922-9
_version_ 1783302999578247168
author Wang, Jeremy
Jain, Surbhi
Chen, Dion
Song, Wei
Hu, Chi-Tan
Su, Ying-Hsiu
author_facet Wang, Jeremy
Jain, Surbhi
Chen, Dion
Song, Wei
Hu, Chi-Tan
Su, Ying-Hsiu
author_sort Wang, Jeremy
collection PubMed
description Hepatocellular carcinoma is one of the fastest growing cancers in the US and has a low survival rate, partly due to difficulties in early detection. Because of HCC’s high heterogeneity, it has been suggested that multiple biomarkers would be needed to develop a sensitive HCC screening test. This study applied random forest (RF), a machine learning technique, and proposed two novel models, fixed sequential (FS) and two-step (TS), for comparison with two commonly used statistical techniques, logistic regression (LR) and classification and regression trees (CART), in combining multiple urine DNA biomarkers for HCC screening using biomarker values obtained from 137 HCC and 431 non-HCC (224 hepatitis and 207 cirrhosis) subjects. The sensitivity, specificity, area under the receiver operating curve, and variability were estimated through repeated 10-fold cross-validation to compare the models’ performances in accuracy and robustness. We show that RF and TS have higher accuracy and stability; specifically, they reach 90% specificity and 86%/87% sensitivity respectively along with 15% higher sensitivity and 10% higher specificity than LR in cross-validation. The potential of RF and TS to develop a panel of multiple biomarkers and the possibility for self-training, cloud-based models for HCC screening are discussed.
format Online
Article
Text
id pubmed-5830457
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-58304572018-03-05 Development and Evaluation of Novel Statistical Methods in Urine Biomarker-Based Hepatocellular Carcinoma Screening Wang, Jeremy Jain, Surbhi Chen, Dion Song, Wei Hu, Chi-Tan Su, Ying-Hsiu Sci Rep Article Hepatocellular carcinoma is one of the fastest growing cancers in the US and has a low survival rate, partly due to difficulties in early detection. Because of HCC’s high heterogeneity, it has been suggested that multiple biomarkers would be needed to develop a sensitive HCC screening test. This study applied random forest (RF), a machine learning technique, and proposed two novel models, fixed sequential (FS) and two-step (TS), for comparison with two commonly used statistical techniques, logistic regression (LR) and classification and regression trees (CART), in combining multiple urine DNA biomarkers for HCC screening using biomarker values obtained from 137 HCC and 431 non-HCC (224 hepatitis and 207 cirrhosis) subjects. The sensitivity, specificity, area under the receiver operating curve, and variability were estimated through repeated 10-fold cross-validation to compare the models’ performances in accuracy and robustness. We show that RF and TS have higher accuracy and stability; specifically, they reach 90% specificity and 86%/87% sensitivity respectively along with 15% higher sensitivity and 10% higher specificity than LR in cross-validation. The potential of RF and TS to develop a panel of multiple biomarkers and the possibility for self-training, cloud-based models for HCC screening are discussed. Nature Publishing Group UK 2018-02-28 /pmc/articles/PMC5830457/ /pubmed/29491388 http://dx.doi.org/10.1038/s41598-018-21922-9 Text en © The Author(s) 2018 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Wang, Jeremy
Jain, Surbhi
Chen, Dion
Song, Wei
Hu, Chi-Tan
Su, Ying-Hsiu
Development and Evaluation of Novel Statistical Methods in Urine Biomarker-Based Hepatocellular Carcinoma Screening
title Development and Evaluation of Novel Statistical Methods in Urine Biomarker-Based Hepatocellular Carcinoma Screening
title_full Development and Evaluation of Novel Statistical Methods in Urine Biomarker-Based Hepatocellular Carcinoma Screening
title_fullStr Development and Evaluation of Novel Statistical Methods in Urine Biomarker-Based Hepatocellular Carcinoma Screening
title_full_unstemmed Development and Evaluation of Novel Statistical Methods in Urine Biomarker-Based Hepatocellular Carcinoma Screening
title_short Development and Evaluation of Novel Statistical Methods in Urine Biomarker-Based Hepatocellular Carcinoma Screening
title_sort development and evaluation of novel statistical methods in urine biomarker-based hepatocellular carcinoma screening
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5830457/
https://www.ncbi.nlm.nih.gov/pubmed/29491388
http://dx.doi.org/10.1038/s41598-018-21922-9
work_keys_str_mv AT wangjeremy developmentandevaluationofnovelstatisticalmethodsinurinebiomarkerbasedhepatocellularcarcinomascreening
AT jainsurbhi developmentandevaluationofnovelstatisticalmethodsinurinebiomarkerbasedhepatocellularcarcinomascreening
AT chendion developmentandevaluationofnovelstatisticalmethodsinurinebiomarkerbasedhepatocellularcarcinomascreening
AT songwei developmentandevaluationofnovelstatisticalmethodsinurinebiomarkerbasedhepatocellularcarcinomascreening
AT huchitan developmentandevaluationofnovelstatisticalmethodsinurinebiomarkerbasedhepatocellularcarcinomascreening
AT suyinghsiu developmentandevaluationofnovelstatisticalmethodsinurinebiomarkerbasedhepatocellularcarcinomascreening