Cargando…

Learning from small medical data—robust semi-supervised cancer prognosis classifier with Bayesian variational autoencoder

MOTIVATION: Cancer is one of the world’s leading mortality causes, and its prognosis is hard to predict due to complicated biological interactions among heterogeneous data types. Numerous challenges, such as censorship, high dimensionality and small sample size, prevent researchers from using deep l...

Descripción completa

Detalles Bibliográficos
Autores principales: Hsu, Te-Cheng, Lin, Che
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9832968/
https://www.ncbi.nlm.nih.gov/pubmed/36698767
http://dx.doi.org/10.1093/bioadv/vbac100
_version_ 1784868167013629952
author Hsu, Te-Cheng
Lin, Che
author_facet Hsu, Te-Cheng
Lin, Che
author_sort Hsu, Te-Cheng
collection PubMed
description MOTIVATION: Cancer is one of the world’s leading mortality causes, and its prognosis is hard to predict due to complicated biological interactions among heterogeneous data types. Numerous challenges, such as censorship, high dimensionality and small sample size, prevent researchers from using deep learning models for precise prediction. RESULTS: We propose a robust Semi-supervised Cancer prognosis classifier with bAyesian variational autoeNcoder (SCAN) as a structured machine-learning framework for cancer prognosis prediction. SCAN incorporates semi-supervised learning for predicting 5-year disease-specific survival and overall survival in breast and non-small cell lung cancer (NSCLC) patients, respectively. SCAN achieved significantly better AUROC scores than all existing benchmarks (81.73% for breast cancer; 80.46% for NSCLC), including our previously proposed bimodal neural network classifiers (77.71% for breast cancer; 78.67% for NSCLC). Independent validation results showed that SCAN still achieved better AUROC scores (74.74% for breast; 72.80% for NSCLC) than the bimodal neural network classifiers (64.13% for breast; 67.07% for NSCLC). SCAN is general and can potentially be trained on more patient data. This paves the foundation for personalized medicine for early cancer risk screening. AVAILABILITY AND IMPLEMENTATION: The source codes reproducing the main results are available on GitHub: https://gitfront.io/r/user-4316673/36e8714573f3fbfa0b24690af5d1a9d5ca159cf4/scan/. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics Advances online.
format Online
Article
Text
id pubmed-9832968
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-98329682023-01-24 Learning from small medical data—robust semi-supervised cancer prognosis classifier with Bayesian variational autoencoder Hsu, Te-Cheng Lin, Che Bioinform Adv Original Paper MOTIVATION: Cancer is one of the world’s leading mortality causes, and its prognosis is hard to predict due to complicated biological interactions among heterogeneous data types. Numerous challenges, such as censorship, high dimensionality and small sample size, prevent researchers from using deep learning models for precise prediction. RESULTS: We propose a robust Semi-supervised Cancer prognosis classifier with bAyesian variational autoeNcoder (SCAN) as a structured machine-learning framework for cancer prognosis prediction. SCAN incorporates semi-supervised learning for predicting 5-year disease-specific survival and overall survival in breast and non-small cell lung cancer (NSCLC) patients, respectively. SCAN achieved significantly better AUROC scores than all existing benchmarks (81.73% for breast cancer; 80.46% for NSCLC), including our previously proposed bimodal neural network classifiers (77.71% for breast cancer; 78.67% for NSCLC). Independent validation results showed that SCAN still achieved better AUROC scores (74.74% for breast; 72.80% for NSCLC) than the bimodal neural network classifiers (64.13% for breast; 67.07% for NSCLC). SCAN is general and can potentially be trained on more patient data. This paves the foundation for personalized medicine for early cancer risk screening. AVAILABILITY AND IMPLEMENTATION: The source codes reproducing the main results are available on GitHub: https://gitfront.io/r/user-4316673/36e8714573f3fbfa0b24690af5d1a9d5ca159cf4/scan/. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics Advances online. Oxford University Press 2023-01-09 /pmc/articles/PMC9832968/ /pubmed/36698767 http://dx.doi.org/10.1093/bioadv/vbac100 Text en © The Author(s) 2023. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Paper
Hsu, Te-Cheng
Lin, Che
Learning from small medical data—robust semi-supervised cancer prognosis classifier with Bayesian variational autoencoder
title Learning from small medical data—robust semi-supervised cancer prognosis classifier with Bayesian variational autoencoder
title_full Learning from small medical data—robust semi-supervised cancer prognosis classifier with Bayesian variational autoencoder
title_fullStr Learning from small medical data—robust semi-supervised cancer prognosis classifier with Bayesian variational autoencoder
title_full_unstemmed Learning from small medical data—robust semi-supervised cancer prognosis classifier with Bayesian variational autoencoder
title_short Learning from small medical data—robust semi-supervised cancer prognosis classifier with Bayesian variational autoencoder
title_sort learning from small medical data—robust semi-supervised cancer prognosis classifier with bayesian variational autoencoder
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9832968/
https://www.ncbi.nlm.nih.gov/pubmed/36698767
http://dx.doi.org/10.1093/bioadv/vbac100
work_keys_str_mv AT hsutecheng learningfromsmallmedicaldatarobustsemisupervisedcancerprognosisclassifierwithbayesianvariationalautoencoder
AT linche learningfromsmallmedicaldatarobustsemisupervisedcancerprognosisclassifierwithbayesianvariationalautoencoder