Cargando…
Learning from small medical data—robust semi-supervised cancer prognosis classifier with Bayesian variational autoencoder
MOTIVATION: Cancer is one of the world’s leading mortality causes, and its prognosis is hard to predict due to complicated biological interactions among heterogeneous data types. Numerous challenges, such as censorship, high dimensionality and small sample size, prevent researchers from using deep l...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9832968/ https://www.ncbi.nlm.nih.gov/pubmed/36698767 http://dx.doi.org/10.1093/bioadv/vbac100 |
_version_ | 1784868167013629952 |
---|---|
author | Hsu, Te-Cheng Lin, Che |
author_facet | Hsu, Te-Cheng Lin, Che |
author_sort | Hsu, Te-Cheng |
collection | PubMed |
description | MOTIVATION: Cancer is one of the world’s leading mortality causes, and its prognosis is hard to predict due to complicated biological interactions among heterogeneous data types. Numerous challenges, such as censorship, high dimensionality and small sample size, prevent researchers from using deep learning models for precise prediction. RESULTS: We propose a robust Semi-supervised Cancer prognosis classifier with bAyesian variational autoeNcoder (SCAN) as a structured machine-learning framework for cancer prognosis prediction. SCAN incorporates semi-supervised learning for predicting 5-year disease-specific survival and overall survival in breast and non-small cell lung cancer (NSCLC) patients, respectively. SCAN achieved significantly better AUROC scores than all existing benchmarks (81.73% for breast cancer; 80.46% for NSCLC), including our previously proposed bimodal neural network classifiers (77.71% for breast cancer; 78.67% for NSCLC). Independent validation results showed that SCAN still achieved better AUROC scores (74.74% for breast; 72.80% for NSCLC) than the bimodal neural network classifiers (64.13% for breast; 67.07% for NSCLC). SCAN is general and can potentially be trained on more patient data. This paves the foundation for personalized medicine for early cancer risk screening. AVAILABILITY AND IMPLEMENTATION: The source codes reproducing the main results are available on GitHub: https://gitfront.io/r/user-4316673/36e8714573f3fbfa0b24690af5d1a9d5ca159cf4/scan/. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics Advances online. |
format | Online Article Text |
id | pubmed-9832968 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-98329682023-01-24 Learning from small medical data—robust semi-supervised cancer prognosis classifier with Bayesian variational autoencoder Hsu, Te-Cheng Lin, Che Bioinform Adv Original Paper MOTIVATION: Cancer is one of the world’s leading mortality causes, and its prognosis is hard to predict due to complicated biological interactions among heterogeneous data types. Numerous challenges, such as censorship, high dimensionality and small sample size, prevent researchers from using deep learning models for precise prediction. RESULTS: We propose a robust Semi-supervised Cancer prognosis classifier with bAyesian variational autoeNcoder (SCAN) as a structured machine-learning framework for cancer prognosis prediction. SCAN incorporates semi-supervised learning for predicting 5-year disease-specific survival and overall survival in breast and non-small cell lung cancer (NSCLC) patients, respectively. SCAN achieved significantly better AUROC scores than all existing benchmarks (81.73% for breast cancer; 80.46% for NSCLC), including our previously proposed bimodal neural network classifiers (77.71% for breast cancer; 78.67% for NSCLC). Independent validation results showed that SCAN still achieved better AUROC scores (74.74% for breast; 72.80% for NSCLC) than the bimodal neural network classifiers (64.13% for breast; 67.07% for NSCLC). SCAN is general and can potentially be trained on more patient data. This paves the foundation for personalized medicine for early cancer risk screening. AVAILABILITY AND IMPLEMENTATION: The source codes reproducing the main results are available on GitHub: https://gitfront.io/r/user-4316673/36e8714573f3fbfa0b24690af5d1a9d5ca159cf4/scan/. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics Advances online. Oxford University Press 2023-01-09 /pmc/articles/PMC9832968/ /pubmed/36698767 http://dx.doi.org/10.1093/bioadv/vbac100 Text en © The Author(s) 2023. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Original Paper Hsu, Te-Cheng Lin, Che Learning from small medical data—robust semi-supervised cancer prognosis classifier with Bayesian variational autoencoder |
title | Learning from small medical data—robust semi-supervised cancer prognosis classifier with Bayesian variational autoencoder |
title_full | Learning from small medical data—robust semi-supervised cancer prognosis classifier with Bayesian variational autoencoder |
title_fullStr | Learning from small medical data—robust semi-supervised cancer prognosis classifier with Bayesian variational autoencoder |
title_full_unstemmed | Learning from small medical data—robust semi-supervised cancer prognosis classifier with Bayesian variational autoencoder |
title_short | Learning from small medical data—robust semi-supervised cancer prognosis classifier with Bayesian variational autoencoder |
title_sort | learning from small medical data—robust semi-supervised cancer prognosis classifier with bayesian variational autoencoder |
topic | Original Paper |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9832968/ https://www.ncbi.nlm.nih.gov/pubmed/36698767 http://dx.doi.org/10.1093/bioadv/vbac100 |
work_keys_str_mv | AT hsutecheng learningfromsmallmedicaldatarobustsemisupervisedcancerprognosisclassifierwithbayesianvariationalautoencoder AT linche learningfromsmallmedicaldatarobustsemisupervisedcancerprognosisclassifierwithbayesianvariationalautoencoder |