Cargando…

PLUS: Predicting cancer metastasis potential based on positive and unlabeled learning

Metastatic cancer accounts for over 90% of all cancer deaths, and evaluations of metastasis potential are vital for minimizing the metastasis-associated mortality and achieving optimal clinical decision-making. Computational assessment of metastasis potential based on large-scale transcriptomic canc...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhou, Junyi, Lu, Xiaoyu, Chang, Wennan, Wan, Changlin, Lu, Xiongbin, Zhang, Chi, Cao, Sha
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8992993/
https://www.ncbi.nlm.nih.gov/pubmed/35349572
http://dx.doi.org/10.1371/journal.pcbi.1009956
_version_ 1784683820152258560
author Zhou, Junyi
Lu, Xiaoyu
Chang, Wennan
Wan, Changlin
Lu, Xiongbin
Zhang, Chi
Cao, Sha
author_facet Zhou, Junyi
Lu, Xiaoyu
Chang, Wennan
Wan, Changlin
Lu, Xiongbin
Zhang, Chi
Cao, Sha
author_sort Zhou, Junyi
collection PubMed
description Metastatic cancer accounts for over 90% of all cancer deaths, and evaluations of metastasis potential are vital for minimizing the metastasis-associated mortality and achieving optimal clinical decision-making. Computational assessment of metastasis potential based on large-scale transcriptomic cancer data is challenging because metastasis events are not always clinically detectable. The under-diagnosis of metastasis events results in biased classification labels, and classification tools using biased labels may lead to inaccurate estimations of metastasis potential. This issue is further complicated by the unknown metastasis prevalence at the population level, the small number of confirmed metastasis cases, and the high dimensionality of the candidate molecular features. Our proposed algorithm, called Positive and unlabeled Learning from Unbalanced cases and Sparse structures (PLUS), is the first to use a positive and unlabeled learning framework to account for the under-detection of metastasis events in building a classifier. PLUS is specifically tailored for studying metastasis that deals with the unbalanced instance allocation as well as unknown metastasis prevalence, which are not considered by other methods. PLUS achieves superior performance on synthetic datasets compared with other state-of-the-art methods. Application of PLUS to The Cancer Genome Atlas Pan-Cancer gene expression data generated metastasis potential predictions that show good agreement with the clinical follow-up data, in addition to predictive genes that have been validated by independent single-cell RNA-sequencing datasets.
format Online
Article
Text
id pubmed-8992993
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-89929932022-04-09 PLUS: Predicting cancer metastasis potential based on positive and unlabeled learning Zhou, Junyi Lu, Xiaoyu Chang, Wennan Wan, Changlin Lu, Xiongbin Zhang, Chi Cao, Sha PLoS Comput Biol Research Article Metastatic cancer accounts for over 90% of all cancer deaths, and evaluations of metastasis potential are vital for minimizing the metastasis-associated mortality and achieving optimal clinical decision-making. Computational assessment of metastasis potential based on large-scale transcriptomic cancer data is challenging because metastasis events are not always clinically detectable. The under-diagnosis of metastasis events results in biased classification labels, and classification tools using biased labels may lead to inaccurate estimations of metastasis potential. This issue is further complicated by the unknown metastasis prevalence at the population level, the small number of confirmed metastasis cases, and the high dimensionality of the candidate molecular features. Our proposed algorithm, called Positive and unlabeled Learning from Unbalanced cases and Sparse structures (PLUS), is the first to use a positive and unlabeled learning framework to account for the under-detection of metastasis events in building a classifier. PLUS is specifically tailored for studying metastasis that deals with the unbalanced instance allocation as well as unknown metastasis prevalence, which are not considered by other methods. PLUS achieves superior performance on synthetic datasets compared with other state-of-the-art methods. Application of PLUS to The Cancer Genome Atlas Pan-Cancer gene expression data generated metastasis potential predictions that show good agreement with the clinical follow-up data, in addition to predictive genes that have been validated by independent single-cell RNA-sequencing datasets. Public Library of Science 2022-03-29 /pmc/articles/PMC8992993/ /pubmed/35349572 http://dx.doi.org/10.1371/journal.pcbi.1009956 Text en © 2022 Zhou et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Zhou, Junyi
Lu, Xiaoyu
Chang, Wennan
Wan, Changlin
Lu, Xiongbin
Zhang, Chi
Cao, Sha
PLUS: Predicting cancer metastasis potential based on positive and unlabeled learning
title PLUS: Predicting cancer metastasis potential based on positive and unlabeled learning
title_full PLUS: Predicting cancer metastasis potential based on positive and unlabeled learning
title_fullStr PLUS: Predicting cancer metastasis potential based on positive and unlabeled learning
title_full_unstemmed PLUS: Predicting cancer metastasis potential based on positive and unlabeled learning
title_short PLUS: Predicting cancer metastasis potential based on positive and unlabeled learning
title_sort plus: predicting cancer metastasis potential based on positive and unlabeled learning
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8992993/
https://www.ncbi.nlm.nih.gov/pubmed/35349572
http://dx.doi.org/10.1371/journal.pcbi.1009956
work_keys_str_mv AT zhoujunyi pluspredictingcancermetastasispotentialbasedonpositiveandunlabeledlearning
AT luxiaoyu pluspredictingcancermetastasispotentialbasedonpositiveandunlabeledlearning
AT changwennan pluspredictingcancermetastasispotentialbasedonpositiveandunlabeledlearning
AT wanchanglin pluspredictingcancermetastasispotentialbasedonpositiveandunlabeledlearning
AT luxiongbin pluspredictingcancermetastasispotentialbasedonpositiveandunlabeledlearning
AT zhangchi pluspredictingcancermetastasispotentialbasedonpositiveandunlabeledlearning
AT caosha pluspredictingcancermetastasispotentialbasedonpositiveandunlabeledlearning