Cargando…
PLUS: Predicting cancer metastasis potential based on positive and unlabeled learning
Metastatic cancer accounts for over 90% of all cancer deaths, and evaluations of metastasis potential are vital for minimizing the metastasis-associated mortality and achieving optimal clinical decision-making. Computational assessment of metastasis potential based on large-scale transcriptomic canc...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8992993/ https://www.ncbi.nlm.nih.gov/pubmed/35349572 http://dx.doi.org/10.1371/journal.pcbi.1009956 |
_version_ | 1784683820152258560 |
---|---|
author | Zhou, Junyi Lu, Xiaoyu Chang, Wennan Wan, Changlin Lu, Xiongbin Zhang, Chi Cao, Sha |
author_facet | Zhou, Junyi Lu, Xiaoyu Chang, Wennan Wan, Changlin Lu, Xiongbin Zhang, Chi Cao, Sha |
author_sort | Zhou, Junyi |
collection | PubMed |
description | Metastatic cancer accounts for over 90% of all cancer deaths, and evaluations of metastasis potential are vital for minimizing the metastasis-associated mortality and achieving optimal clinical decision-making. Computational assessment of metastasis potential based on large-scale transcriptomic cancer data is challenging because metastasis events are not always clinically detectable. The under-diagnosis of metastasis events results in biased classification labels, and classification tools using biased labels may lead to inaccurate estimations of metastasis potential. This issue is further complicated by the unknown metastasis prevalence at the population level, the small number of confirmed metastasis cases, and the high dimensionality of the candidate molecular features. Our proposed algorithm, called Positive and unlabeled Learning from Unbalanced cases and Sparse structures (PLUS), is the first to use a positive and unlabeled learning framework to account for the under-detection of metastasis events in building a classifier. PLUS is specifically tailored for studying metastasis that deals with the unbalanced instance allocation as well as unknown metastasis prevalence, which are not considered by other methods. PLUS achieves superior performance on synthetic datasets compared with other state-of-the-art methods. Application of PLUS to The Cancer Genome Atlas Pan-Cancer gene expression data generated metastasis potential predictions that show good agreement with the clinical follow-up data, in addition to predictive genes that have been validated by independent single-cell RNA-sequencing datasets. |
format | Online Article Text |
id | pubmed-8992993 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-89929932022-04-09 PLUS: Predicting cancer metastasis potential based on positive and unlabeled learning Zhou, Junyi Lu, Xiaoyu Chang, Wennan Wan, Changlin Lu, Xiongbin Zhang, Chi Cao, Sha PLoS Comput Biol Research Article Metastatic cancer accounts for over 90% of all cancer deaths, and evaluations of metastasis potential are vital for minimizing the metastasis-associated mortality and achieving optimal clinical decision-making. Computational assessment of metastasis potential based on large-scale transcriptomic cancer data is challenging because metastasis events are not always clinically detectable. The under-diagnosis of metastasis events results in biased classification labels, and classification tools using biased labels may lead to inaccurate estimations of metastasis potential. This issue is further complicated by the unknown metastasis prevalence at the population level, the small number of confirmed metastasis cases, and the high dimensionality of the candidate molecular features. Our proposed algorithm, called Positive and unlabeled Learning from Unbalanced cases and Sparse structures (PLUS), is the first to use a positive and unlabeled learning framework to account for the under-detection of metastasis events in building a classifier. PLUS is specifically tailored for studying metastasis that deals with the unbalanced instance allocation as well as unknown metastasis prevalence, which are not considered by other methods. PLUS achieves superior performance on synthetic datasets compared with other state-of-the-art methods. Application of PLUS to The Cancer Genome Atlas Pan-Cancer gene expression data generated metastasis potential predictions that show good agreement with the clinical follow-up data, in addition to predictive genes that have been validated by independent single-cell RNA-sequencing datasets. Public Library of Science 2022-03-29 /pmc/articles/PMC8992993/ /pubmed/35349572 http://dx.doi.org/10.1371/journal.pcbi.1009956 Text en © 2022 Zhou et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Zhou, Junyi Lu, Xiaoyu Chang, Wennan Wan, Changlin Lu, Xiongbin Zhang, Chi Cao, Sha PLUS: Predicting cancer metastasis potential based on positive and unlabeled learning |
title | PLUS: Predicting cancer metastasis potential based on positive and unlabeled learning |
title_full | PLUS: Predicting cancer metastasis potential based on positive and unlabeled learning |
title_fullStr | PLUS: Predicting cancer metastasis potential based on positive and unlabeled learning |
title_full_unstemmed | PLUS: Predicting cancer metastasis potential based on positive and unlabeled learning |
title_short | PLUS: Predicting cancer metastasis potential based on positive and unlabeled learning |
title_sort | plus: predicting cancer metastasis potential based on positive and unlabeled learning |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8992993/ https://www.ncbi.nlm.nih.gov/pubmed/35349572 http://dx.doi.org/10.1371/journal.pcbi.1009956 |
work_keys_str_mv | AT zhoujunyi pluspredictingcancermetastasispotentialbasedonpositiveandunlabeledlearning AT luxiaoyu pluspredictingcancermetastasispotentialbasedonpositiveandunlabeledlearning AT changwennan pluspredictingcancermetastasispotentialbasedonpositiveandunlabeledlearning AT wanchanglin pluspredictingcancermetastasispotentialbasedonpositiveandunlabeledlearning AT luxiongbin pluspredictingcancermetastasispotentialbasedonpositiveandunlabeledlearning AT zhangchi pluspredictingcancermetastasispotentialbasedonpositiveandunlabeledlearning AT caosha pluspredictingcancermetastasispotentialbasedonpositiveandunlabeledlearning |