Cargando…
PreCanCell: An ensemble learning algorithm for predicting cancer and non-cancer cells from single-cell transcriptomes
We propose PreCanCell, a novel algorithm for predicting malignant and non-malignant cells from single-cell transcriptomes. PreCanCell first identifies the differentially expressed genes (DEGs) between malignant and non-malignant cells commonly in five common cancer types-associated single-cell trans...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Research Network of Computational and Structural Biotechnology
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10371765/ https://www.ncbi.nlm.nih.gov/pubmed/37501705 http://dx.doi.org/10.1016/j.csbj.2023.07.009 |
_version_ | 1785078221671235584 |
---|---|
author | Yang, Tao Yan, Qiyu Long, Rongzhuo Liu, Zhixian Wang, Xiaosheng |
author_facet | Yang, Tao Yan, Qiyu Long, Rongzhuo Liu, Zhixian Wang, Xiaosheng |
author_sort | Yang, Tao |
collection | PubMed |
description | We propose PreCanCell, a novel algorithm for predicting malignant and non-malignant cells from single-cell transcriptomes. PreCanCell first identifies the differentially expressed genes (DEGs) between malignant and non-malignant cells commonly in five common cancer types-associated single-cell transcriptome datasets. The five common cancer types include renal cell carcinoma (RCC), head and neck squamous cell carcinoma (HNSCC), melanoma, lung adenocarcinoma (LUAD), and breast cancer (BC). With each of the five datasets as the training set and the DEGs as the features, a single cell is classified as malignant or non-malignant by k-NN (k = 5). Finally, the single cell is determined as malignant or non-malignant by the majority vote of the five k-NN classification results. We tested the predictive performance of PreCanCell in 19 single-cell datasets, and reported classification accuracy, sensitivity, specificity, balanced accuracy (the average of sensitivity and specificity) and the area under the receiver operating characteristic curve (AUROC). In all these datasets, PreCanCell achieved above 0.8 accuracy, sensitivity, specificity, balanced accuracy and AUROC. Finally, we compared the predictive performance of PreCanCell with that of seven other algorithms, including CHETAH, SciBet, SCINA, scmap-cell, scmap-cluster, SingleR, and ikarus. Compared to these algorithms, PreCanCell displays the advantages of higher accuracy and simpler implementation. We have developed an R package for the PreCanCell algorithm, which is available at https://github.com/WangX-Lab/PreCanCell. |
format | Online Article Text |
id | pubmed-10371765 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Research Network of Computational and Structural Biotechnology |
record_format | MEDLINE/PubMed |
spelling | pubmed-103717652023-07-27 PreCanCell: An ensemble learning algorithm for predicting cancer and non-cancer cells from single-cell transcriptomes Yang, Tao Yan, Qiyu Long, Rongzhuo Liu, Zhixian Wang, Xiaosheng Comput Struct Biotechnol J Research Article We propose PreCanCell, a novel algorithm for predicting malignant and non-malignant cells from single-cell transcriptomes. PreCanCell first identifies the differentially expressed genes (DEGs) between malignant and non-malignant cells commonly in five common cancer types-associated single-cell transcriptome datasets. The five common cancer types include renal cell carcinoma (RCC), head and neck squamous cell carcinoma (HNSCC), melanoma, lung adenocarcinoma (LUAD), and breast cancer (BC). With each of the five datasets as the training set and the DEGs as the features, a single cell is classified as malignant or non-malignant by k-NN (k = 5). Finally, the single cell is determined as malignant or non-malignant by the majority vote of the five k-NN classification results. We tested the predictive performance of PreCanCell in 19 single-cell datasets, and reported classification accuracy, sensitivity, specificity, balanced accuracy (the average of sensitivity and specificity) and the area under the receiver operating characteristic curve (AUROC). In all these datasets, PreCanCell achieved above 0.8 accuracy, sensitivity, specificity, balanced accuracy and AUROC. Finally, we compared the predictive performance of PreCanCell with that of seven other algorithms, including CHETAH, SciBet, SCINA, scmap-cell, scmap-cluster, SingleR, and ikarus. Compared to these algorithms, PreCanCell displays the advantages of higher accuracy and simpler implementation. We have developed an R package for the PreCanCell algorithm, which is available at https://github.com/WangX-Lab/PreCanCell. Research Network of Computational and Structural Biotechnology 2023-07-11 /pmc/articles/PMC10371765/ /pubmed/37501705 http://dx.doi.org/10.1016/j.csbj.2023.07.009 Text en © 2023 The Authors https://creativecommons.org/licenses/by/4.0/This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Research Article Yang, Tao Yan, Qiyu Long, Rongzhuo Liu, Zhixian Wang, Xiaosheng PreCanCell: An ensemble learning algorithm for predicting cancer and non-cancer cells from single-cell transcriptomes |
title | PreCanCell: An ensemble learning algorithm for predicting cancer and non-cancer cells from single-cell transcriptomes |
title_full | PreCanCell: An ensemble learning algorithm for predicting cancer and non-cancer cells from single-cell transcriptomes |
title_fullStr | PreCanCell: An ensemble learning algorithm for predicting cancer and non-cancer cells from single-cell transcriptomes |
title_full_unstemmed | PreCanCell: An ensemble learning algorithm for predicting cancer and non-cancer cells from single-cell transcriptomes |
title_short | PreCanCell: An ensemble learning algorithm for predicting cancer and non-cancer cells from single-cell transcriptomes |
title_sort | precancell: an ensemble learning algorithm for predicting cancer and non-cancer cells from single-cell transcriptomes |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10371765/ https://www.ncbi.nlm.nih.gov/pubmed/37501705 http://dx.doi.org/10.1016/j.csbj.2023.07.009 |
work_keys_str_mv | AT yangtao precancellanensemblelearningalgorithmforpredictingcancerandnoncancercellsfromsinglecelltranscriptomes AT yanqiyu precancellanensemblelearningalgorithmforpredictingcancerandnoncancercellsfromsinglecelltranscriptomes AT longrongzhuo precancellanensemblelearningalgorithmforpredictingcancerandnoncancercellsfromsinglecelltranscriptomes AT liuzhixian precancellanensemblelearningalgorithmforpredictingcancerandnoncancercellsfromsinglecelltranscriptomes AT wangxiaosheng precancellanensemblelearningalgorithmforpredictingcancerandnoncancercellsfromsinglecelltranscriptomes |