Cargando…

Improving differential diagnosis of pulmonary large cell neuroendocrine carcinoma and small cell lung cancer via a transcriptomic, biological pathway-based machine learning model

BACKGROUND: Accurately differentiating between pulmonary large cell neuroendocrine carcinomas (LCNEC) and small cell lung cancer (SCLC) is crucial to make appropriate therapeutic decisions. Here, a classifier was constructed based on transcriptome data to improve the diagnostic accuracy for LCNEC an...

Descripción completa

Detalles Bibliográficos
Autores principales: Guo, Junhong, Hou, Likun, Zhang, Wei, Dong, Zhengwei, Zhang, Lei, Wu, Chunyan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Neoplasia Press 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8450252/
https://www.ncbi.nlm.nih.gov/pubmed/34530194
http://dx.doi.org/10.1016/j.tranon.2021.101222
_version_ 1784569598423597056
author Guo, Junhong
Hou, Likun
Zhang, Wei
Dong, Zhengwei
Zhang, Lei
Wu, Chunyan
author_facet Guo, Junhong
Hou, Likun
Zhang, Wei
Dong, Zhengwei
Zhang, Lei
Wu, Chunyan
author_sort Guo, Junhong
collection PubMed
description BACKGROUND: Accurately differentiating between pulmonary large cell neuroendocrine carcinomas (LCNEC) and small cell lung cancer (SCLC) is crucial to make appropriate therapeutic decisions. Here, a classifier was constructed based on transcriptome data to improve the diagnostic accuracy for LCNEC and SCLC. METHODS: 13,959 genes mapped to 186 Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways were included. Gene Set Variation Analysis (GSVA) algorithm was used to enrich and score each KEGG pathway from RNA-sequencing data of each sample. A prediction model based on GSVA score was constructed and trained via ridge regression based on RNA-sequencing datasets from 3 published studies. It was validated by another independent RNA-sequencing dataset. Clinical feasibility was tested by comparing model predicated result using RNA-sequencing data derived from hard-to-diagnose samples of lung neuroendocrine cancer to conventional histology-based diagnosis. RESULTS: This model achieved a ROC-AUC of 0.949 and a concordance rate of 0.75 for the entire prediction efficiency. Of the 27 borderline samples, 17/27 (63.0%) were predicted as LCNEC, 7/27 were predicted as SCLC, and the remainder was NSCLC. Only 8 cases (29.6%) with LCNEC were diagnosed by pathologists, which was significantly lower than the results predicted by the model. Furthermore, cases with predicted LCNEC by the model had a significant longer disease-free survival than those where the model predicted SCLC (P = 0.0043). CONCLUSION: This model was able to give an accurate prediction of LCNEC and SCLC. It may assist clinicians to make the optimal decision for patients with pulmonary neuroendocrine tumors in choosing appropriate treatment.
format Online
Article
Text
id pubmed-8450252
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Neoplasia Press
record_format MEDLINE/PubMed
spelling pubmed-84502522021-10-01 Improving differential diagnosis of pulmonary large cell neuroendocrine carcinoma and small cell lung cancer via a transcriptomic, biological pathway-based machine learning model Guo, Junhong Hou, Likun Zhang, Wei Dong, Zhengwei Zhang, Lei Wu, Chunyan Transl Oncol Original Research BACKGROUND: Accurately differentiating between pulmonary large cell neuroendocrine carcinomas (LCNEC) and small cell lung cancer (SCLC) is crucial to make appropriate therapeutic decisions. Here, a classifier was constructed based on transcriptome data to improve the diagnostic accuracy for LCNEC and SCLC. METHODS: 13,959 genes mapped to 186 Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways were included. Gene Set Variation Analysis (GSVA) algorithm was used to enrich and score each KEGG pathway from RNA-sequencing data of each sample. A prediction model based on GSVA score was constructed and trained via ridge regression based on RNA-sequencing datasets from 3 published studies. It was validated by another independent RNA-sequencing dataset. Clinical feasibility was tested by comparing model predicated result using RNA-sequencing data derived from hard-to-diagnose samples of lung neuroendocrine cancer to conventional histology-based diagnosis. RESULTS: This model achieved a ROC-AUC of 0.949 and a concordance rate of 0.75 for the entire prediction efficiency. Of the 27 borderline samples, 17/27 (63.0%) were predicted as LCNEC, 7/27 were predicted as SCLC, and the remainder was NSCLC. Only 8 cases (29.6%) with LCNEC were diagnosed by pathologists, which was significantly lower than the results predicted by the model. Furthermore, cases with predicted LCNEC by the model had a significant longer disease-free survival than those where the model predicted SCLC (P = 0.0043). CONCLUSION: This model was able to give an accurate prediction of LCNEC and SCLC. It may assist clinicians to make the optimal decision for patients with pulmonary neuroendocrine tumors in choosing appropriate treatment. Neoplasia Press 2021-09-14 /pmc/articles/PMC8450252/ /pubmed/34530194 http://dx.doi.org/10.1016/j.tranon.2021.101222 Text en © 2021 The Authors. Published by Elsevier Inc. https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Original Research
Guo, Junhong
Hou, Likun
Zhang, Wei
Dong, Zhengwei
Zhang, Lei
Wu, Chunyan
Improving differential diagnosis of pulmonary large cell neuroendocrine carcinoma and small cell lung cancer via a transcriptomic, biological pathway-based machine learning model
title Improving differential diagnosis of pulmonary large cell neuroendocrine carcinoma and small cell lung cancer via a transcriptomic, biological pathway-based machine learning model
title_full Improving differential diagnosis of pulmonary large cell neuroendocrine carcinoma and small cell lung cancer via a transcriptomic, biological pathway-based machine learning model
title_fullStr Improving differential diagnosis of pulmonary large cell neuroendocrine carcinoma and small cell lung cancer via a transcriptomic, biological pathway-based machine learning model
title_full_unstemmed Improving differential diagnosis of pulmonary large cell neuroendocrine carcinoma and small cell lung cancer via a transcriptomic, biological pathway-based machine learning model
title_short Improving differential diagnosis of pulmonary large cell neuroendocrine carcinoma and small cell lung cancer via a transcriptomic, biological pathway-based machine learning model
title_sort improving differential diagnosis of pulmonary large cell neuroendocrine carcinoma and small cell lung cancer via a transcriptomic, biological pathway-based machine learning model
topic Original Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8450252/
https://www.ncbi.nlm.nih.gov/pubmed/34530194
http://dx.doi.org/10.1016/j.tranon.2021.101222
work_keys_str_mv AT guojunhong improvingdifferentialdiagnosisofpulmonarylargecellneuroendocrinecarcinomaandsmallcelllungcancerviaatranscriptomicbiologicalpathwaybasedmachinelearningmodel
AT houlikun improvingdifferentialdiagnosisofpulmonarylargecellneuroendocrinecarcinomaandsmallcelllungcancerviaatranscriptomicbiologicalpathwaybasedmachinelearningmodel
AT zhangwei improvingdifferentialdiagnosisofpulmonarylargecellneuroendocrinecarcinomaandsmallcelllungcancerviaatranscriptomicbiologicalpathwaybasedmachinelearningmodel
AT dongzhengwei improvingdifferentialdiagnosisofpulmonarylargecellneuroendocrinecarcinomaandsmallcelllungcancerviaatranscriptomicbiologicalpathwaybasedmachinelearningmodel
AT zhanglei improvingdifferentialdiagnosisofpulmonarylargecellneuroendocrinecarcinomaandsmallcelllungcancerviaatranscriptomicbiologicalpathwaybasedmachinelearningmodel
AT wuchunyan improvingdifferentialdiagnosisofpulmonarylargecellneuroendocrinecarcinomaandsmallcelllungcancerviaatranscriptomicbiologicalpathwaybasedmachinelearningmodel