Cargando…

A transcriptome-Based Deep Neural Network Classifier for Identifying the Site of Origin in Mucinous Cancer

PURPOSE: There is a lack of tools for identifying the site of origin in mucinous cancer. This study aimed to evaluate the performance of a transcriptome-based classifier for identifying the site of origin in mucinous cancer. MATERIALS AND METHODS: Transcriptomic data of 1878 non-mucinous and 82 muci...

Descripción completa

Detalles Bibliográficos
Autores principales: Ahn, Taejin, Kim, Kidong, Kim, Hyojin, Kim, Sarah, Park, Sangick, Lee, Kyoungbun
Formato: Online Artículo Texto
Lenguaje:English
Publicado: SAGE Publications 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9669684/
https://www.ncbi.nlm.nih.gov/pubmed/36408331
http://dx.doi.org/10.1177/11769351221135141
_version_ 1784832173863337984
author Ahn, Taejin
Kim, Kidong
Kim, Hyojin
Kim, Sarah
Park, Sangick
Lee, Kyoungbun
author_facet Ahn, Taejin
Kim, Kidong
Kim, Hyojin
Kim, Sarah
Park, Sangick
Lee, Kyoungbun
author_sort Ahn, Taejin
collection PubMed
description PURPOSE: There is a lack of tools for identifying the site of origin in mucinous cancer. This study aimed to evaluate the performance of a transcriptome-based classifier for identifying the site of origin in mucinous cancer. MATERIALS AND METHODS: Transcriptomic data of 1878 non-mucinous and 82 mucinous cancer specimens, with 7 sites of origin, namely, the uterine cervix (CESC), colon (COAD), pancreas (PAAD), stomach (STAD), uterine endometrium (UCEC), uterine carcinosarcoma (UCS), and ovary (OV), obtained from The Cancer Genome Atlas, were used as the training and validation sets, respectively. Transcriptomic data of 14 mucinous cancer specimens from a tissue archive were used as the test set. For identifying the site of origin, a set of 100 differentially expressed genes for each site of origin was selected. After removing multiple iterations of the same gene, 427 genes were chosen, and their RNA expression profiles, at each site of origin, were used to train the deep neural network classifier. The performance of the classifier was estimated using the training, validation, and test sets. RESULTS: The accuracy of the model in the training set was 0.998, while that in the validation set was 0.939 (77/82). In the test set which is newly sequenced from a tissue archive, the model showed an accuracy of 0.857 (12/14). t-SNE analysis revealed that samples in the test set were part of the clusters obtained for the training set. CONCLUSION: Although limited by small sample size, we showed that a transcriptome-based classifier could correctly identify the site of origin of mucinous cancer.
format Online
Article
Text
id pubmed-9669684
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher SAGE Publications
record_format MEDLINE/PubMed
spelling pubmed-96696842022-11-18 A transcriptome-Based Deep Neural Network Classifier for Identifying the Site of Origin in Mucinous Cancer Ahn, Taejin Kim, Kidong Kim, Hyojin Kim, Sarah Park, Sangick Lee, Kyoungbun Cancer Inform Original Research PURPOSE: There is a lack of tools for identifying the site of origin in mucinous cancer. This study aimed to evaluate the performance of a transcriptome-based classifier for identifying the site of origin in mucinous cancer. MATERIALS AND METHODS: Transcriptomic data of 1878 non-mucinous and 82 mucinous cancer specimens, with 7 sites of origin, namely, the uterine cervix (CESC), colon (COAD), pancreas (PAAD), stomach (STAD), uterine endometrium (UCEC), uterine carcinosarcoma (UCS), and ovary (OV), obtained from The Cancer Genome Atlas, were used as the training and validation sets, respectively. Transcriptomic data of 14 mucinous cancer specimens from a tissue archive were used as the test set. For identifying the site of origin, a set of 100 differentially expressed genes for each site of origin was selected. After removing multiple iterations of the same gene, 427 genes were chosen, and their RNA expression profiles, at each site of origin, were used to train the deep neural network classifier. The performance of the classifier was estimated using the training, validation, and test sets. RESULTS: The accuracy of the model in the training set was 0.998, while that in the validation set was 0.939 (77/82). In the test set which is newly sequenced from a tissue archive, the model showed an accuracy of 0.857 (12/14). t-SNE analysis revealed that samples in the test set were part of the clusters obtained for the training set. CONCLUSION: Although limited by small sample size, we showed that a transcriptome-based classifier could correctly identify the site of origin of mucinous cancer. SAGE Publications 2022-11-15 /pmc/articles/PMC9669684/ /pubmed/36408331 http://dx.doi.org/10.1177/11769351221135141 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by-nc/4.0/This article is distributed under the terms of the Creative Commons Attribution-NonCommercial 4.0 License (https://creativecommons.org/licenses/by-nc/4.0/) which permits non-commercial use, reproduction and distribution of the work without further permission provided the original work is attributed as specified on the SAGE and Open Access page (https://us.sagepub.com/en-us/nam/open-access-at-sage).
spellingShingle Original Research
Ahn, Taejin
Kim, Kidong
Kim, Hyojin
Kim, Sarah
Park, Sangick
Lee, Kyoungbun
A transcriptome-Based Deep Neural Network Classifier for Identifying the Site of Origin in Mucinous Cancer
title A transcriptome-Based Deep Neural Network Classifier for Identifying the Site of Origin in Mucinous Cancer
title_full A transcriptome-Based Deep Neural Network Classifier for Identifying the Site of Origin in Mucinous Cancer
title_fullStr A transcriptome-Based Deep Neural Network Classifier for Identifying the Site of Origin in Mucinous Cancer
title_full_unstemmed A transcriptome-Based Deep Neural Network Classifier for Identifying the Site of Origin in Mucinous Cancer
title_short A transcriptome-Based Deep Neural Network Classifier for Identifying the Site of Origin in Mucinous Cancer
title_sort transcriptome-based deep neural network classifier for identifying the site of origin in mucinous cancer
topic Original Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9669684/
https://www.ncbi.nlm.nih.gov/pubmed/36408331
http://dx.doi.org/10.1177/11769351221135141
work_keys_str_mv AT ahntaejin atranscriptomebaseddeepneuralnetworkclassifierforidentifyingthesiteoforigininmucinouscancer
AT kimkidong atranscriptomebaseddeepneuralnetworkclassifierforidentifyingthesiteoforigininmucinouscancer
AT kimhyojin atranscriptomebaseddeepneuralnetworkclassifierforidentifyingthesiteoforigininmucinouscancer
AT kimsarah atranscriptomebaseddeepneuralnetworkclassifierforidentifyingthesiteoforigininmucinouscancer
AT parksangick atranscriptomebaseddeepneuralnetworkclassifierforidentifyingthesiteoforigininmucinouscancer
AT leekyoungbun atranscriptomebaseddeepneuralnetworkclassifierforidentifyingthesiteoforigininmucinouscancer
AT ahntaejin transcriptomebaseddeepneuralnetworkclassifierforidentifyingthesiteoforigininmucinouscancer
AT kimkidong transcriptomebaseddeepneuralnetworkclassifierforidentifyingthesiteoforigininmucinouscancer
AT kimhyojin transcriptomebaseddeepneuralnetworkclassifierforidentifyingthesiteoforigininmucinouscancer
AT kimsarah transcriptomebaseddeepneuralnetworkclassifierforidentifyingthesiteoforigininmucinouscancer
AT parksangick transcriptomebaseddeepneuralnetworkclassifierforidentifyingthesiteoforigininmucinouscancer
AT leekyoungbun transcriptomebaseddeepneuralnetworkclassifierforidentifyingthesiteoforigininmucinouscancer