Cargando…
A transcriptome-Based Deep Neural Network Classifier for Identifying the Site of Origin in Mucinous Cancer
PURPOSE: There is a lack of tools for identifying the site of origin in mucinous cancer. This study aimed to evaluate the performance of a transcriptome-based classifier for identifying the site of origin in mucinous cancer. MATERIALS AND METHODS: Transcriptomic data of 1878 non-mucinous and 82 muci...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
SAGE Publications
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9669684/ https://www.ncbi.nlm.nih.gov/pubmed/36408331 http://dx.doi.org/10.1177/11769351221135141 |
_version_ | 1784832173863337984 |
---|---|
author | Ahn, Taejin Kim, Kidong Kim, Hyojin Kim, Sarah Park, Sangick Lee, Kyoungbun |
author_facet | Ahn, Taejin Kim, Kidong Kim, Hyojin Kim, Sarah Park, Sangick Lee, Kyoungbun |
author_sort | Ahn, Taejin |
collection | PubMed |
description | PURPOSE: There is a lack of tools for identifying the site of origin in mucinous cancer. This study aimed to evaluate the performance of a transcriptome-based classifier for identifying the site of origin in mucinous cancer. MATERIALS AND METHODS: Transcriptomic data of 1878 non-mucinous and 82 mucinous cancer specimens, with 7 sites of origin, namely, the uterine cervix (CESC), colon (COAD), pancreas (PAAD), stomach (STAD), uterine endometrium (UCEC), uterine carcinosarcoma (UCS), and ovary (OV), obtained from The Cancer Genome Atlas, were used as the training and validation sets, respectively. Transcriptomic data of 14 mucinous cancer specimens from a tissue archive were used as the test set. For identifying the site of origin, a set of 100 differentially expressed genes for each site of origin was selected. After removing multiple iterations of the same gene, 427 genes were chosen, and their RNA expression profiles, at each site of origin, were used to train the deep neural network classifier. The performance of the classifier was estimated using the training, validation, and test sets. RESULTS: The accuracy of the model in the training set was 0.998, while that in the validation set was 0.939 (77/82). In the test set which is newly sequenced from a tissue archive, the model showed an accuracy of 0.857 (12/14). t-SNE analysis revealed that samples in the test set were part of the clusters obtained for the training set. CONCLUSION: Although limited by small sample size, we showed that a transcriptome-based classifier could correctly identify the site of origin of mucinous cancer. |
format | Online Article Text |
id | pubmed-9669684 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | SAGE Publications |
record_format | MEDLINE/PubMed |
spelling | pubmed-96696842022-11-18 A transcriptome-Based Deep Neural Network Classifier for Identifying the Site of Origin in Mucinous Cancer Ahn, Taejin Kim, Kidong Kim, Hyojin Kim, Sarah Park, Sangick Lee, Kyoungbun Cancer Inform Original Research PURPOSE: There is a lack of tools for identifying the site of origin in mucinous cancer. This study aimed to evaluate the performance of a transcriptome-based classifier for identifying the site of origin in mucinous cancer. MATERIALS AND METHODS: Transcriptomic data of 1878 non-mucinous and 82 mucinous cancer specimens, with 7 sites of origin, namely, the uterine cervix (CESC), colon (COAD), pancreas (PAAD), stomach (STAD), uterine endometrium (UCEC), uterine carcinosarcoma (UCS), and ovary (OV), obtained from The Cancer Genome Atlas, were used as the training and validation sets, respectively. Transcriptomic data of 14 mucinous cancer specimens from a tissue archive were used as the test set. For identifying the site of origin, a set of 100 differentially expressed genes for each site of origin was selected. After removing multiple iterations of the same gene, 427 genes were chosen, and their RNA expression profiles, at each site of origin, were used to train the deep neural network classifier. The performance of the classifier was estimated using the training, validation, and test sets. RESULTS: The accuracy of the model in the training set was 0.998, while that in the validation set was 0.939 (77/82). In the test set which is newly sequenced from a tissue archive, the model showed an accuracy of 0.857 (12/14). t-SNE analysis revealed that samples in the test set were part of the clusters obtained for the training set. CONCLUSION: Although limited by small sample size, we showed that a transcriptome-based classifier could correctly identify the site of origin of mucinous cancer. SAGE Publications 2022-11-15 /pmc/articles/PMC9669684/ /pubmed/36408331 http://dx.doi.org/10.1177/11769351221135141 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by-nc/4.0/This article is distributed under the terms of the Creative Commons Attribution-NonCommercial 4.0 License (https://creativecommons.org/licenses/by-nc/4.0/) which permits non-commercial use, reproduction and distribution of the work without further permission provided the original work is attributed as specified on the SAGE and Open Access page (https://us.sagepub.com/en-us/nam/open-access-at-sage). |
spellingShingle | Original Research Ahn, Taejin Kim, Kidong Kim, Hyojin Kim, Sarah Park, Sangick Lee, Kyoungbun A transcriptome-Based Deep Neural Network Classifier for Identifying the Site of Origin in Mucinous Cancer |
title | A transcriptome-Based Deep Neural Network Classifier for Identifying the Site of Origin in Mucinous Cancer |
title_full | A transcriptome-Based Deep Neural Network Classifier for Identifying the Site of Origin in Mucinous Cancer |
title_fullStr | A transcriptome-Based Deep Neural Network Classifier for Identifying the Site of Origin in Mucinous Cancer |
title_full_unstemmed | A transcriptome-Based Deep Neural Network Classifier for Identifying the Site of Origin in Mucinous Cancer |
title_short | A transcriptome-Based Deep Neural Network Classifier for Identifying the Site of Origin in Mucinous Cancer |
title_sort | transcriptome-based deep neural network classifier for identifying the site of origin in mucinous cancer |
topic | Original Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9669684/ https://www.ncbi.nlm.nih.gov/pubmed/36408331 http://dx.doi.org/10.1177/11769351221135141 |
work_keys_str_mv | AT ahntaejin atranscriptomebaseddeepneuralnetworkclassifierforidentifyingthesiteoforigininmucinouscancer AT kimkidong atranscriptomebaseddeepneuralnetworkclassifierforidentifyingthesiteoforigininmucinouscancer AT kimhyojin atranscriptomebaseddeepneuralnetworkclassifierforidentifyingthesiteoforigininmucinouscancer AT kimsarah atranscriptomebaseddeepneuralnetworkclassifierforidentifyingthesiteoforigininmucinouscancer AT parksangick atranscriptomebaseddeepneuralnetworkclassifierforidentifyingthesiteoforigininmucinouscancer AT leekyoungbun atranscriptomebaseddeepneuralnetworkclassifierforidentifyingthesiteoforigininmucinouscancer AT ahntaejin transcriptomebaseddeepneuralnetworkclassifierforidentifyingthesiteoforigininmucinouscancer AT kimkidong transcriptomebaseddeepneuralnetworkclassifierforidentifyingthesiteoforigininmucinouscancer AT kimhyojin transcriptomebaseddeepneuralnetworkclassifierforidentifyingthesiteoforigininmucinouscancer AT kimsarah transcriptomebaseddeepneuralnetworkclassifierforidentifyingthesiteoforigininmucinouscancer AT parksangick transcriptomebaseddeepneuralnetworkclassifierforidentifyingthesiteoforigininmucinouscancer AT leekyoungbun transcriptomebaseddeepneuralnetworkclassifierforidentifyingthesiteoforigininmucinouscancer |