Cargando…

Authentication of differential gene expression in oral squamous cell carcinoma using machine learning applications

BACKGROUND: Recently, the possibility of tumour classification based on genetic data has been investigated. However, genetic datasets are difficult to handle because of their massive size and complexity of manipulation. In the present study, we examined the diagnostic performance of machine learning...

Descripción completa

Detalles Bibliográficos
Autores principales: Pratama, Rian, Hwang, Jae Joon, Lee, Ji Hye, Song, Giltae, Park, Hae Ryoun
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8164276/
https://www.ncbi.nlm.nih.gov/pubmed/34051764
http://dx.doi.org/10.1186/s12903-021-01642-9
_version_ 1783701081521389568
author Pratama, Rian
Hwang, Jae Joon
Lee, Ji Hye
Song, Giltae
Park, Hae Ryoun
author_facet Pratama, Rian
Hwang, Jae Joon
Lee, Ji Hye
Song, Giltae
Park, Hae Ryoun
author_sort Pratama, Rian
collection PubMed
description BACKGROUND: Recently, the possibility of tumour classification based on genetic data has been investigated. However, genetic datasets are difficult to handle because of their massive size and complexity of manipulation. In the present study, we examined the diagnostic performance of machine learning applications using imaging-based classifications of oral squamous cell carcinoma (OSCC) gene sets. METHODS: RNA sequencing data from SCC tissues from various sites, including oral, non-oral head and neck, oesophageal, and cervical regions, were downloaded from The Cancer Genome Atlas (TCGA). The feature genes were extracted through a convolutional neural network (CNN) and machine learning, and the performance of each analysis was compared. RESULTS: The ability of the machine learning analysis to classify OSCC tumours was excellent. However, the tool exhibited poorer performance in discriminating histopathologically dissimilar cancers derived from the same type of tissue than in differentiating cancers of the same histopathologic type with different tissue origins, revealing that the differential gene expression pattern is a more important factor than the histopathologic features for differentiating cancer types. CONCLUSION: The CNN-based diagnostic model and the visualisation methods using RNA sequencing data were useful for correctly categorising OSCC. The analysis showed differentially expressed genes in multiwise comparisons of various types of SCCs, such as KCNA10, FOSL2, and PRDM16, and extracted leader genes from pairwise comparisons were FGF20, DLC1, and ZNF705D. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12903-021-01642-9.
format Online
Article
Text
id pubmed-8164276
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-81642762021-06-01 Authentication of differential gene expression in oral squamous cell carcinoma using machine learning applications Pratama, Rian Hwang, Jae Joon Lee, Ji Hye Song, Giltae Park, Hae Ryoun BMC Oral Health Research BACKGROUND: Recently, the possibility of tumour classification based on genetic data has been investigated. However, genetic datasets are difficult to handle because of their massive size and complexity of manipulation. In the present study, we examined the diagnostic performance of machine learning applications using imaging-based classifications of oral squamous cell carcinoma (OSCC) gene sets. METHODS: RNA sequencing data from SCC tissues from various sites, including oral, non-oral head and neck, oesophageal, and cervical regions, were downloaded from The Cancer Genome Atlas (TCGA). The feature genes were extracted through a convolutional neural network (CNN) and machine learning, and the performance of each analysis was compared. RESULTS: The ability of the machine learning analysis to classify OSCC tumours was excellent. However, the tool exhibited poorer performance in discriminating histopathologically dissimilar cancers derived from the same type of tissue than in differentiating cancers of the same histopathologic type with different tissue origins, revealing that the differential gene expression pattern is a more important factor than the histopathologic features for differentiating cancer types. CONCLUSION: The CNN-based diagnostic model and the visualisation methods using RNA sequencing data were useful for correctly categorising OSCC. The analysis showed differentially expressed genes in multiwise comparisons of various types of SCCs, such as KCNA10, FOSL2, and PRDM16, and extracted leader genes from pairwise comparisons were FGF20, DLC1, and ZNF705D. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12903-021-01642-9. BioMed Central 2021-05-29 /pmc/articles/PMC8164276/ /pubmed/34051764 http://dx.doi.org/10.1186/s12903-021-01642-9 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Pratama, Rian
Hwang, Jae Joon
Lee, Ji Hye
Song, Giltae
Park, Hae Ryoun
Authentication of differential gene expression in oral squamous cell carcinoma using machine learning applications
title Authentication of differential gene expression in oral squamous cell carcinoma using machine learning applications
title_full Authentication of differential gene expression in oral squamous cell carcinoma using machine learning applications
title_fullStr Authentication of differential gene expression in oral squamous cell carcinoma using machine learning applications
title_full_unstemmed Authentication of differential gene expression in oral squamous cell carcinoma using machine learning applications
title_short Authentication of differential gene expression in oral squamous cell carcinoma using machine learning applications
title_sort authentication of differential gene expression in oral squamous cell carcinoma using machine learning applications
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8164276/
https://www.ncbi.nlm.nih.gov/pubmed/34051764
http://dx.doi.org/10.1186/s12903-021-01642-9
work_keys_str_mv AT pratamarian authenticationofdifferentialgeneexpressioninoralsquamouscellcarcinomausingmachinelearningapplications
AT hwangjaejoon authenticationofdifferentialgeneexpressioninoralsquamouscellcarcinomausingmachinelearningapplications
AT leejihye authenticationofdifferentialgeneexpressioninoralsquamouscellcarcinomausingmachinelearningapplications
AT songgiltae authenticationofdifferentialgeneexpressioninoralsquamouscellcarcinomausingmachinelearningapplications
AT parkhaeryoun authenticationofdifferentialgeneexpressioninoralsquamouscellcarcinomausingmachinelearningapplications