Cargando…

Machine Learning-Based Genome-Wide Salivary DNA Methylation Analysis for Identification of Noninvasive Biomarkers in Oral Cancer Diagnosis

SIMPLE SUMMARY: Because tissue biopsy is the gold standard for diagnosing oral cancer, it is often performed to confirm disease during screening, management, and monitoring. However, many reports are negative. Salivary biomarkers can provide the preliminary stratification of suspicious lesions to en...

Descripción completa

Detalles Bibliográficos
Autores principales: Adeoye, John, Wan, Chi Ching Joan, Zheng, Li-Wu, Thomson, Peter, Choi, Siu-Wai, Su, Yu-Xiong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9563273/
https://www.ncbi.nlm.nih.gov/pubmed/36230858
http://dx.doi.org/10.3390/cancers14194935
_version_ 1784808363311235072
author Adeoye, John
Wan, Chi Ching Joan
Zheng, Li-Wu
Thomson, Peter
Choi, Siu-Wai
Su, Yu-Xiong
author_facet Adeoye, John
Wan, Chi Ching Joan
Zheng, Li-Wu
Thomson, Peter
Choi, Siu-Wai
Su, Yu-Xiong
author_sort Adeoye, John
collection PubMed
description SIMPLE SUMMARY: Because tissue biopsy is the gold standard for diagnosing oral cancer, it is often performed to confirm disease during screening, management, and monitoring. However, many reports are negative. Salivary biomarkers can provide the preliminary stratification of suspicious lesions to encourage patient selection in clinical practice. However, the discovery and implementation of salivary biomarkers still need to be refined. Therefore, in this study, we successfully utilized machine learning techniques to select optimal methylome biomarkers that may be applied for oral cancer diagnoses. ABSTRACT: This study aims to examine the feasibility of ML-assisted salivary-liquid-biopsy platforms using genome-wide methylation analysis at the base-pair and regional resolution for delineating oral squamous cell carcinoma (OSCC) and oral potentially malignant disorders (OPMDs). A nested cohort of patients with OSCC and OPMDs was randomly selected from among patients with oral mucosal diseases. Saliva samples were collected, and DNA extracted from cell pellets was processed for reduced-representation bisulfite sequencing. Reads with a minimum of 10× coverage were used to identify differentially methylated CpG sites (DMCs) and 100 bp regions (DMRs). The performance of eight ML models and three feature-selection methods (ANOVA, MRMR, and LASSO) were then compared to determine the optimal biomarker models based on DMCs and DMRs. A total of 1745 DMCs and 105 DMRs were identified for detecting OSCC. The proportion of hypomethylated and hypermethylated DMCs was similar (51% vs. 49%), while most DMRs were hypermethylated (62.9%). Furthermore, more DMRs than DMCs were annotated to promoter regions (36% vs. 16%) and more DMCs than DMRs were annotated to intergenic regions (50% vs. 36%). Of all the ML models compared, the linear SVM model based on 11 optimal DMRs selected by LASSO had a perfect AUC, recall, specificity, and calibration (1.00) for OSCC detection. Overall, genome-wide DNA methylation techniques can be applied directly to saliva samples for biomarker discovery and ML-based platforms may be useful in stratifying OSCC during disease screening and monitoring.
format Online
Article
Text
id pubmed-9563273
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-95632732022-10-15 Machine Learning-Based Genome-Wide Salivary DNA Methylation Analysis for Identification of Noninvasive Biomarkers in Oral Cancer Diagnosis Adeoye, John Wan, Chi Ching Joan Zheng, Li-Wu Thomson, Peter Choi, Siu-Wai Su, Yu-Xiong Cancers (Basel) Article SIMPLE SUMMARY: Because tissue biopsy is the gold standard for diagnosing oral cancer, it is often performed to confirm disease during screening, management, and monitoring. However, many reports are negative. Salivary biomarkers can provide the preliminary stratification of suspicious lesions to encourage patient selection in clinical practice. However, the discovery and implementation of salivary biomarkers still need to be refined. Therefore, in this study, we successfully utilized machine learning techniques to select optimal methylome biomarkers that may be applied for oral cancer diagnoses. ABSTRACT: This study aims to examine the feasibility of ML-assisted salivary-liquid-biopsy platforms using genome-wide methylation analysis at the base-pair and regional resolution for delineating oral squamous cell carcinoma (OSCC) and oral potentially malignant disorders (OPMDs). A nested cohort of patients with OSCC and OPMDs was randomly selected from among patients with oral mucosal diseases. Saliva samples were collected, and DNA extracted from cell pellets was processed for reduced-representation bisulfite sequencing. Reads with a minimum of 10× coverage were used to identify differentially methylated CpG sites (DMCs) and 100 bp regions (DMRs). The performance of eight ML models and three feature-selection methods (ANOVA, MRMR, and LASSO) were then compared to determine the optimal biomarker models based on DMCs and DMRs. A total of 1745 DMCs and 105 DMRs were identified for detecting OSCC. The proportion of hypomethylated and hypermethylated DMCs was similar (51% vs. 49%), while most DMRs were hypermethylated (62.9%). Furthermore, more DMRs than DMCs were annotated to promoter regions (36% vs. 16%) and more DMCs than DMRs were annotated to intergenic regions (50% vs. 36%). Of all the ML models compared, the linear SVM model based on 11 optimal DMRs selected by LASSO had a perfect AUC, recall, specificity, and calibration (1.00) for OSCC detection. Overall, genome-wide DNA methylation techniques can be applied directly to saliva samples for biomarker discovery and ML-based platforms may be useful in stratifying OSCC during disease screening and monitoring. MDPI 2022-10-08 /pmc/articles/PMC9563273/ /pubmed/36230858 http://dx.doi.org/10.3390/cancers14194935 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Adeoye, John
Wan, Chi Ching Joan
Zheng, Li-Wu
Thomson, Peter
Choi, Siu-Wai
Su, Yu-Xiong
Machine Learning-Based Genome-Wide Salivary DNA Methylation Analysis for Identification of Noninvasive Biomarkers in Oral Cancer Diagnosis
title Machine Learning-Based Genome-Wide Salivary DNA Methylation Analysis for Identification of Noninvasive Biomarkers in Oral Cancer Diagnosis
title_full Machine Learning-Based Genome-Wide Salivary DNA Methylation Analysis for Identification of Noninvasive Biomarkers in Oral Cancer Diagnosis
title_fullStr Machine Learning-Based Genome-Wide Salivary DNA Methylation Analysis for Identification of Noninvasive Biomarkers in Oral Cancer Diagnosis
title_full_unstemmed Machine Learning-Based Genome-Wide Salivary DNA Methylation Analysis for Identification of Noninvasive Biomarkers in Oral Cancer Diagnosis
title_short Machine Learning-Based Genome-Wide Salivary DNA Methylation Analysis for Identification of Noninvasive Biomarkers in Oral Cancer Diagnosis
title_sort machine learning-based genome-wide salivary dna methylation analysis for identification of noninvasive biomarkers in oral cancer diagnosis
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9563273/
https://www.ncbi.nlm.nih.gov/pubmed/36230858
http://dx.doi.org/10.3390/cancers14194935
work_keys_str_mv AT adeoyejohn machinelearningbasedgenomewidesalivarydnamethylationanalysisforidentificationofnoninvasivebiomarkersinoralcancerdiagnosis
AT wanchichingjoan machinelearningbasedgenomewidesalivarydnamethylationanalysisforidentificationofnoninvasivebiomarkersinoralcancerdiagnosis
AT zhengliwu machinelearningbasedgenomewidesalivarydnamethylationanalysisforidentificationofnoninvasivebiomarkersinoralcancerdiagnosis
AT thomsonpeter machinelearningbasedgenomewidesalivarydnamethylationanalysisforidentificationofnoninvasivebiomarkersinoralcancerdiagnosis
AT choisiuwai machinelearningbasedgenomewidesalivarydnamethylationanalysisforidentificationofnoninvasivebiomarkersinoralcancerdiagnosis
AT suyuxiong machinelearningbasedgenomewidesalivarydnamethylationanalysisforidentificationofnoninvasivebiomarkersinoralcancerdiagnosis