Cargando…

A Data Mining Approach Identified Salivary Biomarkers That Discriminate between Two Obesity Measures

BACKGROUND: A key mechanism of obesity involves dysregulation of metabolic and inflammatory markers. This study aimed to identify salivary biomarkers and other factors associated with obesity using an ensemble data mining approach. METHODS: For a random cohort of over 700 subjects from 8137 Kuwait c...

Descripción completa

Detalles Bibliográficos
Autores principales: Shi, Ping, Goodson, J. Max
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6545762/
https://www.ncbi.nlm.nih.gov/pubmed/31236292
http://dx.doi.org/10.1155/2019/9570218
_version_ 1783423440095543296
author Shi, Ping
Goodson, J. Max
author_facet Shi, Ping
Goodson, J. Max
author_sort Shi, Ping
collection PubMed
description BACKGROUND: A key mechanism of obesity involves dysregulation of metabolic and inflammatory markers. This study aimed to identify salivary biomarkers and other factors associated with obesity using an ensemble data mining approach. METHODS: For a random cohort of over 700 subjects from 8137 Kuwait children (10.00 ± 0.67 years), four data mining methods were applied to identify important variables associated with obesity, including logistic regression by lasso regularization (Lasso), multivariate adaptive regression spline (MARS), random forests (RF), and boosting classification trees (BT). Each algorithm generated a variable importance rank list, based on an internal cross-validation procedure. An aggregated importance ranking was constructed by averaging the rank ordering of variables from individual list, weighted by the classification performance of respective models. Subsequently, the subset of top-ranking variables that were identified with at least three algorithms was evaluated by classification performance using receiver operating characteristic (ROC) analysis with bootstrap percentile resampling. RESULTS: Obesity was defined either by the waist circumference (OBW) or by the body mass index (BMI) (OBWHO). We identified C-reactive protein (CRP), insulin, leptin, adiponectin, as salivary biomarkers associated with OBW, plus a clinical feature fitness level. A similar set of biomarkers was identified for OBWHO, but not including leptin. Tree-based clustering analysis revealed patterns that were significantly different between the OBW and OBWHO subjects. CONCLUSION: A data mining approach based on multiple algorithms is useful for identifying factors associated with phenotypes, especially in cases where relationships are not salient, and a consensus from multiple methods can help produce a more generalizable subset of features. In this case, we have demonstrated that evaluation using the waist circumference includes association with high levels of salivary leptin, which is not seen with evaluation by BMI.
format Online
Article
Text
id pubmed-6545762
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Hindawi
record_format MEDLINE/PubMed
spelling pubmed-65457622019-06-24 A Data Mining Approach Identified Salivary Biomarkers That Discriminate between Two Obesity Measures Shi, Ping Goodson, J. Max J Obes Research Article BACKGROUND: A key mechanism of obesity involves dysregulation of metabolic and inflammatory markers. This study aimed to identify salivary biomarkers and other factors associated with obesity using an ensemble data mining approach. METHODS: For a random cohort of over 700 subjects from 8137 Kuwait children (10.00 ± 0.67 years), four data mining methods were applied to identify important variables associated with obesity, including logistic regression by lasso regularization (Lasso), multivariate adaptive regression spline (MARS), random forests (RF), and boosting classification trees (BT). Each algorithm generated a variable importance rank list, based on an internal cross-validation procedure. An aggregated importance ranking was constructed by averaging the rank ordering of variables from individual list, weighted by the classification performance of respective models. Subsequently, the subset of top-ranking variables that were identified with at least three algorithms was evaluated by classification performance using receiver operating characteristic (ROC) analysis with bootstrap percentile resampling. RESULTS: Obesity was defined either by the waist circumference (OBW) or by the body mass index (BMI) (OBWHO). We identified C-reactive protein (CRP), insulin, leptin, adiponectin, as salivary biomarkers associated with OBW, plus a clinical feature fitness level. A similar set of biomarkers was identified for OBWHO, but not including leptin. Tree-based clustering analysis revealed patterns that were significantly different between the OBW and OBWHO subjects. CONCLUSION: A data mining approach based on multiple algorithms is useful for identifying factors associated with phenotypes, especially in cases where relationships are not salient, and a consensus from multiple methods can help produce a more generalizable subset of features. In this case, we have demonstrated that evaluation using the waist circumference includes association with high levels of salivary leptin, which is not seen with evaluation by BMI. Hindawi 2019-05-19 /pmc/articles/PMC6545762/ /pubmed/31236292 http://dx.doi.org/10.1155/2019/9570218 Text en Copyright © 2019 Ping Shi and J. Max Goodson. http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Shi, Ping
Goodson, J. Max
A Data Mining Approach Identified Salivary Biomarkers That Discriminate between Two Obesity Measures
title A Data Mining Approach Identified Salivary Biomarkers That Discriminate between Two Obesity Measures
title_full A Data Mining Approach Identified Salivary Biomarkers That Discriminate between Two Obesity Measures
title_fullStr A Data Mining Approach Identified Salivary Biomarkers That Discriminate between Two Obesity Measures
title_full_unstemmed A Data Mining Approach Identified Salivary Biomarkers That Discriminate between Two Obesity Measures
title_short A Data Mining Approach Identified Salivary Biomarkers That Discriminate between Two Obesity Measures
title_sort data mining approach identified salivary biomarkers that discriminate between two obesity measures
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6545762/
https://www.ncbi.nlm.nih.gov/pubmed/31236292
http://dx.doi.org/10.1155/2019/9570218
work_keys_str_mv AT shiping adataminingapproachidentifiedsalivarybiomarkersthatdiscriminatebetweentwoobesitymeasures
AT goodsonjmax adataminingapproachidentifiedsalivarybiomarkersthatdiscriminatebetweentwoobesitymeasures
AT shiping dataminingapproachidentifiedsalivarybiomarkersthatdiscriminatebetweentwoobesitymeasures
AT goodsonjmax dataminingapproachidentifiedsalivarybiomarkersthatdiscriminatebetweentwoobesitymeasures