Cargando…
A Data Mining Approach Identified Salivary Biomarkers That Discriminate between Two Obesity Measures
BACKGROUND: A key mechanism of obesity involves dysregulation of metabolic and inflammatory markers. This study aimed to identify salivary biomarkers and other factors associated with obesity using an ensemble data mining approach. METHODS: For a random cohort of over 700 subjects from 8137 Kuwait c...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Hindawi
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6545762/ https://www.ncbi.nlm.nih.gov/pubmed/31236292 http://dx.doi.org/10.1155/2019/9570218 |
_version_ | 1783423440095543296 |
---|---|
author | Shi, Ping Goodson, J. Max |
author_facet | Shi, Ping Goodson, J. Max |
author_sort | Shi, Ping |
collection | PubMed |
description | BACKGROUND: A key mechanism of obesity involves dysregulation of metabolic and inflammatory markers. This study aimed to identify salivary biomarkers and other factors associated with obesity using an ensemble data mining approach. METHODS: For a random cohort of over 700 subjects from 8137 Kuwait children (10.00 ± 0.67 years), four data mining methods were applied to identify important variables associated with obesity, including logistic regression by lasso regularization (Lasso), multivariate adaptive regression spline (MARS), random forests (RF), and boosting classification trees (BT). Each algorithm generated a variable importance rank list, based on an internal cross-validation procedure. An aggregated importance ranking was constructed by averaging the rank ordering of variables from individual list, weighted by the classification performance of respective models. Subsequently, the subset of top-ranking variables that were identified with at least three algorithms was evaluated by classification performance using receiver operating characteristic (ROC) analysis with bootstrap percentile resampling. RESULTS: Obesity was defined either by the waist circumference (OBW) or by the body mass index (BMI) (OBWHO). We identified C-reactive protein (CRP), insulin, leptin, adiponectin, as salivary biomarkers associated with OBW, plus a clinical feature fitness level. A similar set of biomarkers was identified for OBWHO, but not including leptin. Tree-based clustering analysis revealed patterns that were significantly different between the OBW and OBWHO subjects. CONCLUSION: A data mining approach based on multiple algorithms is useful for identifying factors associated with phenotypes, especially in cases where relationships are not salient, and a consensus from multiple methods can help produce a more generalizable subset of features. In this case, we have demonstrated that evaluation using the waist circumference includes association with high levels of salivary leptin, which is not seen with evaluation by BMI. |
format | Online Article Text |
id | pubmed-6545762 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | Hindawi |
record_format | MEDLINE/PubMed |
spelling | pubmed-65457622019-06-24 A Data Mining Approach Identified Salivary Biomarkers That Discriminate between Two Obesity Measures Shi, Ping Goodson, J. Max J Obes Research Article BACKGROUND: A key mechanism of obesity involves dysregulation of metabolic and inflammatory markers. This study aimed to identify salivary biomarkers and other factors associated with obesity using an ensemble data mining approach. METHODS: For a random cohort of over 700 subjects from 8137 Kuwait children (10.00 ± 0.67 years), four data mining methods were applied to identify important variables associated with obesity, including logistic regression by lasso regularization (Lasso), multivariate adaptive regression spline (MARS), random forests (RF), and boosting classification trees (BT). Each algorithm generated a variable importance rank list, based on an internal cross-validation procedure. An aggregated importance ranking was constructed by averaging the rank ordering of variables from individual list, weighted by the classification performance of respective models. Subsequently, the subset of top-ranking variables that were identified with at least three algorithms was evaluated by classification performance using receiver operating characteristic (ROC) analysis with bootstrap percentile resampling. RESULTS: Obesity was defined either by the waist circumference (OBW) or by the body mass index (BMI) (OBWHO). We identified C-reactive protein (CRP), insulin, leptin, adiponectin, as salivary biomarkers associated with OBW, plus a clinical feature fitness level. A similar set of biomarkers was identified for OBWHO, but not including leptin. Tree-based clustering analysis revealed patterns that were significantly different between the OBW and OBWHO subjects. CONCLUSION: A data mining approach based on multiple algorithms is useful for identifying factors associated with phenotypes, especially in cases where relationships are not salient, and a consensus from multiple methods can help produce a more generalizable subset of features. In this case, we have demonstrated that evaluation using the waist circumference includes association with high levels of salivary leptin, which is not seen with evaluation by BMI. Hindawi 2019-05-19 /pmc/articles/PMC6545762/ /pubmed/31236292 http://dx.doi.org/10.1155/2019/9570218 Text en Copyright © 2019 Ping Shi and J. Max Goodson. http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Shi, Ping Goodson, J. Max A Data Mining Approach Identified Salivary Biomarkers That Discriminate between Two Obesity Measures |
title | A Data Mining Approach Identified Salivary Biomarkers That Discriminate between Two Obesity Measures |
title_full | A Data Mining Approach Identified Salivary Biomarkers That Discriminate between Two Obesity Measures |
title_fullStr | A Data Mining Approach Identified Salivary Biomarkers That Discriminate between Two Obesity Measures |
title_full_unstemmed | A Data Mining Approach Identified Salivary Biomarkers That Discriminate between Two Obesity Measures |
title_short | A Data Mining Approach Identified Salivary Biomarkers That Discriminate between Two Obesity Measures |
title_sort | data mining approach identified salivary biomarkers that discriminate between two obesity measures |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6545762/ https://www.ncbi.nlm.nih.gov/pubmed/31236292 http://dx.doi.org/10.1155/2019/9570218 |
work_keys_str_mv | AT shiping adataminingapproachidentifiedsalivarybiomarkersthatdiscriminatebetweentwoobesitymeasures AT goodsonjmax adataminingapproachidentifiedsalivarybiomarkersthatdiscriminatebetweentwoobesitymeasures AT shiping dataminingapproachidentifiedsalivarybiomarkersthatdiscriminatebetweentwoobesitymeasures AT goodsonjmax dataminingapproachidentifiedsalivarybiomarkersthatdiscriminatebetweentwoobesitymeasures |