Cargando…
Rule-Mining for the Early Prediction of Chronic Kidney Disease Based on Metabolomics and Multi-Source Data
(1)H Nuclear Magnetic Resonance (NMR)-based metabolic profiling is very promising for the diagnostic of the stages of chronic kidney disease (CKD). Because of the high dimension of NMR spectra datasets and the complex mixture of metabolites in biological samples, the identification of discriminant b...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5115883/ https://www.ncbi.nlm.nih.gov/pubmed/27861591 http://dx.doi.org/10.1371/journal.pone.0166905 |
_version_ | 1782468589256704000 |
---|---|
author | Luck, Margaux Bertho, Gildas Bateson, Mathilde Karras, Alexandre Yartseva, Anastasia Thervet, Eric Damon, Cecilia Pallet, Nicolas |
author_facet | Luck, Margaux Bertho, Gildas Bateson, Mathilde Karras, Alexandre Yartseva, Anastasia Thervet, Eric Damon, Cecilia Pallet, Nicolas |
author_sort | Luck, Margaux |
collection | PubMed |
description | (1)H Nuclear Magnetic Resonance (NMR)-based metabolic profiling is very promising for the diagnostic of the stages of chronic kidney disease (CKD). Because of the high dimension of NMR spectra datasets and the complex mixture of metabolites in biological samples, the identification of discriminant biomarkers of a disease is challenging. None of the widely used chemometric methods in NMR metabolomics performs a local exhaustive exploration of the data. We developed a descriptive and easily understandable approach that searches for discriminant local phenomena using an original exhaustive rule-mining algorithm in order to predict two groups of patients: 1) patients having low to mild CKD stages with no renal failure and 2) patients having moderate to established CKD stages with renal failure. Our predictive algorithm explores the m-dimensional variable space to capture the local overdensities of the two groups of patients under the form of easily interpretable rules. Afterwards, a L2-penalized logistic regression on the discriminant rules was used to build predictive models of the CKD stages. We explored a complex multi-source dataset that included the clinical, demographic, clinical chemistry, renal pathology and urine metabolomic data of a cohort of 110 patients. Given this multi-source dataset and the complex nature of metabolomic data, we analyzed 1- and 2-dimensional rules in order to integrate the information carried by the interactions between the variables. The results indicated that our local algorithm is a valuable analytical method for the precise characterization of multivariate CKD stage profiles and as efficient as the classical global model using chi2 variable section with an approximately 70% of good classification level. The resulting predictive models predominantly identify urinary metabolites (such as 3-hydroxyisovalerate, carnitine, citrate, dimethylsulfone, creatinine and N-methylnicotinamide) as relevant variables indicating that CKD significantly affects the urinary metabolome. In addition, the simple knowledge of the concentration of urinary metabolites classifies the CKD stage of the patients correctly. |
format | Online Article Text |
id | pubmed-5115883 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-51158832016-12-08 Rule-Mining for the Early Prediction of Chronic Kidney Disease Based on Metabolomics and Multi-Source Data Luck, Margaux Bertho, Gildas Bateson, Mathilde Karras, Alexandre Yartseva, Anastasia Thervet, Eric Damon, Cecilia Pallet, Nicolas PLoS One Research Article (1)H Nuclear Magnetic Resonance (NMR)-based metabolic profiling is very promising for the diagnostic of the stages of chronic kidney disease (CKD). Because of the high dimension of NMR spectra datasets and the complex mixture of metabolites in biological samples, the identification of discriminant biomarkers of a disease is challenging. None of the widely used chemometric methods in NMR metabolomics performs a local exhaustive exploration of the data. We developed a descriptive and easily understandable approach that searches for discriminant local phenomena using an original exhaustive rule-mining algorithm in order to predict two groups of patients: 1) patients having low to mild CKD stages with no renal failure and 2) patients having moderate to established CKD stages with renal failure. Our predictive algorithm explores the m-dimensional variable space to capture the local overdensities of the two groups of patients under the form of easily interpretable rules. Afterwards, a L2-penalized logistic regression on the discriminant rules was used to build predictive models of the CKD stages. We explored a complex multi-source dataset that included the clinical, demographic, clinical chemistry, renal pathology and urine metabolomic data of a cohort of 110 patients. Given this multi-source dataset and the complex nature of metabolomic data, we analyzed 1- and 2-dimensional rules in order to integrate the information carried by the interactions between the variables. The results indicated that our local algorithm is a valuable analytical method for the precise characterization of multivariate CKD stage profiles and as efficient as the classical global model using chi2 variable section with an approximately 70% of good classification level. The resulting predictive models predominantly identify urinary metabolites (such as 3-hydroxyisovalerate, carnitine, citrate, dimethylsulfone, creatinine and N-methylnicotinamide) as relevant variables indicating that CKD significantly affects the urinary metabolome. In addition, the simple knowledge of the concentration of urinary metabolites classifies the CKD stage of the patients correctly. Public Library of Science 2016-11-18 /pmc/articles/PMC5115883/ /pubmed/27861591 http://dx.doi.org/10.1371/journal.pone.0166905 Text en © 2016 Luck et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Luck, Margaux Bertho, Gildas Bateson, Mathilde Karras, Alexandre Yartseva, Anastasia Thervet, Eric Damon, Cecilia Pallet, Nicolas Rule-Mining for the Early Prediction of Chronic Kidney Disease Based on Metabolomics and Multi-Source Data |
title | Rule-Mining for the Early Prediction of Chronic Kidney Disease Based on Metabolomics and Multi-Source Data |
title_full | Rule-Mining for the Early Prediction of Chronic Kidney Disease Based on Metabolomics and Multi-Source Data |
title_fullStr | Rule-Mining for the Early Prediction of Chronic Kidney Disease Based on Metabolomics and Multi-Source Data |
title_full_unstemmed | Rule-Mining for the Early Prediction of Chronic Kidney Disease Based on Metabolomics and Multi-Source Data |
title_short | Rule-Mining for the Early Prediction of Chronic Kidney Disease Based on Metabolomics and Multi-Source Data |
title_sort | rule-mining for the early prediction of chronic kidney disease based on metabolomics and multi-source data |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5115883/ https://www.ncbi.nlm.nih.gov/pubmed/27861591 http://dx.doi.org/10.1371/journal.pone.0166905 |
work_keys_str_mv | AT luckmargaux ruleminingfortheearlypredictionofchronickidneydiseasebasedonmetabolomicsandmultisourcedata AT berthogildas ruleminingfortheearlypredictionofchronickidneydiseasebasedonmetabolomicsandmultisourcedata AT batesonmathilde ruleminingfortheearlypredictionofchronickidneydiseasebasedonmetabolomicsandmultisourcedata AT karrasalexandre ruleminingfortheearlypredictionofchronickidneydiseasebasedonmetabolomicsandmultisourcedata AT yartsevaanastasia ruleminingfortheearlypredictionofchronickidneydiseasebasedonmetabolomicsandmultisourcedata AT therveteric ruleminingfortheearlypredictionofchronickidneydiseasebasedonmetabolomicsandmultisourcedata AT damoncecilia ruleminingfortheearlypredictionofchronickidneydiseasebasedonmetabolomicsandmultisourcedata AT palletnicolas ruleminingfortheearlypredictionofchronickidneydiseasebasedonmetabolomicsandmultisourcedata |