Cargando…

The General Explanation Method with NMR Spectroscopy Enables the Identification of Metabolite Profiles Specific for Normal and Tumor Cell Lines

Machine learning models in metabolomics, despite their great prediction accuracy, are still not widely adopted owing to the lack of an efficient explanation for their predictions. In this study, we propose the use of the general explanation method to explain the predictions of a machine learning mod...

Descripción completa

Detalles Bibliográficos
Autores principales: Pečnik, Klemen, Todorović, Vesna, Bošnjak, Maša, Čemažar, Maja, Kononenko, Igor, Serša, Gregor, Plavec, Janez
Formato: Online Artículo Texto
Lenguaje:English
Publicado: John Wiley and Sons Inc. 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6220813/
https://www.ncbi.nlm.nih.gov/pubmed/30067305
http://dx.doi.org/10.1002/cbic.201800392
Descripción
Sumario:Machine learning models in metabolomics, despite their great prediction accuracy, are still not widely adopted owing to the lack of an efficient explanation for their predictions. In this study, we propose the use of the general explanation method to explain the predictions of a machine learning model to gain detailed insight into metabolic differences between biological systems. The method was tested on a dataset of (1)H NMR spectra acquired on normal lung and mesothelial cell lines and their tumor counterparts. Initially, the random forests and artificial neural network models were applied to the dataset, and excellent prediction accuracy was achieved. The predictions of the models were explained with the general explanation method, which enabled identification of discriminating metabolic concentration differences between individual cell lines and enabled the construction of their specific metabolic concentration profiles. This intuitive and robust method holds great promise for in‐depth understanding of the mechanisms that underline phenotypes as well as for biomarker discovery in complex diseases.