Cargando…

Improved metabolomic data-based prediction of depressive symptoms using nonlinear machine learning with feature selection

To solve major limitations in algorithms for the metabolite-based prediction of psychiatric phenotypes, a novel prediction model for depressive symptoms based on nonlinear feature selection machine learning, the Hilbert–Schmidt independence criterion least absolute shrinkage and selection operator (...

Descripción completa

Detalles Bibliográficos
Autores principales: Takahashi, Yuta, Ueki, Masao, Yamada, Makoto, Tamiya, Gen, Motoike, Ikuko N., Saigusa, Daisuke, Sakurai, Miyuki, Nagami, Fuji, Ogishima, Soichi, Koshiba, Seizo, Kinoshita, Kengo, Yamamoto, Masayuki, Tomita, Hiroaki
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7237664/
https://www.ncbi.nlm.nih.gov/pubmed/32427830
http://dx.doi.org/10.1038/s41398-020-0831-9
_version_ 1783536374119399424
author Takahashi, Yuta
Ueki, Masao
Yamada, Makoto
Tamiya, Gen
Motoike, Ikuko N.
Saigusa, Daisuke
Sakurai, Miyuki
Nagami, Fuji
Ogishima, Soichi
Koshiba, Seizo
Kinoshita, Kengo
Yamamoto, Masayuki
Tomita, Hiroaki
author_facet Takahashi, Yuta
Ueki, Masao
Yamada, Makoto
Tamiya, Gen
Motoike, Ikuko N.
Saigusa, Daisuke
Sakurai, Miyuki
Nagami, Fuji
Ogishima, Soichi
Koshiba, Seizo
Kinoshita, Kengo
Yamamoto, Masayuki
Tomita, Hiroaki
author_sort Takahashi, Yuta
collection PubMed
description To solve major limitations in algorithms for the metabolite-based prediction of psychiatric phenotypes, a novel prediction model for depressive symptoms based on nonlinear feature selection machine learning, the Hilbert–Schmidt independence criterion least absolute shrinkage and selection operator (HSIC Lasso) algorithm, was developed and applied to a metabolomic dataset with the largest sample size to date. In total, 897 population-based subjects were recruited from the communities affected by the Great East Japan Earthquake; 306 metabolite features (37 metabolites identified by nuclear magnetic resonance measurements and 269 characterized metabolites based on the intensities from mass spectrometry) were utilized to build prediction models for depressive symptoms as evaluated by the Center for Epidemiologic Studies-Depression Scale (CES-D). The nested fivefold cross-validation was used for developing and evaluating the prediction models. The HSIC Lasso-based prediction model showed better predictive power than the other prediction models, including Lasso, support vector machine, partial least squares, random forest, and neural network. l-leucine, 3-hydroxyisobutyrate, and gamma-linolenyl carnitine frequently contributed to the prediction. We have demonstrated that the HSIC Lasso-based prediction model integrating nonlinear feature selection showed improved predictive power for depressive symptoms based on metabolome data as well as on risk metabolites based on nonlinear statistics in the Japanese population. Further studies should use HSIC Lasso-based prediction models with different ethnicities to investigate the generality of each risk metabolite for predicting depressive symptoms.
format Online
Article
Text
id pubmed-7237664
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-72376642020-06-04 Improved metabolomic data-based prediction of depressive symptoms using nonlinear machine learning with feature selection Takahashi, Yuta Ueki, Masao Yamada, Makoto Tamiya, Gen Motoike, Ikuko N. Saigusa, Daisuke Sakurai, Miyuki Nagami, Fuji Ogishima, Soichi Koshiba, Seizo Kinoshita, Kengo Yamamoto, Masayuki Tomita, Hiroaki Transl Psychiatry Article To solve major limitations in algorithms for the metabolite-based prediction of psychiatric phenotypes, a novel prediction model for depressive symptoms based on nonlinear feature selection machine learning, the Hilbert–Schmidt independence criterion least absolute shrinkage and selection operator (HSIC Lasso) algorithm, was developed and applied to a metabolomic dataset with the largest sample size to date. In total, 897 population-based subjects were recruited from the communities affected by the Great East Japan Earthquake; 306 metabolite features (37 metabolites identified by nuclear magnetic resonance measurements and 269 characterized metabolites based on the intensities from mass spectrometry) were utilized to build prediction models for depressive symptoms as evaluated by the Center for Epidemiologic Studies-Depression Scale (CES-D). The nested fivefold cross-validation was used for developing and evaluating the prediction models. The HSIC Lasso-based prediction model showed better predictive power than the other prediction models, including Lasso, support vector machine, partial least squares, random forest, and neural network. l-leucine, 3-hydroxyisobutyrate, and gamma-linolenyl carnitine frequently contributed to the prediction. We have demonstrated that the HSIC Lasso-based prediction model integrating nonlinear feature selection showed improved predictive power for depressive symptoms based on metabolome data as well as on risk metabolites based on nonlinear statistics in the Japanese population. Further studies should use HSIC Lasso-based prediction models with different ethnicities to investigate the generality of each risk metabolite for predicting depressive symptoms. Nature Publishing Group UK 2020-05-19 /pmc/articles/PMC7237664/ /pubmed/32427830 http://dx.doi.org/10.1038/s41398-020-0831-9 Text en © The Author(s) 2020 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Takahashi, Yuta
Ueki, Masao
Yamada, Makoto
Tamiya, Gen
Motoike, Ikuko N.
Saigusa, Daisuke
Sakurai, Miyuki
Nagami, Fuji
Ogishima, Soichi
Koshiba, Seizo
Kinoshita, Kengo
Yamamoto, Masayuki
Tomita, Hiroaki
Improved metabolomic data-based prediction of depressive symptoms using nonlinear machine learning with feature selection
title Improved metabolomic data-based prediction of depressive symptoms using nonlinear machine learning with feature selection
title_full Improved metabolomic data-based prediction of depressive symptoms using nonlinear machine learning with feature selection
title_fullStr Improved metabolomic data-based prediction of depressive symptoms using nonlinear machine learning with feature selection
title_full_unstemmed Improved metabolomic data-based prediction of depressive symptoms using nonlinear machine learning with feature selection
title_short Improved metabolomic data-based prediction of depressive symptoms using nonlinear machine learning with feature selection
title_sort improved metabolomic data-based prediction of depressive symptoms using nonlinear machine learning with feature selection
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7237664/
https://www.ncbi.nlm.nih.gov/pubmed/32427830
http://dx.doi.org/10.1038/s41398-020-0831-9
work_keys_str_mv AT takahashiyuta improvedmetabolomicdatabasedpredictionofdepressivesymptomsusingnonlinearmachinelearningwithfeatureselection
AT uekimasao improvedmetabolomicdatabasedpredictionofdepressivesymptomsusingnonlinearmachinelearningwithfeatureselection
AT yamadamakoto improvedmetabolomicdatabasedpredictionofdepressivesymptomsusingnonlinearmachinelearningwithfeatureselection
AT tamiyagen improvedmetabolomicdatabasedpredictionofdepressivesymptomsusingnonlinearmachinelearningwithfeatureselection
AT motoikeikukon improvedmetabolomicdatabasedpredictionofdepressivesymptomsusingnonlinearmachinelearningwithfeatureselection
AT saigusadaisuke improvedmetabolomicdatabasedpredictionofdepressivesymptomsusingnonlinearmachinelearningwithfeatureselection
AT sakuraimiyuki improvedmetabolomicdatabasedpredictionofdepressivesymptomsusingnonlinearmachinelearningwithfeatureselection
AT nagamifuji improvedmetabolomicdatabasedpredictionofdepressivesymptomsusingnonlinearmachinelearningwithfeatureselection
AT ogishimasoichi improvedmetabolomicdatabasedpredictionofdepressivesymptomsusingnonlinearmachinelearningwithfeatureselection
AT koshibaseizo improvedmetabolomicdatabasedpredictionofdepressivesymptomsusingnonlinearmachinelearningwithfeatureselection
AT kinoshitakengo improvedmetabolomicdatabasedpredictionofdepressivesymptomsusingnonlinearmachinelearningwithfeatureselection
AT yamamotomasayuki improvedmetabolomicdatabasedpredictionofdepressivesymptomsusingnonlinearmachinelearningwithfeatureselection
AT tomitahiroaki improvedmetabolomicdatabasedpredictionofdepressivesymptomsusingnonlinearmachinelearningwithfeatureselection