Cargando…

Comparative Evaluation of Machine Learning Strategies for Analyzing Big Data in Psychiatry

The requirement of innovative big data analytics has become a critical success factor for research in biological psychiatry. Integrative analyses across distributed data resources are considered essential for untangling the biological complexity of mental illnesses. However, little is known about al...

Descripción completa

Detalles Bibliográficos
Autores principales: Cao, Han, Meyer-Lindenberg, Andreas, Schwarz, Emanuel
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6274760/
https://www.ncbi.nlm.nih.gov/pubmed/30380679
http://dx.doi.org/10.3390/ijms19113387
_version_ 1783377681884119040
author Cao, Han
Meyer-Lindenberg, Andreas
Schwarz, Emanuel
author_facet Cao, Han
Meyer-Lindenberg, Andreas
Schwarz, Emanuel
author_sort Cao, Han
collection PubMed
description The requirement of innovative big data analytics has become a critical success factor for research in biological psychiatry. Integrative analyses across distributed data resources are considered essential for untangling the biological complexity of mental illnesses. However, little is known about algorithm properties for such integrative machine learning. Here, we performed a comparative analysis of eight machine learning algorithms for identification of reproducible biological fingerprints across data sources, using five transcriptome-wide expression datasets of schizophrenia patients and controls as a use case. We found that multi-task learning (MTL) with network structure (MTL_NET) showed superior accuracy compared to other MTL formulations as well as single task learning, and tied performance with support vector machines (SVM). Compared to SVM, MTL_NET showed significant benefits regarding the variability of accuracy estimates, as well as its robustness to cross-dataset and sampling variability. These results support the utility of this algorithm as a flexible tool for integrative machine learning in psychiatry.
format Online
Article
Text
id pubmed-6274760
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-62747602018-12-15 Comparative Evaluation of Machine Learning Strategies for Analyzing Big Data in Psychiatry Cao, Han Meyer-Lindenberg, Andreas Schwarz, Emanuel Int J Mol Sci Article The requirement of innovative big data analytics has become a critical success factor for research in biological psychiatry. Integrative analyses across distributed data resources are considered essential for untangling the biological complexity of mental illnesses. However, little is known about algorithm properties for such integrative machine learning. Here, we performed a comparative analysis of eight machine learning algorithms for identification of reproducible biological fingerprints across data sources, using five transcriptome-wide expression datasets of schizophrenia patients and controls as a use case. We found that multi-task learning (MTL) with network structure (MTL_NET) showed superior accuracy compared to other MTL formulations as well as single task learning, and tied performance with support vector machines (SVM). Compared to SVM, MTL_NET showed significant benefits regarding the variability of accuracy estimates, as well as its robustness to cross-dataset and sampling variability. These results support the utility of this algorithm as a flexible tool for integrative machine learning in psychiatry. MDPI 2018-10-29 /pmc/articles/PMC6274760/ /pubmed/30380679 http://dx.doi.org/10.3390/ijms19113387 Text en © 2018 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Cao, Han
Meyer-Lindenberg, Andreas
Schwarz, Emanuel
Comparative Evaluation of Machine Learning Strategies for Analyzing Big Data in Psychiatry
title Comparative Evaluation of Machine Learning Strategies for Analyzing Big Data in Psychiatry
title_full Comparative Evaluation of Machine Learning Strategies for Analyzing Big Data in Psychiatry
title_fullStr Comparative Evaluation of Machine Learning Strategies for Analyzing Big Data in Psychiatry
title_full_unstemmed Comparative Evaluation of Machine Learning Strategies for Analyzing Big Data in Psychiatry
title_short Comparative Evaluation of Machine Learning Strategies for Analyzing Big Data in Psychiatry
title_sort comparative evaluation of machine learning strategies for analyzing big data in psychiatry
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6274760/
https://www.ncbi.nlm.nih.gov/pubmed/30380679
http://dx.doi.org/10.3390/ijms19113387
work_keys_str_mv AT caohan comparativeevaluationofmachinelearningstrategiesforanalyzingbigdatainpsychiatry
AT meyerlindenbergandreas comparativeevaluationofmachinelearningstrategiesforanalyzingbigdatainpsychiatry
AT schwarzemanuel comparativeevaluationofmachinelearningstrategiesforanalyzingbigdatainpsychiatry