Cargando…
Comparative Evaluation of Machine Learning Strategies for Analyzing Big Data in Psychiatry
The requirement of innovative big data analytics has become a critical success factor for research in biological psychiatry. Integrative analyses across distributed data resources are considered essential for untangling the biological complexity of mental illnesses. However, little is known about al...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6274760/ https://www.ncbi.nlm.nih.gov/pubmed/30380679 http://dx.doi.org/10.3390/ijms19113387 |
_version_ | 1783377681884119040 |
---|---|
author | Cao, Han Meyer-Lindenberg, Andreas Schwarz, Emanuel |
author_facet | Cao, Han Meyer-Lindenberg, Andreas Schwarz, Emanuel |
author_sort | Cao, Han |
collection | PubMed |
description | The requirement of innovative big data analytics has become a critical success factor for research in biological psychiatry. Integrative analyses across distributed data resources are considered essential for untangling the biological complexity of mental illnesses. However, little is known about algorithm properties for such integrative machine learning. Here, we performed a comparative analysis of eight machine learning algorithms for identification of reproducible biological fingerprints across data sources, using five transcriptome-wide expression datasets of schizophrenia patients and controls as a use case. We found that multi-task learning (MTL) with network structure (MTL_NET) showed superior accuracy compared to other MTL formulations as well as single task learning, and tied performance with support vector machines (SVM). Compared to SVM, MTL_NET showed significant benefits regarding the variability of accuracy estimates, as well as its robustness to cross-dataset and sampling variability. These results support the utility of this algorithm as a flexible tool for integrative machine learning in psychiatry. |
format | Online Article Text |
id | pubmed-6274760 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-62747602018-12-15 Comparative Evaluation of Machine Learning Strategies for Analyzing Big Data in Psychiatry Cao, Han Meyer-Lindenberg, Andreas Schwarz, Emanuel Int J Mol Sci Article The requirement of innovative big data analytics has become a critical success factor for research in biological psychiatry. Integrative analyses across distributed data resources are considered essential for untangling the biological complexity of mental illnesses. However, little is known about algorithm properties for such integrative machine learning. Here, we performed a comparative analysis of eight machine learning algorithms for identification of reproducible biological fingerprints across data sources, using five transcriptome-wide expression datasets of schizophrenia patients and controls as a use case. We found that multi-task learning (MTL) with network structure (MTL_NET) showed superior accuracy compared to other MTL formulations as well as single task learning, and tied performance with support vector machines (SVM). Compared to SVM, MTL_NET showed significant benefits regarding the variability of accuracy estimates, as well as its robustness to cross-dataset and sampling variability. These results support the utility of this algorithm as a flexible tool for integrative machine learning in psychiatry. MDPI 2018-10-29 /pmc/articles/PMC6274760/ /pubmed/30380679 http://dx.doi.org/10.3390/ijms19113387 Text en © 2018 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Cao, Han Meyer-Lindenberg, Andreas Schwarz, Emanuel Comparative Evaluation of Machine Learning Strategies for Analyzing Big Data in Psychiatry |
title | Comparative Evaluation of Machine Learning Strategies for Analyzing Big Data in Psychiatry |
title_full | Comparative Evaluation of Machine Learning Strategies for Analyzing Big Data in Psychiatry |
title_fullStr | Comparative Evaluation of Machine Learning Strategies for Analyzing Big Data in Psychiatry |
title_full_unstemmed | Comparative Evaluation of Machine Learning Strategies for Analyzing Big Data in Psychiatry |
title_short | Comparative Evaluation of Machine Learning Strategies for Analyzing Big Data in Psychiatry |
title_sort | comparative evaluation of machine learning strategies for analyzing big data in psychiatry |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6274760/ https://www.ncbi.nlm.nih.gov/pubmed/30380679 http://dx.doi.org/10.3390/ijms19113387 |
work_keys_str_mv | AT caohan comparativeevaluationofmachinelearningstrategiesforanalyzingbigdatainpsychiatry AT meyerlindenbergandreas comparativeevaluationofmachinelearningstrategiesforanalyzingbigdatainpsychiatry AT schwarzemanuel comparativeevaluationofmachinelearningstrategiesforanalyzingbigdatainpsychiatry |