Cargando…

MildInt: Deep Learning-Based Multimodal Longitudinal Data Integration Framework

As large amounts of heterogeneous biomedical data become available, numerous methods for integrating such datasets have been developed to extract complementary knowledge from multiple domains of sources. Recently, a deep learning approach has shown promising results in a variety of research areas. H...

Descripción completa

Detalles Bibliográficos
Autores principales: Lee, Garam, Kang, Byungkon, Nho, Kwangsik, Sohn, Kyung-Ah, Kim, Dokyoon
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6611503/
https://www.ncbi.nlm.nih.gov/pubmed/31316553
http://dx.doi.org/10.3389/fgene.2019.00617
_version_ 1783432707430154240
author Lee, Garam
Kang, Byungkon
Nho, Kwangsik
Sohn, Kyung-Ah
Kim, Dokyoon
author_facet Lee, Garam
Kang, Byungkon
Nho, Kwangsik
Sohn, Kyung-Ah
Kim, Dokyoon
author_sort Lee, Garam
collection PubMed
description As large amounts of heterogeneous biomedical data become available, numerous methods for integrating such datasets have been developed to extract complementary knowledge from multiple domains of sources. Recently, a deep learning approach has shown promising results in a variety of research areas. However, applying the deep learning approach requires expertise for constructing a deep architecture that can take multimodal longitudinal data. Thus, in this paper, a deep learning-based python package for data integration is developed. The python package deep learning-based multimodal longitudinal data integration framework (MildInt) provides the preconstructed deep learning architecture for a classification task. MildInt contains two learning phases: learning feature representation from each modality of data and training a classifier for the final decision. Adopting deep architecture in the first phase leads to learning more task-relevant feature representation than a linear model. In the second phase, linear regression classifier is used for detecting and investigating biomarkers from multimodal data. Thus, by combining the linear model and the deep learning model, higher accuracy and better interpretability can be achieved. We validated the performance of our package using simulation data and real data. For the real data, as a pilot study, we used clinical and multimodal neuroimaging datasets in Alzheimer’s disease to predict the disease progression. MildInt is capable of integrating multiple forms of numerical data including time series and non-time series data for extracting complementary features from the multimodal dataset.
format Online
Article
Text
id pubmed-6611503
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-66115032019-07-17 MildInt: Deep Learning-Based Multimodal Longitudinal Data Integration Framework Lee, Garam Kang, Byungkon Nho, Kwangsik Sohn, Kyung-Ah Kim, Dokyoon Front Genet Genetics As large amounts of heterogeneous biomedical data become available, numerous methods for integrating such datasets have been developed to extract complementary knowledge from multiple domains of sources. Recently, a deep learning approach has shown promising results in a variety of research areas. However, applying the deep learning approach requires expertise for constructing a deep architecture that can take multimodal longitudinal data. Thus, in this paper, a deep learning-based python package for data integration is developed. The python package deep learning-based multimodal longitudinal data integration framework (MildInt) provides the preconstructed deep learning architecture for a classification task. MildInt contains two learning phases: learning feature representation from each modality of data and training a classifier for the final decision. Adopting deep architecture in the first phase leads to learning more task-relevant feature representation than a linear model. In the second phase, linear regression classifier is used for detecting and investigating biomarkers from multimodal data. Thus, by combining the linear model and the deep learning model, higher accuracy and better interpretability can be achieved. We validated the performance of our package using simulation data and real data. For the real data, as a pilot study, we used clinical and multimodal neuroimaging datasets in Alzheimer’s disease to predict the disease progression. MildInt is capable of integrating multiple forms of numerical data including time series and non-time series data for extracting complementary features from the multimodal dataset. Frontiers Media S.A. 2019-06-28 /pmc/articles/PMC6611503/ /pubmed/31316553 http://dx.doi.org/10.3389/fgene.2019.00617 Text en Copyright © 2019 Lee, Kang, Nho, Sohn and Kim http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Lee, Garam
Kang, Byungkon
Nho, Kwangsik
Sohn, Kyung-Ah
Kim, Dokyoon
MildInt: Deep Learning-Based Multimodal Longitudinal Data Integration Framework
title MildInt: Deep Learning-Based Multimodal Longitudinal Data Integration Framework
title_full MildInt: Deep Learning-Based Multimodal Longitudinal Data Integration Framework
title_fullStr MildInt: Deep Learning-Based Multimodal Longitudinal Data Integration Framework
title_full_unstemmed MildInt: Deep Learning-Based Multimodal Longitudinal Data Integration Framework
title_short MildInt: Deep Learning-Based Multimodal Longitudinal Data Integration Framework
title_sort mildint: deep learning-based multimodal longitudinal data integration framework
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6611503/
https://www.ncbi.nlm.nih.gov/pubmed/31316553
http://dx.doi.org/10.3389/fgene.2019.00617
work_keys_str_mv AT leegaram mildintdeeplearningbasedmultimodallongitudinaldataintegrationframework
AT kangbyungkon mildintdeeplearningbasedmultimodallongitudinaldataintegrationframework
AT nhokwangsik mildintdeeplearningbasedmultimodallongitudinaldataintegrationframework
AT sohnkyungah mildintdeeplearningbasedmultimodallongitudinaldataintegrationframework
AT kimdokyoon mildintdeeplearningbasedmultimodallongitudinaldataintegrationframework