Cargando…

Integrative Deep Learning for Identifying Differentially Expressed (DE) Biomarkers

As a large amount of genetic data are accumulated, an effective analytical method and a significant interpretation are required. Recently, various methods of machine learning have emerged to process genetic data. In addition, machine learning analysis tools using statistical models have been propose...

Descripción completa

Detalles Bibliográficos
Autores principales: Lim, Jayeon, Bang, SoYoun, Kim, Jiyeon, Park, Cheolyong, Cho, JunSang, Kim, SungHwan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6935456/
https://www.ncbi.nlm.nih.gov/pubmed/31915462
http://dx.doi.org/10.1155/2019/8418760
_version_ 1783483578619789312
author Lim, Jayeon
Bang, SoYoun
Kim, Jiyeon
Park, Cheolyong
Cho, JunSang
Kim, SungHwan
author_facet Lim, Jayeon
Bang, SoYoun
Kim, Jiyeon
Park, Cheolyong
Cho, JunSang
Kim, SungHwan
author_sort Lim, Jayeon
collection PubMed
description As a large amount of genetic data are accumulated, an effective analytical method and a significant interpretation are required. Recently, various methods of machine learning have emerged to process genetic data. In addition, machine learning analysis tools using statistical models have been proposed. In this study, we propose adding an integrated layer to the deep learning structure, which would enable the effective analysis of genetic data and the discovery of significant biomarkers of diseases. We conducted a simulation study in order to compare the proposed method with metalogistic regression and meta-SVM methods. The objective function with lasso penalty is used for parameter estimation, and the Youden J index is used for model comparison. The simulation results indicate that the proposed method is more robust for the variance of the data than metalogistic regression and meta-SVM methods. We also conducted real data (breast cancer data (TCGA)) analysis. Based on the results of gene set enrichment analysis, we obtained that TCGA multiple omics data involve significantly enriched pathways which contain information related to breast cancer. Therefore, it is expected that the proposed method will be helpful to discover biomarkers.
format Online
Article
Text
id pubmed-6935456
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Hindawi
record_format MEDLINE/PubMed
spelling pubmed-69354562020-01-08 Integrative Deep Learning for Identifying Differentially Expressed (DE) Biomarkers Lim, Jayeon Bang, SoYoun Kim, Jiyeon Park, Cheolyong Cho, JunSang Kim, SungHwan Comput Math Methods Med Research Article As a large amount of genetic data are accumulated, an effective analytical method and a significant interpretation are required. Recently, various methods of machine learning have emerged to process genetic data. In addition, machine learning analysis tools using statistical models have been proposed. In this study, we propose adding an integrated layer to the deep learning structure, which would enable the effective analysis of genetic data and the discovery of significant biomarkers of diseases. We conducted a simulation study in order to compare the proposed method with metalogistic regression and meta-SVM methods. The objective function with lasso penalty is used for parameter estimation, and the Youden J index is used for model comparison. The simulation results indicate that the proposed method is more robust for the variance of the data than metalogistic regression and meta-SVM methods. We also conducted real data (breast cancer data (TCGA)) analysis. Based on the results of gene set enrichment analysis, we obtained that TCGA multiple omics data involve significantly enriched pathways which contain information related to breast cancer. Therefore, it is expected that the proposed method will be helpful to discover biomarkers. Hindawi 2019-11-02 /pmc/articles/PMC6935456/ /pubmed/31915462 http://dx.doi.org/10.1155/2019/8418760 Text en Copyright © 2019 Jayeon Lim et al. http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Lim, Jayeon
Bang, SoYoun
Kim, Jiyeon
Park, Cheolyong
Cho, JunSang
Kim, SungHwan
Integrative Deep Learning for Identifying Differentially Expressed (DE) Biomarkers
title Integrative Deep Learning for Identifying Differentially Expressed (DE) Biomarkers
title_full Integrative Deep Learning for Identifying Differentially Expressed (DE) Biomarkers
title_fullStr Integrative Deep Learning for Identifying Differentially Expressed (DE) Biomarkers
title_full_unstemmed Integrative Deep Learning for Identifying Differentially Expressed (DE) Biomarkers
title_short Integrative Deep Learning for Identifying Differentially Expressed (DE) Biomarkers
title_sort integrative deep learning for identifying differentially expressed (de) biomarkers
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6935456/
https://www.ncbi.nlm.nih.gov/pubmed/31915462
http://dx.doi.org/10.1155/2019/8418760
work_keys_str_mv AT limjayeon integrativedeeplearningforidentifyingdifferentiallyexpresseddebiomarkers
AT bangsoyoun integrativedeeplearningforidentifyingdifferentiallyexpresseddebiomarkers
AT kimjiyeon integrativedeeplearningforidentifyingdifferentiallyexpresseddebiomarkers
AT parkcheolyong integrativedeeplearningforidentifyingdifferentiallyexpresseddebiomarkers
AT chojunsang integrativedeeplearningforidentifyingdifferentiallyexpresseddebiomarkers
AT kimsunghwan integrativedeeplearningforidentifyingdifferentiallyexpresseddebiomarkers