Cargando…

A biological network-based regularized artificial neural network model for robust phenotype prediction from gene expression data

BACKGROUND: Stratification of patient subpopulations that respond favorably to treatment or experience and adverse reaction is an essential step toward development of new personalized therapies and diagnostics. It is currently feasible to generate omic-scale biological measurements for all patients...

Descripción completa

Detalles Bibliográficos
Autores principales: Kang, Tianyu, Ding, Wei, Zhang, Luoyan, Ziemek, Daniel, Zarringhalam, Kourosh
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5735940/
https://www.ncbi.nlm.nih.gov/pubmed/29258445
http://dx.doi.org/10.1186/s12859-017-1984-2
_version_ 1783287298647916544
author Kang, Tianyu
Ding, Wei
Zhang, Luoyan
Ziemek, Daniel
Zarringhalam, Kourosh
author_facet Kang, Tianyu
Ding, Wei
Zhang, Luoyan
Ziemek, Daniel
Zarringhalam, Kourosh
author_sort Kang, Tianyu
collection PubMed
description BACKGROUND: Stratification of patient subpopulations that respond favorably to treatment or experience and adverse reaction is an essential step toward development of new personalized therapies and diagnostics. It is currently feasible to generate omic-scale biological measurements for all patients in a study, providing an opportunity for machine learning models to identify molecular markers for disease diagnosis and progression. However, the high variability of genetic background in human populations hampers the reproducibility of omic-scale markers. In this paper, we develop a biological network-based regularized artificial neural network model for prediction of phenotype from transcriptomic measurements in clinical trials. To improve model sparsity and the overall reproducibility of the model, we incorporate regularization for simultaneous shrinkage of gene sets based on active upstream regulatory mechanisms into the model. RESULTS: We benchmark our method against various regression, support vector machines and artificial neural network models and demonstrate the ability of our method in predicting the clinical outcomes using clinical trial data on acute rejection in kidney transplantation and response to Infliximab in ulcerative colitis. We show that integration of prior biological knowledge into the classification as developed in this paper, significantly improves the robustness and generalizability of predictions to independent datasets. We provide a Java code of our algorithm along with a parsed version of the STRING DB database. CONCLUSION: In summary, we present a method for prediction of clinical phenotypes using baseline genome-wide expression data that makes use of prior biological knowledge on gene-regulatory interactions in order to increase robustness and reproducibility of omic-scale markers. The integrated group-wise regularization methods increases the interpretability of biological signatures and gives stable performance estimates across independent test sets.
format Online
Article
Text
id pubmed-5735940
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-57359402017-12-21 A biological network-based regularized artificial neural network model for robust phenotype prediction from gene expression data Kang, Tianyu Ding, Wei Zhang, Luoyan Ziemek, Daniel Zarringhalam, Kourosh BMC Bioinformatics Research Article BACKGROUND: Stratification of patient subpopulations that respond favorably to treatment or experience and adverse reaction is an essential step toward development of new personalized therapies and diagnostics. It is currently feasible to generate omic-scale biological measurements for all patients in a study, providing an opportunity for machine learning models to identify molecular markers for disease diagnosis and progression. However, the high variability of genetic background in human populations hampers the reproducibility of omic-scale markers. In this paper, we develop a biological network-based regularized artificial neural network model for prediction of phenotype from transcriptomic measurements in clinical trials. To improve model sparsity and the overall reproducibility of the model, we incorporate regularization for simultaneous shrinkage of gene sets based on active upstream regulatory mechanisms into the model. RESULTS: We benchmark our method against various regression, support vector machines and artificial neural network models and demonstrate the ability of our method in predicting the clinical outcomes using clinical trial data on acute rejection in kidney transplantation and response to Infliximab in ulcerative colitis. We show that integration of prior biological knowledge into the classification as developed in this paper, significantly improves the robustness and generalizability of predictions to independent datasets. We provide a Java code of our algorithm along with a parsed version of the STRING DB database. CONCLUSION: In summary, we present a method for prediction of clinical phenotypes using baseline genome-wide expression data that makes use of prior biological knowledge on gene-regulatory interactions in order to increase robustness and reproducibility of omic-scale markers. The integrated group-wise regularization methods increases the interpretability of biological signatures and gives stable performance estimates across independent test sets. BioMed Central 2017-12-19 /pmc/articles/PMC5735940/ /pubmed/29258445 http://dx.doi.org/10.1186/s12859-017-1984-2 Text en © The Author(s) 2017 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License(http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Kang, Tianyu
Ding, Wei
Zhang, Luoyan
Ziemek, Daniel
Zarringhalam, Kourosh
A biological network-based regularized artificial neural network model for robust phenotype prediction from gene expression data
title A biological network-based regularized artificial neural network model for robust phenotype prediction from gene expression data
title_full A biological network-based regularized artificial neural network model for robust phenotype prediction from gene expression data
title_fullStr A biological network-based regularized artificial neural network model for robust phenotype prediction from gene expression data
title_full_unstemmed A biological network-based regularized artificial neural network model for robust phenotype prediction from gene expression data
title_short A biological network-based regularized artificial neural network model for robust phenotype prediction from gene expression data
title_sort biological network-based regularized artificial neural network model for robust phenotype prediction from gene expression data
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5735940/
https://www.ncbi.nlm.nih.gov/pubmed/29258445
http://dx.doi.org/10.1186/s12859-017-1984-2
work_keys_str_mv AT kangtianyu abiologicalnetworkbasedregularizedartificialneuralnetworkmodelforrobustphenotypepredictionfromgeneexpressiondata
AT dingwei abiologicalnetworkbasedregularizedartificialneuralnetworkmodelforrobustphenotypepredictionfromgeneexpressiondata
AT zhangluoyan abiologicalnetworkbasedregularizedartificialneuralnetworkmodelforrobustphenotypepredictionfromgeneexpressiondata
AT ziemekdaniel abiologicalnetworkbasedregularizedartificialneuralnetworkmodelforrobustphenotypepredictionfromgeneexpressiondata
AT zarringhalamkourosh abiologicalnetworkbasedregularizedartificialneuralnetworkmodelforrobustphenotypepredictionfromgeneexpressiondata
AT kangtianyu biologicalnetworkbasedregularizedartificialneuralnetworkmodelforrobustphenotypepredictionfromgeneexpressiondata
AT dingwei biologicalnetworkbasedregularizedartificialneuralnetworkmodelforrobustphenotypepredictionfromgeneexpressiondata
AT zhangluoyan biologicalnetworkbasedregularizedartificialneuralnetworkmodelforrobustphenotypepredictionfromgeneexpressiondata
AT ziemekdaniel biologicalnetworkbasedregularizedartificialneuralnetworkmodelforrobustphenotypepredictionfromgeneexpressiondata
AT zarringhalamkourosh biologicalnetworkbasedregularizedartificialneuralnetworkmodelforrobustphenotypepredictionfromgeneexpressiondata