Cargando…
A biological network-based regularized artificial neural network model for robust phenotype prediction from gene expression data
BACKGROUND: Stratification of patient subpopulations that respond favorably to treatment or experience and adverse reaction is an essential step toward development of new personalized therapies and diagnostics. It is currently feasible to generate omic-scale biological measurements for all patients...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5735940/ https://www.ncbi.nlm.nih.gov/pubmed/29258445 http://dx.doi.org/10.1186/s12859-017-1984-2 |
_version_ | 1783287298647916544 |
---|---|
author | Kang, Tianyu Ding, Wei Zhang, Luoyan Ziemek, Daniel Zarringhalam, Kourosh |
author_facet | Kang, Tianyu Ding, Wei Zhang, Luoyan Ziemek, Daniel Zarringhalam, Kourosh |
author_sort | Kang, Tianyu |
collection | PubMed |
description | BACKGROUND: Stratification of patient subpopulations that respond favorably to treatment or experience and adverse reaction is an essential step toward development of new personalized therapies and diagnostics. It is currently feasible to generate omic-scale biological measurements for all patients in a study, providing an opportunity for machine learning models to identify molecular markers for disease diagnosis and progression. However, the high variability of genetic background in human populations hampers the reproducibility of omic-scale markers. In this paper, we develop a biological network-based regularized artificial neural network model for prediction of phenotype from transcriptomic measurements in clinical trials. To improve model sparsity and the overall reproducibility of the model, we incorporate regularization for simultaneous shrinkage of gene sets based on active upstream regulatory mechanisms into the model. RESULTS: We benchmark our method against various regression, support vector machines and artificial neural network models and demonstrate the ability of our method in predicting the clinical outcomes using clinical trial data on acute rejection in kidney transplantation and response to Infliximab in ulcerative colitis. We show that integration of prior biological knowledge into the classification as developed in this paper, significantly improves the robustness and generalizability of predictions to independent datasets. We provide a Java code of our algorithm along with a parsed version of the STRING DB database. CONCLUSION: In summary, we present a method for prediction of clinical phenotypes using baseline genome-wide expression data that makes use of prior biological knowledge on gene-regulatory interactions in order to increase robustness and reproducibility of omic-scale markers. The integrated group-wise regularization methods increases the interpretability of biological signatures and gives stable performance estimates across independent test sets. |
format | Online Article Text |
id | pubmed-5735940 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-57359402017-12-21 A biological network-based regularized artificial neural network model for robust phenotype prediction from gene expression data Kang, Tianyu Ding, Wei Zhang, Luoyan Ziemek, Daniel Zarringhalam, Kourosh BMC Bioinformatics Research Article BACKGROUND: Stratification of patient subpopulations that respond favorably to treatment or experience and adverse reaction is an essential step toward development of new personalized therapies and diagnostics. It is currently feasible to generate omic-scale biological measurements for all patients in a study, providing an opportunity for machine learning models to identify molecular markers for disease diagnosis and progression. However, the high variability of genetic background in human populations hampers the reproducibility of omic-scale markers. In this paper, we develop a biological network-based regularized artificial neural network model for prediction of phenotype from transcriptomic measurements in clinical trials. To improve model sparsity and the overall reproducibility of the model, we incorporate regularization for simultaneous shrinkage of gene sets based on active upstream regulatory mechanisms into the model. RESULTS: We benchmark our method against various regression, support vector machines and artificial neural network models and demonstrate the ability of our method in predicting the clinical outcomes using clinical trial data on acute rejection in kidney transplantation and response to Infliximab in ulcerative colitis. We show that integration of prior biological knowledge into the classification as developed in this paper, significantly improves the robustness and generalizability of predictions to independent datasets. We provide a Java code of our algorithm along with a parsed version of the STRING DB database. CONCLUSION: In summary, we present a method for prediction of clinical phenotypes using baseline genome-wide expression data that makes use of prior biological knowledge on gene-regulatory interactions in order to increase robustness and reproducibility of omic-scale markers. The integrated group-wise regularization methods increases the interpretability of biological signatures and gives stable performance estimates across independent test sets. BioMed Central 2017-12-19 /pmc/articles/PMC5735940/ /pubmed/29258445 http://dx.doi.org/10.1186/s12859-017-1984-2 Text en © The Author(s) 2017 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License(http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Article Kang, Tianyu Ding, Wei Zhang, Luoyan Ziemek, Daniel Zarringhalam, Kourosh A biological network-based regularized artificial neural network model for robust phenotype prediction from gene expression data |
title | A biological network-based regularized artificial neural network model for robust phenotype prediction from gene expression data |
title_full | A biological network-based regularized artificial neural network model for robust phenotype prediction from gene expression data |
title_fullStr | A biological network-based regularized artificial neural network model for robust phenotype prediction from gene expression data |
title_full_unstemmed | A biological network-based regularized artificial neural network model for robust phenotype prediction from gene expression data |
title_short | A biological network-based regularized artificial neural network model for robust phenotype prediction from gene expression data |
title_sort | biological network-based regularized artificial neural network model for robust phenotype prediction from gene expression data |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5735940/ https://www.ncbi.nlm.nih.gov/pubmed/29258445 http://dx.doi.org/10.1186/s12859-017-1984-2 |
work_keys_str_mv | AT kangtianyu abiologicalnetworkbasedregularizedartificialneuralnetworkmodelforrobustphenotypepredictionfromgeneexpressiondata AT dingwei abiologicalnetworkbasedregularizedartificialneuralnetworkmodelforrobustphenotypepredictionfromgeneexpressiondata AT zhangluoyan abiologicalnetworkbasedregularizedartificialneuralnetworkmodelforrobustphenotypepredictionfromgeneexpressiondata AT ziemekdaniel abiologicalnetworkbasedregularizedartificialneuralnetworkmodelforrobustphenotypepredictionfromgeneexpressiondata AT zarringhalamkourosh abiologicalnetworkbasedregularizedartificialneuralnetworkmodelforrobustphenotypepredictionfromgeneexpressiondata AT kangtianyu biologicalnetworkbasedregularizedartificialneuralnetworkmodelforrobustphenotypepredictionfromgeneexpressiondata AT dingwei biologicalnetworkbasedregularizedartificialneuralnetworkmodelforrobustphenotypepredictionfromgeneexpressiondata AT zhangluoyan biologicalnetworkbasedregularizedartificialneuralnetworkmodelforrobustphenotypepredictionfromgeneexpressiondata AT ziemekdaniel biologicalnetworkbasedregularizedartificialneuralnetworkmodelforrobustphenotypepredictionfromgeneexpressiondata AT zarringhalamkourosh biologicalnetworkbasedregularizedartificialneuralnetworkmodelforrobustphenotypepredictionfromgeneexpressiondata |