Cargando…

A Copula Based Approach for Design of Multivariate Random Forests for Drug Sensitivity Prediction

Modeling sensitivity to drugs based on genetic characterizations is a significant challenge in the area of systems medicine. Ensemble based approaches such as Random Forests have been shown to perform well in both individual sensitivity prediction studies and team science based prediction challenges...

Descripción completa

Detalles Bibliográficos
Autores principales: Haider, Saad, Rahman, Raziur, Ghosh, Souparno, Pal, Ranadip
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4684346/
https://www.ncbi.nlm.nih.gov/pubmed/26658256
http://dx.doi.org/10.1371/journal.pone.0144490
_version_ 1782406174572806144
author Haider, Saad
Rahman, Raziur
Ghosh, Souparno
Pal, Ranadip
author_facet Haider, Saad
Rahman, Raziur
Ghosh, Souparno
Pal, Ranadip
author_sort Haider, Saad
collection PubMed
description Modeling sensitivity to drugs based on genetic characterizations is a significant challenge in the area of systems medicine. Ensemble based approaches such as Random Forests have been shown to perform well in both individual sensitivity prediction studies and team science based prediction challenges. However, Random Forests generate a deterministic predictive model for each drug based on the genetic characterization of the cell lines and ignores the relationship between different drug sensitivities during model generation. This application motivates the need for generation of multivariate ensemble learning techniques that can increase prediction accuracy and improve variable importance ranking by incorporating the relationships between different output responses. In this article, we propose a novel cost criterion that captures the dissimilarity in the output response structure between the training data and node samples as the difference in the two empirical copulas. We illustrate that copulas are suitable for capturing the multivariate structure of output responses independent of the marginal distributions and the copula based multivariate random forest framework can provide higher accuracy prediction and improved variable selection. The proposed framework has been validated on genomics of drug sensitivity for cancer and cancer cell line encyclopedia database.
format Online
Article
Text
id pubmed-4684346
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-46843462015-12-31 A Copula Based Approach for Design of Multivariate Random Forests for Drug Sensitivity Prediction Haider, Saad Rahman, Raziur Ghosh, Souparno Pal, Ranadip PLoS One Research Article Modeling sensitivity to drugs based on genetic characterizations is a significant challenge in the area of systems medicine. Ensemble based approaches such as Random Forests have been shown to perform well in both individual sensitivity prediction studies and team science based prediction challenges. However, Random Forests generate a deterministic predictive model for each drug based on the genetic characterization of the cell lines and ignores the relationship between different drug sensitivities during model generation. This application motivates the need for generation of multivariate ensemble learning techniques that can increase prediction accuracy and improve variable importance ranking by incorporating the relationships between different output responses. In this article, we propose a novel cost criterion that captures the dissimilarity in the output response structure between the training data and node samples as the difference in the two empirical copulas. We illustrate that copulas are suitable for capturing the multivariate structure of output responses independent of the marginal distributions and the copula based multivariate random forest framework can provide higher accuracy prediction and improved variable selection. The proposed framework has been validated on genomics of drug sensitivity for cancer and cancer cell line encyclopedia database. Public Library of Science 2015-12-10 /pmc/articles/PMC4684346/ /pubmed/26658256 http://dx.doi.org/10.1371/journal.pone.0144490 Text en © 2015 Haider et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Haider, Saad
Rahman, Raziur
Ghosh, Souparno
Pal, Ranadip
A Copula Based Approach for Design of Multivariate Random Forests for Drug Sensitivity Prediction
title A Copula Based Approach for Design of Multivariate Random Forests for Drug Sensitivity Prediction
title_full A Copula Based Approach for Design of Multivariate Random Forests for Drug Sensitivity Prediction
title_fullStr A Copula Based Approach for Design of Multivariate Random Forests for Drug Sensitivity Prediction
title_full_unstemmed A Copula Based Approach for Design of Multivariate Random Forests for Drug Sensitivity Prediction
title_short A Copula Based Approach for Design of Multivariate Random Forests for Drug Sensitivity Prediction
title_sort copula based approach for design of multivariate random forests for drug sensitivity prediction
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4684346/
https://www.ncbi.nlm.nih.gov/pubmed/26658256
http://dx.doi.org/10.1371/journal.pone.0144490
work_keys_str_mv AT haidersaad acopulabasedapproachfordesignofmultivariaterandomforestsfordrugsensitivityprediction
AT rahmanraziur acopulabasedapproachfordesignofmultivariaterandomforestsfordrugsensitivityprediction
AT ghoshsouparno acopulabasedapproachfordesignofmultivariaterandomforestsfordrugsensitivityprediction
AT palranadip acopulabasedapproachfordesignofmultivariaterandomforestsfordrugsensitivityprediction
AT haidersaad copulabasedapproachfordesignofmultivariaterandomforestsfordrugsensitivityprediction
AT rahmanraziur copulabasedapproachfordesignofmultivariaterandomforestsfordrugsensitivityprediction
AT ghoshsouparno copulabasedapproachfordesignofmultivariaterandomforestsfordrugsensitivityprediction
AT palranadip copulabasedapproachfordesignofmultivariaterandomforestsfordrugsensitivityprediction