Cargando…

Investigation of REFINED CNN ensemble learning for anti-cancer drug sensitivity prediction

MOTIVATION: Anti-cancer drug sensitivity prediction using deep learning models for individual cell line is a significant challenge in personalized medicine. Recently developed REFINED (REpresentation of Features as Images with NEighborhood Dependencies) CNN (Convolutional Neural Network)-based model...

Descripción completa

Detalles Bibliográficos
Autores principales: Bazgir, Omid, Ghosh, Souparno, Pal, Ranadip
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8275339/
https://www.ncbi.nlm.nih.gov/pubmed/34252971
http://dx.doi.org/10.1093/bioinformatics/btab336
_version_ 1783721693524525056
author Bazgir, Omid
Ghosh, Souparno
Pal, Ranadip
author_facet Bazgir, Omid
Ghosh, Souparno
Pal, Ranadip
author_sort Bazgir, Omid
collection PubMed
description MOTIVATION: Anti-cancer drug sensitivity prediction using deep learning models for individual cell line is a significant challenge in personalized medicine. Recently developed REFINED (REpresentation of Features as Images with NEighborhood Dependencies) CNN (Convolutional Neural Network)-based models have shown promising results in improving drug sensitivity prediction. The primary idea behind REFINED-CNN is representing high dimensional vectors as compact images with spatial correlations that can benefit from CNN architectures. However, the mapping from a high dimensional vector to a compact 2D image depends on the a priori choice of the distance metric and projection scheme with limited empirical procedures guiding these choices. RESULTS: In this article, we consider an ensemble of REFINED-CNN built under different choices of distance metrics and/or projection schemes that can improve upon a single projection based REFINED-CNN model. Results, illustrated using NCI60 and NCI-ALMANAC databases, demonstrate that the ensemble approaches can provide significant improvement in prediction performance as compared to individual models. We also develop the theoretical framework for combining different distance metrics to arrive at a single 2D mapping. Results demonstrated that distance-averaged REFINED-CNN produced comparable performance as obtained from stacking REFINED-CNN ensemble but with significantly lower computational cost. AVAILABILITY AND IMPLEMENTATION: The source code, scripts, and data used in the paper have been deposited in GitHub (https://github.com/omidbazgirTTU/IntegratedREFINED). SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-8275339
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-82753392021-07-13 Investigation of REFINED CNN ensemble learning for anti-cancer drug sensitivity prediction Bazgir, Omid Ghosh, Souparno Pal, Ranadip Bioinformatics Biomedical Informatics MOTIVATION: Anti-cancer drug sensitivity prediction using deep learning models for individual cell line is a significant challenge in personalized medicine. Recently developed REFINED (REpresentation of Features as Images with NEighborhood Dependencies) CNN (Convolutional Neural Network)-based models have shown promising results in improving drug sensitivity prediction. The primary idea behind REFINED-CNN is representing high dimensional vectors as compact images with spatial correlations that can benefit from CNN architectures. However, the mapping from a high dimensional vector to a compact 2D image depends on the a priori choice of the distance metric and projection scheme with limited empirical procedures guiding these choices. RESULTS: In this article, we consider an ensemble of REFINED-CNN built under different choices of distance metrics and/or projection schemes that can improve upon a single projection based REFINED-CNN model. Results, illustrated using NCI60 and NCI-ALMANAC databases, demonstrate that the ensemble approaches can provide significant improvement in prediction performance as compared to individual models. We also develop the theoretical framework for combining different distance metrics to arrive at a single 2D mapping. Results demonstrated that distance-averaged REFINED-CNN produced comparable performance as obtained from stacking REFINED-CNN ensemble but with significantly lower computational cost. AVAILABILITY AND IMPLEMENTATION: The source code, scripts, and data used in the paper have been deposited in GitHub (https://github.com/omidbazgirTTU/IntegratedREFINED). SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2021-07-12 /pmc/articles/PMC8275339/ /pubmed/34252971 http://dx.doi.org/10.1093/bioinformatics/btab336 Text en © The Author(s) 2021. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) ), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Biomedical Informatics
Bazgir, Omid
Ghosh, Souparno
Pal, Ranadip
Investigation of REFINED CNN ensemble learning for anti-cancer drug sensitivity prediction
title Investigation of REFINED CNN ensemble learning for anti-cancer drug sensitivity prediction
title_full Investigation of REFINED CNN ensemble learning for anti-cancer drug sensitivity prediction
title_fullStr Investigation of REFINED CNN ensemble learning for anti-cancer drug sensitivity prediction
title_full_unstemmed Investigation of REFINED CNN ensemble learning for anti-cancer drug sensitivity prediction
title_short Investigation of REFINED CNN ensemble learning for anti-cancer drug sensitivity prediction
title_sort investigation of refined cnn ensemble learning for anti-cancer drug sensitivity prediction
topic Biomedical Informatics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8275339/
https://www.ncbi.nlm.nih.gov/pubmed/34252971
http://dx.doi.org/10.1093/bioinformatics/btab336
work_keys_str_mv AT bazgiromid investigationofrefinedcnnensemblelearningforanticancerdrugsensitivityprediction
AT ghoshsouparno investigationofrefinedcnnensemblelearningforanticancerdrugsensitivityprediction
AT palranadip investigationofrefinedcnnensemblelearningforanticancerdrugsensitivityprediction