Cargando…

Converting tabular data into images for deep learning with convolutional neural networks

Convolutional neural networks (CNNs) have been successfully used in many applications where important information about data is embedded in the order of features, such as speech and imaging. However, most tabular data do not assume a spatial relationship between features, and thus are unsuitable for...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhu, Yitan, Brettin, Thomas, Xia, Fangfang, Partin, Alexander, Shukla, Maulik, Yoo, Hyunseung, Evrard, Yvonne A., Doroshow, James H., Stevens, Rick L.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8166880/
https://www.ncbi.nlm.nih.gov/pubmed/34059739
http://dx.doi.org/10.1038/s41598-021-90923-y
_version_ 1783701588089503744
author Zhu, Yitan
Brettin, Thomas
Xia, Fangfang
Partin, Alexander
Shukla, Maulik
Yoo, Hyunseung
Evrard, Yvonne A.
Doroshow, James H.
Stevens, Rick L.
author_facet Zhu, Yitan
Brettin, Thomas
Xia, Fangfang
Partin, Alexander
Shukla, Maulik
Yoo, Hyunseung
Evrard, Yvonne A.
Doroshow, James H.
Stevens, Rick L.
author_sort Zhu, Yitan
collection PubMed
description Convolutional neural networks (CNNs) have been successfully used in many applications where important information about data is embedded in the order of features, such as speech and imaging. However, most tabular data do not assume a spatial relationship between features, and thus are unsuitable for modeling using CNNs. To meet this challenge, we develop a novel algorithm, image generator for tabular data (IGTD), to transform tabular data into images by assigning features to pixel positions so that similar features are close to each other in the image. The algorithm searches for an optimized assignment by minimizing the difference between the ranking of distances between features and the ranking of distances between their assigned pixels in the image. We apply IGTD to transform gene expression profiles of cancer cell lines (CCLs) and molecular descriptors of drugs into their respective image representations. Compared with existing transformation methods, IGTD generates compact image representations with better preservation of feature neighborhood structure. Evaluated on benchmark drug screening datasets, CNNs trained on IGTD image representations of CCLs and drugs exhibit a better performance of predicting anti-cancer drug response than both CNNs trained on alternative image representations and prediction models trained on the original tabular data.
format Online
Article
Text
id pubmed-8166880
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-81668802021-06-01 Converting tabular data into images for deep learning with convolutional neural networks Zhu, Yitan Brettin, Thomas Xia, Fangfang Partin, Alexander Shukla, Maulik Yoo, Hyunseung Evrard, Yvonne A. Doroshow, James H. Stevens, Rick L. Sci Rep Article Convolutional neural networks (CNNs) have been successfully used in many applications where important information about data is embedded in the order of features, such as speech and imaging. However, most tabular data do not assume a spatial relationship between features, and thus are unsuitable for modeling using CNNs. To meet this challenge, we develop a novel algorithm, image generator for tabular data (IGTD), to transform tabular data into images by assigning features to pixel positions so that similar features are close to each other in the image. The algorithm searches for an optimized assignment by minimizing the difference between the ranking of distances between features and the ranking of distances between their assigned pixels in the image. We apply IGTD to transform gene expression profiles of cancer cell lines (CCLs) and molecular descriptors of drugs into their respective image representations. Compared with existing transformation methods, IGTD generates compact image representations with better preservation of feature neighborhood structure. Evaluated on benchmark drug screening datasets, CNNs trained on IGTD image representations of CCLs and drugs exhibit a better performance of predicting anti-cancer drug response than both CNNs trained on alternative image representations and prediction models trained on the original tabular data. Nature Publishing Group UK 2021-05-31 /pmc/articles/PMC8166880/ /pubmed/34059739 http://dx.doi.org/10.1038/s41598-021-90923-y Text en © This is a U.S. Government work and not under copyright protection in the US; foreign copyright protection may apply 2021 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Zhu, Yitan
Brettin, Thomas
Xia, Fangfang
Partin, Alexander
Shukla, Maulik
Yoo, Hyunseung
Evrard, Yvonne A.
Doroshow, James H.
Stevens, Rick L.
Converting tabular data into images for deep learning with convolutional neural networks
title Converting tabular data into images for deep learning with convolutional neural networks
title_full Converting tabular data into images for deep learning with convolutional neural networks
title_fullStr Converting tabular data into images for deep learning with convolutional neural networks
title_full_unstemmed Converting tabular data into images for deep learning with convolutional neural networks
title_short Converting tabular data into images for deep learning with convolutional neural networks
title_sort converting tabular data into images for deep learning with convolutional neural networks
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8166880/
https://www.ncbi.nlm.nih.gov/pubmed/34059739
http://dx.doi.org/10.1038/s41598-021-90923-y
work_keys_str_mv AT zhuyitan convertingtabulardataintoimagesfordeeplearningwithconvolutionalneuralnetworks
AT brettinthomas convertingtabulardataintoimagesfordeeplearningwithconvolutionalneuralnetworks
AT xiafangfang convertingtabulardataintoimagesfordeeplearningwithconvolutionalneuralnetworks
AT partinalexander convertingtabulardataintoimagesfordeeplearningwithconvolutionalneuralnetworks
AT shuklamaulik convertingtabulardataintoimagesfordeeplearningwithconvolutionalneuralnetworks
AT yoohyunseung convertingtabulardataintoimagesfordeeplearningwithconvolutionalneuralnetworks
AT evrardyvonnea convertingtabulardataintoimagesfordeeplearningwithconvolutionalneuralnetworks
AT doroshowjamesh convertingtabulardataintoimagesfordeeplearningwithconvolutionalneuralnetworks
AT stevensrickl convertingtabulardataintoimagesfordeeplearningwithconvolutionalneuralnetworks