Cargando…
Converting tabular data into images for deep learning with convolutional neural networks
Convolutional neural networks (CNNs) have been successfully used in many applications where important information about data is embedded in the order of features, such as speech and imaging. However, most tabular data do not assume a spatial relationship between features, and thus are unsuitable for...
Autores principales: | , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8166880/ https://www.ncbi.nlm.nih.gov/pubmed/34059739 http://dx.doi.org/10.1038/s41598-021-90923-y |
_version_ | 1783701588089503744 |
---|---|
author | Zhu, Yitan Brettin, Thomas Xia, Fangfang Partin, Alexander Shukla, Maulik Yoo, Hyunseung Evrard, Yvonne A. Doroshow, James H. Stevens, Rick L. |
author_facet | Zhu, Yitan Brettin, Thomas Xia, Fangfang Partin, Alexander Shukla, Maulik Yoo, Hyunseung Evrard, Yvonne A. Doroshow, James H. Stevens, Rick L. |
author_sort | Zhu, Yitan |
collection | PubMed |
description | Convolutional neural networks (CNNs) have been successfully used in many applications where important information about data is embedded in the order of features, such as speech and imaging. However, most tabular data do not assume a spatial relationship between features, and thus are unsuitable for modeling using CNNs. To meet this challenge, we develop a novel algorithm, image generator for tabular data (IGTD), to transform tabular data into images by assigning features to pixel positions so that similar features are close to each other in the image. The algorithm searches for an optimized assignment by minimizing the difference between the ranking of distances between features and the ranking of distances between their assigned pixels in the image. We apply IGTD to transform gene expression profiles of cancer cell lines (CCLs) and molecular descriptors of drugs into their respective image representations. Compared with existing transformation methods, IGTD generates compact image representations with better preservation of feature neighborhood structure. Evaluated on benchmark drug screening datasets, CNNs trained on IGTD image representations of CCLs and drugs exhibit a better performance of predicting anti-cancer drug response than both CNNs trained on alternative image representations and prediction models trained on the original tabular data. |
format | Online Article Text |
id | pubmed-8166880 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-81668802021-06-01 Converting tabular data into images for deep learning with convolutional neural networks Zhu, Yitan Brettin, Thomas Xia, Fangfang Partin, Alexander Shukla, Maulik Yoo, Hyunseung Evrard, Yvonne A. Doroshow, James H. Stevens, Rick L. Sci Rep Article Convolutional neural networks (CNNs) have been successfully used in many applications where important information about data is embedded in the order of features, such as speech and imaging. However, most tabular data do not assume a spatial relationship between features, and thus are unsuitable for modeling using CNNs. To meet this challenge, we develop a novel algorithm, image generator for tabular data (IGTD), to transform tabular data into images by assigning features to pixel positions so that similar features are close to each other in the image. The algorithm searches for an optimized assignment by minimizing the difference between the ranking of distances between features and the ranking of distances between their assigned pixels in the image. We apply IGTD to transform gene expression profiles of cancer cell lines (CCLs) and molecular descriptors of drugs into their respective image representations. Compared with existing transformation methods, IGTD generates compact image representations with better preservation of feature neighborhood structure. Evaluated on benchmark drug screening datasets, CNNs trained on IGTD image representations of CCLs and drugs exhibit a better performance of predicting anti-cancer drug response than both CNNs trained on alternative image representations and prediction models trained on the original tabular data. Nature Publishing Group UK 2021-05-31 /pmc/articles/PMC8166880/ /pubmed/34059739 http://dx.doi.org/10.1038/s41598-021-90923-y Text en © This is a U.S. Government work and not under copyright protection in the US; foreign copyright protection may apply 2021 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Zhu, Yitan Brettin, Thomas Xia, Fangfang Partin, Alexander Shukla, Maulik Yoo, Hyunseung Evrard, Yvonne A. Doroshow, James H. Stevens, Rick L. Converting tabular data into images for deep learning with convolutional neural networks |
title | Converting tabular data into images for deep learning with convolutional neural networks |
title_full | Converting tabular data into images for deep learning with convolutional neural networks |
title_fullStr | Converting tabular data into images for deep learning with convolutional neural networks |
title_full_unstemmed | Converting tabular data into images for deep learning with convolutional neural networks |
title_short | Converting tabular data into images for deep learning with convolutional neural networks |
title_sort | converting tabular data into images for deep learning with convolutional neural networks |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8166880/ https://www.ncbi.nlm.nih.gov/pubmed/34059739 http://dx.doi.org/10.1038/s41598-021-90923-y |
work_keys_str_mv | AT zhuyitan convertingtabulardataintoimagesfordeeplearningwithconvolutionalneuralnetworks AT brettinthomas convertingtabulardataintoimagesfordeeplearningwithconvolutionalneuralnetworks AT xiafangfang convertingtabulardataintoimagesfordeeplearningwithconvolutionalneuralnetworks AT partinalexander convertingtabulardataintoimagesfordeeplearningwithconvolutionalneuralnetworks AT shuklamaulik convertingtabulardataintoimagesfordeeplearningwithconvolutionalneuralnetworks AT yoohyunseung convertingtabulardataintoimagesfordeeplearningwithconvolutionalneuralnetworks AT evrardyvonnea convertingtabulardataintoimagesfordeeplearningwithconvolutionalneuralnetworks AT doroshowjamesh convertingtabulardataintoimagesfordeeplearningwithconvolutionalneuralnetworks AT stevensrickl convertingtabulardataintoimagesfordeeplearningwithconvolutionalneuralnetworks |