Cargando…

Converting tabular data into images for deep learning with convolutional neural networks

Convolutional neural networks (CNNs) have been successfully used in many applications where important information about data is embedded in the order of features, such as speech and imaging. However, most tabular data do not assume a spatial relationship between features, and thus are unsuitable for...

Descripción completa

Detalles Bibliográficos
Autores principales:	Zhu, Yitan, Brettin, Thomas, Xia, Fangfang, Partin, Alexander, Shukla, Maulik, Yoo, Hyunseung, Evrard, Yvonne A., Doroshow, James H., Stevens, Rick L.
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Nature Publishing Group UK 2021
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8166880/ https://www.ncbi.nlm.nih.gov/pubmed/34059739 http://dx.doi.org/10.1038/s41598-021-90923-y

_version_	1783701588089503744
author	Zhu, Yitan Brettin, Thomas Xia, Fangfang Partin, Alexander Shukla, Maulik Yoo, Hyunseung Evrard, Yvonne A. Doroshow, James H. Stevens, Rick L.
author_facet	Zhu, Yitan Brettin, Thomas Xia, Fangfang Partin, Alexander Shukla, Maulik Yoo, Hyunseung Evrard, Yvonne A. Doroshow, James H. Stevens, Rick L.
author_sort	Zhu, Yitan
collection	PubMed
description	Convolutional neural networks (CNNs) have been successfully used in many applications where important information about data is embedded in the order of features, such as speech and imaging. However, most tabular data do not assume a spatial relationship between features, and thus are unsuitable for modeling using CNNs. To meet this challenge, we develop a novel algorithm, image generator for tabular data (IGTD), to transform tabular data into images by assigning features to pixel positions so that similar features are close to each other in the image. The algorithm searches for an optimized assignment by minimizing the difference between the ranking of distances between features and the ranking of distances between their assigned pixels in the image. We apply IGTD to transform gene expression profiles of cancer cell lines (CCLs) and molecular descriptors of drugs into their respective image representations. Compared with existing transformation methods, IGTD generates compact image representations with better preservation of feature neighborhood structure. Evaluated on benchmark drug screening datasets, CNNs trained on IGTD image representations of CCLs and drugs exhibit a better performance of predicting anti-cancer drug response than both CNNs trained on alternative image representations and prediction models trained on the original tabular data.
format	Online Article Text
id	pubmed-8166880
institution	National Center for Biotechnology Information
language	English
publishDate	2021
publisher	Nature Publishing Group UK
record_format	MEDLINE/PubMed
spelling	pubmed-81668802021-06-01 Converting tabular data into images for deep learning with convolutional neural networks Zhu, Yitan Brettin, Thomas Xia, Fangfang Partin, Alexander Shukla, Maulik Yoo, Hyunseung Evrard, Yvonne A. Doroshow, James H. Stevens, Rick L. Sci Rep Article Convolutional neural networks (CNNs) have been successfully used in many applications where important information about data is embedded in the order of features, such as speech and imaging. However, most tabular data do not assume a spatial relationship between features, and thus are unsuitable for modeling using CNNs. To meet this challenge, we develop a novel algorithm, image generator for tabular data (IGTD), to transform tabular data into images by assigning features to pixel positions so that similar features are close to each other in the image. The algorithm searches for an optimized assignment by minimizing the difference between the ranking of distances between features and the ranking of distances between their assigned pixels in the image. We apply IGTD to transform gene expression profiles of cancer cell lines (CCLs) and molecular descriptors of drugs into their respective image representations. Compared with existing transformation methods, IGTD generates compact image representations with better preservation of feature neighborhood structure. Evaluated on benchmark drug screening datasets, CNNs trained on IGTD image representations of CCLs and drugs exhibit a better performance of predicting anti-cancer drug response than both CNNs trained on alternative image representations and prediction models trained on the original tabular data. Nature Publishing Group UK 2021-05-31 /pmc/articles/PMC8166880/ /pubmed/34059739 http://dx.doi.org/10.1038/s41598-021-90923-y Text en © This is a U.S. Government work and not under copyright protection in the US; foreign copyright protection may apply 2021 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle	Article Zhu, Yitan Brettin, Thomas Xia, Fangfang Partin, Alexander Shukla, Maulik Yoo, Hyunseung Evrard, Yvonne A. Doroshow, James H. Stevens, Rick L. Converting tabular data into images for deep learning with convolutional neural networks
title	Converting tabular data into images for deep learning with convolutional neural networks
title_full	Converting tabular data into images for deep learning with convolutional neural networks
title_fullStr	Converting tabular data into images for deep learning with convolutional neural networks
title_full_unstemmed	Converting tabular data into images for deep learning with convolutional neural networks
title_short	Converting tabular data into images for deep learning with convolutional neural networks
title_sort	converting tabular data into images for deep learning with convolutional neural networks
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8166880/ https://www.ncbi.nlm.nih.gov/pubmed/34059739 http://dx.doi.org/10.1038/s41598-021-90923-y
work_keys_str_mv	AT zhuyitan convertingtabulardataintoimagesfordeeplearningwithconvolutionalneuralnetworks AT brettinthomas convertingtabulardataintoimagesfordeeplearningwithconvolutionalneuralnetworks AT xiafangfang convertingtabulardataintoimagesfordeeplearningwithconvolutionalneuralnetworks AT partinalexander convertingtabulardataintoimagesfordeeplearningwithconvolutionalneuralnetworks AT shuklamaulik convertingtabulardataintoimagesfordeeplearningwithconvolutionalneuralnetworks AT yoohyunseung convertingtabulardataintoimagesfordeeplearningwithconvolutionalneuralnetworks AT evrardyvonnea convertingtabulardataintoimagesfordeeplearningwithconvolutionalneuralnetworks AT doroshowjamesh convertingtabulardataintoimagesfordeeplearningwithconvolutionalneuralnetworks AT stevensrickl convertingtabulardataintoimagesfordeeplearningwithconvolutionalneuralnetworks

Converting tabular data into images for deep learning with convolutional neural networks

Ejemplares similares