Cargando…

DTV-CNN: Neural network based on depth and thickness views for efficient 3D shape classification

Fast and effective algorithms for deep learning on 3D shapes are keys to innovate mechanical and electronic engineering design workflow. In this paper, an efficient 3D shape to 2D images projection algorithm and a shallow 2.5D convolutional neural network architecture is proposed. A smaller convolut...

Descripción completa

Detalles Bibliográficos
Autor principal: Xia, Qingfeng
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10665673/
https://www.ncbi.nlm.nih.gov/pubmed/38027921
http://dx.doi.org/10.1016/j.heliyon.2023.e21515
_version_ 1785138877317513216
author Xia, Qingfeng
author_facet Xia, Qingfeng
author_sort Xia, Qingfeng
collection PubMed
description Fast and effective algorithms for deep learning on 3D shapes are keys to innovate mechanical and electronic engineering design workflow. In this paper, an efficient 3D shape to 2D images projection algorithm and a shallow 2.5D convolutional neural network architecture is proposed. A smaller convolutional neural network (CNN) model is achieved by information enrichment at the preprocessing stage, i.e. 3D geometry is compressed into 2D “thickness view” and “depth view”. Fusing the depth view and thickness view (DTV) from the same projection view into a dual-channel grayscale image, can improve information locality for geometry and topology feature extraction. This approach bridges the gap between mature image deep learning technologies to the applications of 3D shape. Enhanced by several essential scalar geometry properties and only 3 projection views, a mixed CNN and multiple linear parameter (MLP) neural network model achives a validation accuracy of 92 % for ModelNet10 mesh-based dataset, while the training time is one order of magnitude less than the original multi-view CNN approach. This study also creates new 3D shape datasets from 2 open source CAD projects. Higher validation accuracy is obtained for realistic CAD datasets, i.e. 97 % for FreeCAD's mechanical part library and 95 % for KiCAD electronic part library. The training cost reduces to tens of minutes on a laptop CPU, given the smaller input data size and shallow neural network design. It is expected that this approach can be adapted for other machine learning scenarios involved in CAD geometry.
format Online
Article
Text
id pubmed-10665673
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-106656732023-10-31 DTV-CNN: Neural network based on depth and thickness views for efficient 3D shape classification Xia, Qingfeng Heliyon Research Article Fast and effective algorithms for deep learning on 3D shapes are keys to innovate mechanical and electronic engineering design workflow. In this paper, an efficient 3D shape to 2D images projection algorithm and a shallow 2.5D convolutional neural network architecture is proposed. A smaller convolutional neural network (CNN) model is achieved by information enrichment at the preprocessing stage, i.e. 3D geometry is compressed into 2D “thickness view” and “depth view”. Fusing the depth view and thickness view (DTV) from the same projection view into a dual-channel grayscale image, can improve information locality for geometry and topology feature extraction. This approach bridges the gap between mature image deep learning technologies to the applications of 3D shape. Enhanced by several essential scalar geometry properties and only 3 projection views, a mixed CNN and multiple linear parameter (MLP) neural network model achives a validation accuracy of 92 % for ModelNet10 mesh-based dataset, while the training time is one order of magnitude less than the original multi-view CNN approach. This study also creates new 3D shape datasets from 2 open source CAD projects. Higher validation accuracy is obtained for realistic CAD datasets, i.e. 97 % for FreeCAD's mechanical part library and 95 % for KiCAD electronic part library. The training cost reduces to tens of minutes on a laptop CPU, given the smaller input data size and shallow neural network design. It is expected that this approach can be adapted for other machine learning scenarios involved in CAD geometry. Elsevier 2023-10-31 /pmc/articles/PMC10665673/ /pubmed/38027921 http://dx.doi.org/10.1016/j.heliyon.2023.e21515 Text en © 2023 The Author. Published by Elsevier Ltd. https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Research Article
Xia, Qingfeng
DTV-CNN: Neural network based on depth and thickness views for efficient 3D shape classification
title DTV-CNN: Neural network based on depth and thickness views for efficient 3D shape classification
title_full DTV-CNN: Neural network based on depth and thickness views for efficient 3D shape classification
title_fullStr DTV-CNN: Neural network based on depth and thickness views for efficient 3D shape classification
title_full_unstemmed DTV-CNN: Neural network based on depth and thickness views for efficient 3D shape classification
title_short DTV-CNN: Neural network based on depth and thickness views for efficient 3D shape classification
title_sort dtv-cnn: neural network based on depth and thickness views for efficient 3d shape classification
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10665673/
https://www.ncbi.nlm.nih.gov/pubmed/38027921
http://dx.doi.org/10.1016/j.heliyon.2023.e21515
work_keys_str_mv AT xiaqingfeng dtvcnnneuralnetworkbasedondepthandthicknessviewsforefficient3dshapeclassification