Cargando…
DTV-CNN: Neural network based on depth and thickness views for efficient 3D shape classification
Fast and effective algorithms for deep learning on 3D shapes are keys to innovate mechanical and electronic engineering design workflow. In this paper, an efficient 3D shape to 2D images projection algorithm and a shallow 2.5D convolutional neural network architecture is proposed. A smaller convolut...
Autor principal: | |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Elsevier
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10665673/ https://www.ncbi.nlm.nih.gov/pubmed/38027921 http://dx.doi.org/10.1016/j.heliyon.2023.e21515 |
_version_ | 1785138877317513216 |
---|---|
author | Xia, Qingfeng |
author_facet | Xia, Qingfeng |
author_sort | Xia, Qingfeng |
collection | PubMed |
description | Fast and effective algorithms for deep learning on 3D shapes are keys to innovate mechanical and electronic engineering design workflow. In this paper, an efficient 3D shape to 2D images projection algorithm and a shallow 2.5D convolutional neural network architecture is proposed. A smaller convolutional neural network (CNN) model is achieved by information enrichment at the preprocessing stage, i.e. 3D geometry is compressed into 2D “thickness view” and “depth view”. Fusing the depth view and thickness view (DTV) from the same projection view into a dual-channel grayscale image, can improve information locality for geometry and topology feature extraction. This approach bridges the gap between mature image deep learning technologies to the applications of 3D shape. Enhanced by several essential scalar geometry properties and only 3 projection views, a mixed CNN and multiple linear parameter (MLP) neural network model achives a validation accuracy of 92 % for ModelNet10 mesh-based dataset, while the training time is one order of magnitude less than the original multi-view CNN approach. This study also creates new 3D shape datasets from 2 open source CAD projects. Higher validation accuracy is obtained for realistic CAD datasets, i.e. 97 % for FreeCAD's mechanical part library and 95 % for KiCAD electronic part library. The training cost reduces to tens of minutes on a laptop CPU, given the smaller input data size and shallow neural network design. It is expected that this approach can be adapted for other machine learning scenarios involved in CAD geometry. |
format | Online Article Text |
id | pubmed-10665673 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Elsevier |
record_format | MEDLINE/PubMed |
spelling | pubmed-106656732023-10-31 DTV-CNN: Neural network based on depth and thickness views for efficient 3D shape classification Xia, Qingfeng Heliyon Research Article Fast and effective algorithms for deep learning on 3D shapes are keys to innovate mechanical and electronic engineering design workflow. In this paper, an efficient 3D shape to 2D images projection algorithm and a shallow 2.5D convolutional neural network architecture is proposed. A smaller convolutional neural network (CNN) model is achieved by information enrichment at the preprocessing stage, i.e. 3D geometry is compressed into 2D “thickness view” and “depth view”. Fusing the depth view and thickness view (DTV) from the same projection view into a dual-channel grayscale image, can improve information locality for geometry and topology feature extraction. This approach bridges the gap between mature image deep learning technologies to the applications of 3D shape. Enhanced by several essential scalar geometry properties and only 3 projection views, a mixed CNN and multiple linear parameter (MLP) neural network model achives a validation accuracy of 92 % for ModelNet10 mesh-based dataset, while the training time is one order of magnitude less than the original multi-view CNN approach. This study also creates new 3D shape datasets from 2 open source CAD projects. Higher validation accuracy is obtained for realistic CAD datasets, i.e. 97 % for FreeCAD's mechanical part library and 95 % for KiCAD electronic part library. The training cost reduces to tens of minutes on a laptop CPU, given the smaller input data size and shallow neural network design. It is expected that this approach can be adapted for other machine learning scenarios involved in CAD geometry. Elsevier 2023-10-31 /pmc/articles/PMC10665673/ /pubmed/38027921 http://dx.doi.org/10.1016/j.heliyon.2023.e21515 Text en © 2023 The Author. Published by Elsevier Ltd. https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/). |
spellingShingle | Research Article Xia, Qingfeng DTV-CNN: Neural network based on depth and thickness views for efficient 3D shape classification |
title | DTV-CNN: Neural network based on depth and thickness views for efficient 3D shape classification |
title_full | DTV-CNN: Neural network based on depth and thickness views for efficient 3D shape classification |
title_fullStr | DTV-CNN: Neural network based on depth and thickness views for efficient 3D shape classification |
title_full_unstemmed | DTV-CNN: Neural network based on depth and thickness views for efficient 3D shape classification |
title_short | DTV-CNN: Neural network based on depth and thickness views for efficient 3D shape classification |
title_sort | dtv-cnn: neural network based on depth and thickness views for efficient 3d shape classification |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10665673/ https://www.ncbi.nlm.nih.gov/pubmed/38027921 http://dx.doi.org/10.1016/j.heliyon.2023.e21515 |
work_keys_str_mv | AT xiaqingfeng dtvcnnneuralnetworkbasedondepthandthicknessviewsforefficient3dshapeclassification |