Cargando…

Learning Representative Features by Deep Attention Network for 3D Point Cloud Registration

Three-dimensional point cloud registration, which aims to find the transformation that best aligns two point clouds, is a widely studied problem in computer vision with a wide spectrum of applications, such as underground mining. Many learning-based approaches have been developed and have demonstrat...

Descripción completa

Detalles Bibliográficos
Autores principales: Xia, Xiaokai, Fan, Zhiqiang, Xiao, Gang, Chen, Fangyue, Liu, Yu, Hu, Yiheng
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10145325/
https://www.ncbi.nlm.nih.gov/pubmed/37112464
http://dx.doi.org/10.3390/s23084123
_version_ 1785034306463203328
author Xia, Xiaokai
Fan, Zhiqiang
Xiao, Gang
Chen, Fangyue
Liu, Yu
Hu, Yiheng
author_facet Xia, Xiaokai
Fan, Zhiqiang
Xiao, Gang
Chen, Fangyue
Liu, Yu
Hu, Yiheng
author_sort Xia, Xiaokai
collection PubMed
description Three-dimensional point cloud registration, which aims to find the transformation that best aligns two point clouds, is a widely studied problem in computer vision with a wide spectrum of applications, such as underground mining. Many learning-based approaches have been developed and have demonstrated their effectiveness for point cloud registration. Particularly, attention-based models have achieved outstanding performance due to the extra contextual information captured by attention mechanisms. To avoid the high computation cost brought by attention mechanisms, an encoder–decoder framework is often employed to hierarchically extract the features where the attention module is only applied in the middle. This leads to the compromised effectiveness of the attention module. To tackle this issue, we propose a novel model with the attention layers embedded in both the encoder and decoder stages. In our model, the self-attentional layers are applied in the encoder to consider the relationship between points inside each point cloud, while the decoder utilizes cross-attentional layers to enrich features with contextual information. Extensive experiments conducted on public datasets prove that our model is able to achieve quality results on a registration task.
format Online
Article
Text
id pubmed-10145325
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-101453252023-04-29 Learning Representative Features by Deep Attention Network for 3D Point Cloud Registration Xia, Xiaokai Fan, Zhiqiang Xiao, Gang Chen, Fangyue Liu, Yu Hu, Yiheng Sensors (Basel) Article Three-dimensional point cloud registration, which aims to find the transformation that best aligns two point clouds, is a widely studied problem in computer vision with a wide spectrum of applications, such as underground mining. Many learning-based approaches have been developed and have demonstrated their effectiveness for point cloud registration. Particularly, attention-based models have achieved outstanding performance due to the extra contextual information captured by attention mechanisms. To avoid the high computation cost brought by attention mechanisms, an encoder–decoder framework is often employed to hierarchically extract the features where the attention module is only applied in the middle. This leads to the compromised effectiveness of the attention module. To tackle this issue, we propose a novel model with the attention layers embedded in both the encoder and decoder stages. In our model, the self-attentional layers are applied in the encoder to consider the relationship between points inside each point cloud, while the decoder utilizes cross-attentional layers to enrich features with contextual information. Extensive experiments conducted on public datasets prove that our model is able to achieve quality results on a registration task. MDPI 2023-04-20 /pmc/articles/PMC10145325/ /pubmed/37112464 http://dx.doi.org/10.3390/s23084123 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Xia, Xiaokai
Fan, Zhiqiang
Xiao, Gang
Chen, Fangyue
Liu, Yu
Hu, Yiheng
Learning Representative Features by Deep Attention Network for 3D Point Cloud Registration
title Learning Representative Features by Deep Attention Network for 3D Point Cloud Registration
title_full Learning Representative Features by Deep Attention Network for 3D Point Cloud Registration
title_fullStr Learning Representative Features by Deep Attention Network for 3D Point Cloud Registration
title_full_unstemmed Learning Representative Features by Deep Attention Network for 3D Point Cloud Registration
title_short Learning Representative Features by Deep Attention Network for 3D Point Cloud Registration
title_sort learning representative features by deep attention network for 3d point cloud registration
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10145325/
https://www.ncbi.nlm.nih.gov/pubmed/37112464
http://dx.doi.org/10.3390/s23084123
work_keys_str_mv AT xiaxiaokai learningrepresentativefeaturesbydeepattentionnetworkfor3dpointcloudregistration
AT fanzhiqiang learningrepresentativefeaturesbydeepattentionnetworkfor3dpointcloudregistration
AT xiaogang learningrepresentativefeaturesbydeepattentionnetworkfor3dpointcloudregistration
AT chenfangyue learningrepresentativefeaturesbydeepattentionnetworkfor3dpointcloudregistration
AT liuyu learningrepresentativefeaturesbydeepattentionnetworkfor3dpointcloudregistration
AT huyiheng learningrepresentativefeaturesbydeepattentionnetworkfor3dpointcloudregistration