Cargando…
Learning Representative Features by Deep Attention Network for 3D Point Cloud Registration
Three-dimensional point cloud registration, which aims to find the transformation that best aligns two point clouds, is a widely studied problem in computer vision with a wide spectrum of applications, such as underground mining. Many learning-based approaches have been developed and have demonstrat...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10145325/ https://www.ncbi.nlm.nih.gov/pubmed/37112464 http://dx.doi.org/10.3390/s23084123 |
_version_ | 1785034306463203328 |
---|---|
author | Xia, Xiaokai Fan, Zhiqiang Xiao, Gang Chen, Fangyue Liu, Yu Hu, Yiheng |
author_facet | Xia, Xiaokai Fan, Zhiqiang Xiao, Gang Chen, Fangyue Liu, Yu Hu, Yiheng |
author_sort | Xia, Xiaokai |
collection | PubMed |
description | Three-dimensional point cloud registration, which aims to find the transformation that best aligns two point clouds, is a widely studied problem in computer vision with a wide spectrum of applications, such as underground mining. Many learning-based approaches have been developed and have demonstrated their effectiveness for point cloud registration. Particularly, attention-based models have achieved outstanding performance due to the extra contextual information captured by attention mechanisms. To avoid the high computation cost brought by attention mechanisms, an encoder–decoder framework is often employed to hierarchically extract the features where the attention module is only applied in the middle. This leads to the compromised effectiveness of the attention module. To tackle this issue, we propose a novel model with the attention layers embedded in both the encoder and decoder stages. In our model, the self-attentional layers are applied in the encoder to consider the relationship between points inside each point cloud, while the decoder utilizes cross-attentional layers to enrich features with contextual information. Extensive experiments conducted on public datasets prove that our model is able to achieve quality results on a registration task. |
format | Online Article Text |
id | pubmed-10145325 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-101453252023-04-29 Learning Representative Features by Deep Attention Network for 3D Point Cloud Registration Xia, Xiaokai Fan, Zhiqiang Xiao, Gang Chen, Fangyue Liu, Yu Hu, Yiheng Sensors (Basel) Article Three-dimensional point cloud registration, which aims to find the transformation that best aligns two point clouds, is a widely studied problem in computer vision with a wide spectrum of applications, such as underground mining. Many learning-based approaches have been developed and have demonstrated their effectiveness for point cloud registration. Particularly, attention-based models have achieved outstanding performance due to the extra contextual information captured by attention mechanisms. To avoid the high computation cost brought by attention mechanisms, an encoder–decoder framework is often employed to hierarchically extract the features where the attention module is only applied in the middle. This leads to the compromised effectiveness of the attention module. To tackle this issue, we propose a novel model with the attention layers embedded in both the encoder and decoder stages. In our model, the self-attentional layers are applied in the encoder to consider the relationship between points inside each point cloud, while the decoder utilizes cross-attentional layers to enrich features with contextual information. Extensive experiments conducted on public datasets prove that our model is able to achieve quality results on a registration task. MDPI 2023-04-20 /pmc/articles/PMC10145325/ /pubmed/37112464 http://dx.doi.org/10.3390/s23084123 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Xia, Xiaokai Fan, Zhiqiang Xiao, Gang Chen, Fangyue Liu, Yu Hu, Yiheng Learning Representative Features by Deep Attention Network for 3D Point Cloud Registration |
title | Learning Representative Features by Deep Attention Network for 3D Point Cloud Registration |
title_full | Learning Representative Features by Deep Attention Network for 3D Point Cloud Registration |
title_fullStr | Learning Representative Features by Deep Attention Network for 3D Point Cloud Registration |
title_full_unstemmed | Learning Representative Features by Deep Attention Network for 3D Point Cloud Registration |
title_short | Learning Representative Features by Deep Attention Network for 3D Point Cloud Registration |
title_sort | learning representative features by deep attention network for 3d point cloud registration |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10145325/ https://www.ncbi.nlm.nih.gov/pubmed/37112464 http://dx.doi.org/10.3390/s23084123 |
work_keys_str_mv | AT xiaxiaokai learningrepresentativefeaturesbydeepattentionnetworkfor3dpointcloudregistration AT fanzhiqiang learningrepresentativefeaturesbydeepattentionnetworkfor3dpointcloudregistration AT xiaogang learningrepresentativefeaturesbydeepattentionnetworkfor3dpointcloudregistration AT chenfangyue learningrepresentativefeaturesbydeepattentionnetworkfor3dpointcloudregistration AT liuyu learningrepresentativefeaturesbydeepattentionnetworkfor3dpointcloudregistration AT huyiheng learningrepresentativefeaturesbydeepattentionnetworkfor3dpointcloudregistration |