Cargando…

Utilizing Mixed Training and Multi-Head Attention to Address Data Shift in AI-Based Electromagnetic Solvers for Nano-Structured Metamaterials

When designing nano-structured metamaterials with an iterative optimization method, a fast deep learning solver is desirable to replace a time-consuming numerical solver, and the related issue of data shift is a subtle yet easily overlooked challenge. In this work, we explore the data shift challeng...

Descripción completa

Detalles Bibliográficos
Autores principales: Zeng, Zhenjia, Wang, Lei, Wu, Yiran, Hu, Zhipeng, Evans, Julian, Zhu, Xinhua, Ye, Gaoao, He, Sailing
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10609168/
https://www.ncbi.nlm.nih.gov/pubmed/37887929
http://dx.doi.org/10.3390/nano13202778
_version_ 1785127950162591744
author Zeng, Zhenjia
Wang, Lei
Wu, Yiran
Hu, Zhipeng
Evans, Julian
Zhu, Xinhua
Ye, Gaoao
He, Sailing
author_facet Zeng, Zhenjia
Wang, Lei
Wu, Yiran
Hu, Zhipeng
Evans, Julian
Zhu, Xinhua
Ye, Gaoao
He, Sailing
author_sort Zeng, Zhenjia
collection PubMed
description When designing nano-structured metamaterials with an iterative optimization method, a fast deep learning solver is desirable to replace a time-consuming numerical solver, and the related issue of data shift is a subtle yet easily overlooked challenge. In this work, we explore the data shift challenge in an AI-based electromagnetic solver and present innovative solutions. Using a one-dimensional grating coupler as a case study, we demonstrate the presence of data shift through the probability density method and principal component analysis, and show the degradation of neural network performance through experiments dealing with data affected by data shift. We propose three effective strategies to mitigate the effects of data shift: mixed training, adding multi-head attention, and a comprehensive approach that combines both. The experimental results validate the efficacy of these approaches in addressing data shift. Specifically, the combination of mixed training and multi-head attention significantly reduces the mean absolute error, by approximately 36%, when applied to data affected by data shift. Our work provides crucial insights and guidance for AI-based electromagnetic solvers in the optimal design of nano-structured metamaterials.
format Online
Article
Text
id pubmed-10609168
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-106091682023-10-28 Utilizing Mixed Training and Multi-Head Attention to Address Data Shift in AI-Based Electromagnetic Solvers for Nano-Structured Metamaterials Zeng, Zhenjia Wang, Lei Wu, Yiran Hu, Zhipeng Evans, Julian Zhu, Xinhua Ye, Gaoao He, Sailing Nanomaterials (Basel) Article When designing nano-structured metamaterials with an iterative optimization method, a fast deep learning solver is desirable to replace a time-consuming numerical solver, and the related issue of data shift is a subtle yet easily overlooked challenge. In this work, we explore the data shift challenge in an AI-based electromagnetic solver and present innovative solutions. Using a one-dimensional grating coupler as a case study, we demonstrate the presence of data shift through the probability density method and principal component analysis, and show the degradation of neural network performance through experiments dealing with data affected by data shift. We propose three effective strategies to mitigate the effects of data shift: mixed training, adding multi-head attention, and a comprehensive approach that combines both. The experimental results validate the efficacy of these approaches in addressing data shift. Specifically, the combination of mixed training and multi-head attention significantly reduces the mean absolute error, by approximately 36%, when applied to data affected by data shift. Our work provides crucial insights and guidance for AI-based electromagnetic solvers in the optimal design of nano-structured metamaterials. MDPI 2023-10-17 /pmc/articles/PMC10609168/ /pubmed/37887929 http://dx.doi.org/10.3390/nano13202778 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Zeng, Zhenjia
Wang, Lei
Wu, Yiran
Hu, Zhipeng
Evans, Julian
Zhu, Xinhua
Ye, Gaoao
He, Sailing
Utilizing Mixed Training and Multi-Head Attention to Address Data Shift in AI-Based Electromagnetic Solvers for Nano-Structured Metamaterials
title Utilizing Mixed Training and Multi-Head Attention to Address Data Shift in AI-Based Electromagnetic Solvers for Nano-Structured Metamaterials
title_full Utilizing Mixed Training and Multi-Head Attention to Address Data Shift in AI-Based Electromagnetic Solvers for Nano-Structured Metamaterials
title_fullStr Utilizing Mixed Training and Multi-Head Attention to Address Data Shift in AI-Based Electromagnetic Solvers for Nano-Structured Metamaterials
title_full_unstemmed Utilizing Mixed Training and Multi-Head Attention to Address Data Shift in AI-Based Electromagnetic Solvers for Nano-Structured Metamaterials
title_short Utilizing Mixed Training and Multi-Head Attention to Address Data Shift in AI-Based Electromagnetic Solvers for Nano-Structured Metamaterials
title_sort utilizing mixed training and multi-head attention to address data shift in ai-based electromagnetic solvers for nano-structured metamaterials
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10609168/
https://www.ncbi.nlm.nih.gov/pubmed/37887929
http://dx.doi.org/10.3390/nano13202778
work_keys_str_mv AT zengzhenjia utilizingmixedtrainingandmultiheadattentiontoaddressdatashiftinaibasedelectromagneticsolversfornanostructuredmetamaterials
AT wanglei utilizingmixedtrainingandmultiheadattentiontoaddressdatashiftinaibasedelectromagneticsolversfornanostructuredmetamaterials
AT wuyiran utilizingmixedtrainingandmultiheadattentiontoaddressdatashiftinaibasedelectromagneticsolversfornanostructuredmetamaterials
AT huzhipeng utilizingmixedtrainingandmultiheadattentiontoaddressdatashiftinaibasedelectromagneticsolversfornanostructuredmetamaterials
AT evansjulian utilizingmixedtrainingandmultiheadattentiontoaddressdatashiftinaibasedelectromagneticsolversfornanostructuredmetamaterials
AT zhuxinhua utilizingmixedtrainingandmultiheadattentiontoaddressdatashiftinaibasedelectromagneticsolversfornanostructuredmetamaterials
AT yegaoao utilizingmixedtrainingandmultiheadattentiontoaddressdatashiftinaibasedelectromagneticsolversfornanostructuredmetamaterials
AT hesailing utilizingmixedtrainingandmultiheadattentiontoaddressdatashiftinaibasedelectromagneticsolversfornanostructuredmetamaterials