Cargando…

Jamming Strategy Optimization through Dual Q-Learning Model against Adaptive Radar

Modern adaptive radars can switch work modes to perform various missions and simultaneously use pulse parameter agility in each mode to improve survivability, which leads to a multiplicative increase in the decision-making complexity and declining performance of the existing jamming methods. In this...

Descripción completa

Detalles Bibliográficos
Autores principales:	Liu, Hongdi, Zhang, Hongtao, He, Yuan, Sun, Yong
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2021
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8747401/ https://www.ncbi.nlm.nih.gov/pubmed/35009688 http://dx.doi.org/10.3390/s22010145

_version_	1784630827575934976
author	Liu, Hongdi Zhang, Hongtao He, Yuan Sun, Yong
author_facet	Liu, Hongdi Zhang, Hongtao He, Yuan Sun, Yong
author_sort	Liu, Hongdi
collection	PubMed
description	Modern adaptive radars can switch work modes to perform various missions and simultaneously use pulse parameter agility in each mode to improve survivability, which leads to a multiplicative increase in the decision-making complexity and declining performance of the existing jamming methods. In this paper, a two-level jamming decision-making framework is developed, based on which a dual Q-learning (DQL) model is proposed to optimize the jamming strategy and a dynamic method for jamming effectiveness evaluation is designed to update the model. Specifically, the jamming procedure is modeled as a finite Markov decision process. On this basis, the high-dimensional jamming action space is disassembled into two low-dimensional subspaces containing jamming mode and pulse parameters respectively, then two specialized Q-learning models with interaction are built to obtain the optimal solution. Moreover, the jamming effectiveness is evaluated through indicator vector distance measuring to acquire the feedback for the DQL model, where indicators are dynamically weighted to adapt to the environment. The experiments demonstrate the advantage of the proposed method in learning radar joint strategy of mode switching and parameter agility, shown as improving the average jamming-to-signal radio (JSR) by 4.05% while reducing the convergence time by 34.94% compared with the normal Q-learning method.
format	Online Article Text
id	pubmed-8747401
institution	National Center for Biotechnology Information
language	English
publishDate	2021
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-87474012022-01-11 Jamming Strategy Optimization through Dual Q-Learning Model against Adaptive Radar Liu, Hongdi Zhang, Hongtao He, Yuan Sun, Yong Sensors (Basel) Article Modern adaptive radars can switch work modes to perform various missions and simultaneously use pulse parameter agility in each mode to improve survivability, which leads to a multiplicative increase in the decision-making complexity and declining performance of the existing jamming methods. In this paper, a two-level jamming decision-making framework is developed, based on which a dual Q-learning (DQL) model is proposed to optimize the jamming strategy and a dynamic method for jamming effectiveness evaluation is designed to update the model. Specifically, the jamming procedure is modeled as a finite Markov decision process. On this basis, the high-dimensional jamming action space is disassembled into two low-dimensional subspaces containing jamming mode and pulse parameters respectively, then two specialized Q-learning models with interaction are built to obtain the optimal solution. Moreover, the jamming effectiveness is evaluated through indicator vector distance measuring to acquire the feedback for the DQL model, where indicators are dynamically weighted to adapt to the environment. The experiments demonstrate the advantage of the proposed method in learning radar joint strategy of mode switching and parameter agility, shown as improving the average jamming-to-signal radio (JSR) by 4.05% while reducing the convergence time by 34.94% compared with the normal Q-learning method. MDPI 2021-12-26 /pmc/articles/PMC8747401/ /pubmed/35009688 http://dx.doi.org/10.3390/s22010145 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Liu, Hongdi Zhang, Hongtao He, Yuan Sun, Yong Jamming Strategy Optimization through Dual Q-Learning Model against Adaptive Radar
title	Jamming Strategy Optimization through Dual Q-Learning Model against Adaptive Radar
title_full	Jamming Strategy Optimization through Dual Q-Learning Model against Adaptive Radar
title_fullStr	Jamming Strategy Optimization through Dual Q-Learning Model against Adaptive Radar
title_full_unstemmed	Jamming Strategy Optimization through Dual Q-Learning Model against Adaptive Radar
title_short	Jamming Strategy Optimization through Dual Q-Learning Model against Adaptive Radar
title_sort	jamming strategy optimization through dual q-learning model against adaptive radar
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8747401/ https://www.ncbi.nlm.nih.gov/pubmed/35009688 http://dx.doi.org/10.3390/s22010145
work_keys_str_mv	AT liuhongdi jammingstrategyoptimizationthroughdualqlearningmodelagainstadaptiveradar AT zhanghongtao jammingstrategyoptimizationthroughdualqlearningmodelagainstadaptiveradar AT heyuan jammingstrategyoptimizationthroughdualqlearningmodelagainstadaptiveradar AT sunyong jammingstrategyoptimizationthroughdualqlearningmodelagainstadaptiveradar

Jamming Strategy Optimization through Dual Q-Learning Model against Adaptive Radar

Ejemplares similares