Cargando…

Real-time Controlling Dynamics Sensing in Air Traffic System

In order to obtain real-time controlling dynamics in air traffic system, a framework is proposed to introduce and process air traffic control (ATC) speech via radiotelephony communication. An automatic speech recognition (ASR) and controlling instruction understanding (CIU)-based pipeline is designe...

Descripción completa

Detalles Bibliográficos
Autores principales: Lin, Yi, Tan, Xianlong, Yang, Bo, Yang, Kai, Zhang, Jianwei, Yu, Jing
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6387294/
https://www.ncbi.nlm.nih.gov/pubmed/30736452
http://dx.doi.org/10.3390/s19030679
_version_ 1783397548580405248
author Lin, Yi
Tan, Xianlong
Yang, Bo
Yang, Kai
Zhang, Jianwei
Yu, Jing
author_facet Lin, Yi
Tan, Xianlong
Yang, Bo
Yang, Kai
Zhang, Jianwei
Yu, Jing
author_sort Lin, Yi
collection PubMed
description In order to obtain real-time controlling dynamics in air traffic system, a framework is proposed to introduce and process air traffic control (ATC) speech via radiotelephony communication. An automatic speech recognition (ASR) and controlling instruction understanding (CIU)-based pipeline is designed to convert the ATC speech into ATC related elements, i.e., controlling intent and parameters. A correction procedure is also proposed to improve the reliability of the information obtained by the proposed framework. In the ASR model, acoustic model (AM), pronunciation model (PM), and phoneme- and word-based language model (LM) are proposed to unify multilingual ASR into one model. In this work, based on their tasks, the AM and PM are defined as speech recognition and machine translation problems respectively. Two-dimensional convolution and average-pooling layers are designed to solve special challenges of ASR in ATC. An encoder–decoder architecture-based neural network is proposed to translate phoneme labels into word labels, which achieves the purpose of ASR. In the CIU model, a recurrent neural network-based joint model is proposed to detect the controlling intent and label the controlling parameters, in which the two tasks are solved in one network to enhance the performance with each other based on ATC communication rules. The ATC speech is now converted into ATC related elements by the proposed ASR and CIU model. To further improve the accuracy of the sensing framework, a correction procedure is proposed to revise minor mistakes in ASR decoding results based on the flight information, such as flight plan, ADS-B. The proposed models are trained using real operating data and applied to a civil aviation airport in China to evaluate their performance. Experimental results show that the proposed framework can obtain real-time controlling dynamics with high performance, only 4% word-error rate. Meanwhile, the decoding efficiency can also meet the requirement of real-time applications, i.e., an average 0.147 real time factor. With the proposed framework and obtained traffic dynamics, current ATC applications can be accomplished with higher accuracy. In addition, the proposed ASR pipeline has high reusability, which allows us to apply it to other controlling scenes and languages with minor changes.
format Online
Article
Text
id pubmed-6387294
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-63872942019-02-26 Real-time Controlling Dynamics Sensing in Air Traffic System Lin, Yi Tan, Xianlong Yang, Bo Yang, Kai Zhang, Jianwei Yu, Jing Sensors (Basel) Article In order to obtain real-time controlling dynamics in air traffic system, a framework is proposed to introduce and process air traffic control (ATC) speech via radiotelephony communication. An automatic speech recognition (ASR) and controlling instruction understanding (CIU)-based pipeline is designed to convert the ATC speech into ATC related elements, i.e., controlling intent and parameters. A correction procedure is also proposed to improve the reliability of the information obtained by the proposed framework. In the ASR model, acoustic model (AM), pronunciation model (PM), and phoneme- and word-based language model (LM) are proposed to unify multilingual ASR into one model. In this work, based on their tasks, the AM and PM are defined as speech recognition and machine translation problems respectively. Two-dimensional convolution and average-pooling layers are designed to solve special challenges of ASR in ATC. An encoder–decoder architecture-based neural network is proposed to translate phoneme labels into word labels, which achieves the purpose of ASR. In the CIU model, a recurrent neural network-based joint model is proposed to detect the controlling intent and label the controlling parameters, in which the two tasks are solved in one network to enhance the performance with each other based on ATC communication rules. The ATC speech is now converted into ATC related elements by the proposed ASR and CIU model. To further improve the accuracy of the sensing framework, a correction procedure is proposed to revise minor mistakes in ASR decoding results based on the flight information, such as flight plan, ADS-B. The proposed models are trained using real operating data and applied to a civil aviation airport in China to evaluate their performance. Experimental results show that the proposed framework can obtain real-time controlling dynamics with high performance, only 4% word-error rate. Meanwhile, the decoding efficiency can also meet the requirement of real-time applications, i.e., an average 0.147 real time factor. With the proposed framework and obtained traffic dynamics, current ATC applications can be accomplished with higher accuracy. In addition, the proposed ASR pipeline has high reusability, which allows us to apply it to other controlling scenes and languages with minor changes. MDPI 2019-02-07 /pmc/articles/PMC6387294/ /pubmed/30736452 http://dx.doi.org/10.3390/s19030679 Text en © 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Lin, Yi
Tan, Xianlong
Yang, Bo
Yang, Kai
Zhang, Jianwei
Yu, Jing
Real-time Controlling Dynamics Sensing in Air Traffic System
title Real-time Controlling Dynamics Sensing in Air Traffic System
title_full Real-time Controlling Dynamics Sensing in Air Traffic System
title_fullStr Real-time Controlling Dynamics Sensing in Air Traffic System
title_full_unstemmed Real-time Controlling Dynamics Sensing in Air Traffic System
title_short Real-time Controlling Dynamics Sensing in Air Traffic System
title_sort real-time controlling dynamics sensing in air traffic system
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6387294/
https://www.ncbi.nlm.nih.gov/pubmed/30736452
http://dx.doi.org/10.3390/s19030679
work_keys_str_mv AT linyi realtimecontrollingdynamicssensinginairtrafficsystem
AT tanxianlong realtimecontrollingdynamicssensinginairtrafficsystem
AT yangbo realtimecontrollingdynamicssensinginairtrafficsystem
AT yangkai realtimecontrollingdynamicssensinginairtrafficsystem
AT zhangjianwei realtimecontrollingdynamicssensinginairtrafficsystem
AT yujing realtimecontrollingdynamicssensinginairtrafficsystem