Cargando…

A novel smart photoelectric lock system: Speech transmitted by laser and speech to text

We propose a circuit that modulates a speech signal to a laser, using which the speech signal can be transmitted using the laser. Also, it shows the use of a platform based on embedded ARM (Advanced RISC Machine), running a small deep learning model based on TDNN (Time delay neural network) and LSTM...

Descripción completa

Detalles Bibliográficos
Autores principales: Guo, Cheng-Yan, Hsieh, Tung-Li, Chang, Chia-Chi, Perng, Jau-Woei
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10036661/
https://www.ncbi.nlm.nih.gov/pubmed/36967961
http://dx.doi.org/10.1016/j.heliyon.2023.e14510
_version_ 1784911709011443712
author Guo, Cheng-Yan
Hsieh, Tung-Li
Chang, Chia-Chi
Perng, Jau-Woei
author_facet Guo, Cheng-Yan
Hsieh, Tung-Li
Chang, Chia-Chi
Perng, Jau-Woei
author_sort Guo, Cheng-Yan
collection PubMed
description We propose a circuit that modulates a speech signal to a laser, using which the speech signal can be transmitted using the laser. Also, it shows the use of a platform based on embedded ARM (Advanced RISC Machine), running a small deep learning model based on TDNN (Time delay neural network) and LSTM (Long short-term memory), and converting speech to text, and use the text cipher for unlocking. This research implements a smart lock system that can set a pre-record speech cipher and verify the similarity through a laser transmission speech cipher to unlock it. In our experiment result, the English speech of laser transmission can reach a WER (Word error rate) of 14.06% through the deep learning model to recognize the content of the speech cipher. We also design a similarity comparison algorithm based on LCS (Longest common subsequence) to compare the character set of the laser transmission speech compare and the prerecord speech cipher to calculate the similarity rate. Through the similarity comparison algorithm, when the WER is 27.27%, the male speech samples used in this study still have a 95% unlocking success rate, while the female speech samples have a 100% unlocking success rate. Compared with only using automatic speech recognition (ASR) to unlock, the method we propose is to compare the similarity of the content of speech cipher. The method significantly improves the unlocking fault tolerance of using lasers to transmit audio. Therefore, by using the laser to transmit the speech cipher, the usability of the photoelectric smart lock system has been significantly improved. At the same time, the characteristics of the laser are not easy to eavesdrop on the cipher, which can also improve security.
format Online
Article
Text
id pubmed-10036661
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-100366612023-03-25 A novel smart photoelectric lock system: Speech transmitted by laser and speech to text Guo, Cheng-Yan Hsieh, Tung-Li Chang, Chia-Chi Perng, Jau-Woei Heliyon Research Article We propose a circuit that modulates a speech signal to a laser, using which the speech signal can be transmitted using the laser. Also, it shows the use of a platform based on embedded ARM (Advanced RISC Machine), running a small deep learning model based on TDNN (Time delay neural network) and LSTM (Long short-term memory), and converting speech to text, and use the text cipher for unlocking. This research implements a smart lock system that can set a pre-record speech cipher and verify the similarity through a laser transmission speech cipher to unlock it. In our experiment result, the English speech of laser transmission can reach a WER (Word error rate) of 14.06% through the deep learning model to recognize the content of the speech cipher. We also design a similarity comparison algorithm based on LCS (Longest common subsequence) to compare the character set of the laser transmission speech compare and the prerecord speech cipher to calculate the similarity rate. Through the similarity comparison algorithm, when the WER is 27.27%, the male speech samples used in this study still have a 95% unlocking success rate, while the female speech samples have a 100% unlocking success rate. Compared with only using automatic speech recognition (ASR) to unlock, the method we propose is to compare the similarity of the content of speech cipher. The method significantly improves the unlocking fault tolerance of using lasers to transmit audio. Therefore, by using the laser to transmit the speech cipher, the usability of the photoelectric smart lock system has been significantly improved. At the same time, the characteristics of the laser are not easy to eavesdrop on the cipher, which can also improve security. Elsevier 2023-03-15 /pmc/articles/PMC10036661/ /pubmed/36967961 http://dx.doi.org/10.1016/j.heliyon.2023.e14510 Text en © 2023 The Authors. Published by Elsevier Ltd. https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Research Article
Guo, Cheng-Yan
Hsieh, Tung-Li
Chang, Chia-Chi
Perng, Jau-Woei
A novel smart photoelectric lock system: Speech transmitted by laser and speech to text
title A novel smart photoelectric lock system: Speech transmitted by laser and speech to text
title_full A novel smart photoelectric lock system: Speech transmitted by laser and speech to text
title_fullStr A novel smart photoelectric lock system: Speech transmitted by laser and speech to text
title_full_unstemmed A novel smart photoelectric lock system: Speech transmitted by laser and speech to text
title_short A novel smart photoelectric lock system: Speech transmitted by laser and speech to text
title_sort novel smart photoelectric lock system: speech transmitted by laser and speech to text
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10036661/
https://www.ncbi.nlm.nih.gov/pubmed/36967961
http://dx.doi.org/10.1016/j.heliyon.2023.e14510
work_keys_str_mv AT guochengyan anovelsmartphotoelectriclocksystemspeechtransmittedbylaserandspeechtotext
AT hsiehtungli anovelsmartphotoelectriclocksystemspeechtransmittedbylaserandspeechtotext
AT changchiachi anovelsmartphotoelectriclocksystemspeechtransmittedbylaserandspeechtotext
AT perngjauwoei anovelsmartphotoelectriclocksystemspeechtransmittedbylaserandspeechtotext
AT guochengyan novelsmartphotoelectriclocksystemspeechtransmittedbylaserandspeechtotext
AT hsiehtungli novelsmartphotoelectriclocksystemspeechtransmittedbylaserandspeechtotext
AT changchiachi novelsmartphotoelectriclocksystemspeechtransmittedbylaserandspeechtotext
AT perngjauwoei novelsmartphotoelectriclocksystemspeechtransmittedbylaserandspeechtotext