Cargando…
A novel smart photoelectric lock system: Speech transmitted by laser and speech to text
We propose a circuit that modulates a speech signal to a laser, using which the speech signal can be transmitted using the laser. Also, it shows the use of a platform based on embedded ARM (Advanced RISC Machine), running a small deep learning model based on TDNN (Time delay neural network) and LSTM...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Elsevier
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10036661/ https://www.ncbi.nlm.nih.gov/pubmed/36967961 http://dx.doi.org/10.1016/j.heliyon.2023.e14510 |
_version_ | 1784911709011443712 |
---|---|
author | Guo, Cheng-Yan Hsieh, Tung-Li Chang, Chia-Chi Perng, Jau-Woei |
author_facet | Guo, Cheng-Yan Hsieh, Tung-Li Chang, Chia-Chi Perng, Jau-Woei |
author_sort | Guo, Cheng-Yan |
collection | PubMed |
description | We propose a circuit that modulates a speech signal to a laser, using which the speech signal can be transmitted using the laser. Also, it shows the use of a platform based on embedded ARM (Advanced RISC Machine), running a small deep learning model based on TDNN (Time delay neural network) and LSTM (Long short-term memory), and converting speech to text, and use the text cipher for unlocking. This research implements a smart lock system that can set a pre-record speech cipher and verify the similarity through a laser transmission speech cipher to unlock it. In our experiment result, the English speech of laser transmission can reach a WER (Word error rate) of 14.06% through the deep learning model to recognize the content of the speech cipher. We also design a similarity comparison algorithm based on LCS (Longest common subsequence) to compare the character set of the laser transmission speech compare and the prerecord speech cipher to calculate the similarity rate. Through the similarity comparison algorithm, when the WER is 27.27%, the male speech samples used in this study still have a 95% unlocking success rate, while the female speech samples have a 100% unlocking success rate. Compared with only using automatic speech recognition (ASR) to unlock, the method we propose is to compare the similarity of the content of speech cipher. The method significantly improves the unlocking fault tolerance of using lasers to transmit audio. Therefore, by using the laser to transmit the speech cipher, the usability of the photoelectric smart lock system has been significantly improved. At the same time, the characteristics of the laser are not easy to eavesdrop on the cipher, which can also improve security. |
format | Online Article Text |
id | pubmed-10036661 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Elsevier |
record_format | MEDLINE/PubMed |
spelling | pubmed-100366612023-03-25 A novel smart photoelectric lock system: Speech transmitted by laser and speech to text Guo, Cheng-Yan Hsieh, Tung-Li Chang, Chia-Chi Perng, Jau-Woei Heliyon Research Article We propose a circuit that modulates a speech signal to a laser, using which the speech signal can be transmitted using the laser. Also, it shows the use of a platform based on embedded ARM (Advanced RISC Machine), running a small deep learning model based on TDNN (Time delay neural network) and LSTM (Long short-term memory), and converting speech to text, and use the text cipher for unlocking. This research implements a smart lock system that can set a pre-record speech cipher and verify the similarity through a laser transmission speech cipher to unlock it. In our experiment result, the English speech of laser transmission can reach a WER (Word error rate) of 14.06% through the deep learning model to recognize the content of the speech cipher. We also design a similarity comparison algorithm based on LCS (Longest common subsequence) to compare the character set of the laser transmission speech compare and the prerecord speech cipher to calculate the similarity rate. Through the similarity comparison algorithm, when the WER is 27.27%, the male speech samples used in this study still have a 95% unlocking success rate, while the female speech samples have a 100% unlocking success rate. Compared with only using automatic speech recognition (ASR) to unlock, the method we propose is to compare the similarity of the content of speech cipher. The method significantly improves the unlocking fault tolerance of using lasers to transmit audio. Therefore, by using the laser to transmit the speech cipher, the usability of the photoelectric smart lock system has been significantly improved. At the same time, the characteristics of the laser are not easy to eavesdrop on the cipher, which can also improve security. Elsevier 2023-03-15 /pmc/articles/PMC10036661/ /pubmed/36967961 http://dx.doi.org/10.1016/j.heliyon.2023.e14510 Text en © 2023 The Authors. Published by Elsevier Ltd. https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/). |
spellingShingle | Research Article Guo, Cheng-Yan Hsieh, Tung-Li Chang, Chia-Chi Perng, Jau-Woei A novel smart photoelectric lock system: Speech transmitted by laser and speech to text |
title | A novel smart photoelectric lock system: Speech transmitted by laser and speech to text |
title_full | A novel smart photoelectric lock system: Speech transmitted by laser and speech to text |
title_fullStr | A novel smart photoelectric lock system: Speech transmitted by laser and speech to text |
title_full_unstemmed | A novel smart photoelectric lock system: Speech transmitted by laser and speech to text |
title_short | A novel smart photoelectric lock system: Speech transmitted by laser and speech to text |
title_sort | novel smart photoelectric lock system: speech transmitted by laser and speech to text |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10036661/ https://www.ncbi.nlm.nih.gov/pubmed/36967961 http://dx.doi.org/10.1016/j.heliyon.2023.e14510 |
work_keys_str_mv | AT guochengyan anovelsmartphotoelectriclocksystemspeechtransmittedbylaserandspeechtotext AT hsiehtungli anovelsmartphotoelectriclocksystemspeechtransmittedbylaserandspeechtotext AT changchiachi anovelsmartphotoelectriclocksystemspeechtransmittedbylaserandspeechtotext AT perngjauwoei anovelsmartphotoelectriclocksystemspeechtransmittedbylaserandspeechtotext AT guochengyan novelsmartphotoelectriclocksystemspeechtransmittedbylaserandspeechtotext AT hsiehtungli novelsmartphotoelectriclocksystemspeechtransmittedbylaserandspeechtotext AT changchiachi novelsmartphotoelectriclocksystemspeechtransmittedbylaserandspeechtotext AT perngjauwoei novelsmartphotoelectriclocksystemspeechtransmittedbylaserandspeechtotext |