Cargando…

Machine learning for pattern and waveform recognitions in terahertz image data

Several machine learning (ML) techniques were tested for the feasibility of performing automated pattern and waveform recognitions of terahertz time-domain spectroscopy datasets. Out of all the ML techniques under test, it was observed that random forest statistical algorithm works well with the THz...

Descripción completa

Detalles Bibliográficos
Autores principales: Bulgarevich, Dmitry S., Talara, Miezel, Tani, Masahiko, Watanabe, Makoto
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7806755/
https://www.ncbi.nlm.nih.gov/pubmed/33441888
http://dx.doi.org/10.1038/s41598-020-80761-9
_version_ 1783636591909011456
author Bulgarevich, Dmitry S.
Talara, Miezel
Tani, Masahiko
Watanabe, Makoto
author_facet Bulgarevich, Dmitry S.
Talara, Miezel
Tani, Masahiko
Watanabe, Makoto
author_sort Bulgarevich, Dmitry S.
collection PubMed
description Several machine learning (ML) techniques were tested for the feasibility of performing automated pattern and waveform recognitions of terahertz time-domain spectroscopy datasets. Out of all the ML techniques under test, it was observed that random forest statistical algorithm works well with the THz datasets in both the frequency and time domains. With such ML algorithm, a classifier can be created with less than 1% out-of-bag error for segmentation of rusted and non-rusted sample regions of the image datasets in frequency domain. The degree of linear correlation between the rusted area percentage and the image spatial resolution with terahertz frequency can be used as an additional cross-validation criteria for the evaluation of classifier quality. However, for different rust staging measured datasets, a standardized procedure of image pre-processing is necessary to create/apply a single classifier and its usage is only limited to 1 ± 0.2 THz. Moreover, random forest is practically the best choice among the several popular ML techniques under test for waveform recognition of time-domain data in terms of classification accuracy and timing. Our results demonstrate the usefulness of random forest and several other machine learning algorithms for terahertz hyperspectral pattern recognition.
format Online
Article
Text
id pubmed-7806755
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-78067552021-01-14 Machine learning for pattern and waveform recognitions in terahertz image data Bulgarevich, Dmitry S. Talara, Miezel Tani, Masahiko Watanabe, Makoto Sci Rep Article Several machine learning (ML) techniques were tested for the feasibility of performing automated pattern and waveform recognitions of terahertz time-domain spectroscopy datasets. Out of all the ML techniques under test, it was observed that random forest statistical algorithm works well with the THz datasets in both the frequency and time domains. With such ML algorithm, a classifier can be created with less than 1% out-of-bag error for segmentation of rusted and non-rusted sample regions of the image datasets in frequency domain. The degree of linear correlation between the rusted area percentage and the image spatial resolution with terahertz frequency can be used as an additional cross-validation criteria for the evaluation of classifier quality. However, for different rust staging measured datasets, a standardized procedure of image pre-processing is necessary to create/apply a single classifier and its usage is only limited to 1 ± 0.2 THz. Moreover, random forest is practically the best choice among the several popular ML techniques under test for waveform recognition of time-domain data in terms of classification accuracy and timing. Our results demonstrate the usefulness of random forest and several other machine learning algorithms for terahertz hyperspectral pattern recognition. Nature Publishing Group UK 2021-01-13 /pmc/articles/PMC7806755/ /pubmed/33441888 http://dx.doi.org/10.1038/s41598-020-80761-9 Text en © The Author(s) 2021 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Bulgarevich, Dmitry S.
Talara, Miezel
Tani, Masahiko
Watanabe, Makoto
Machine learning for pattern and waveform recognitions in terahertz image data
title Machine learning for pattern and waveform recognitions in terahertz image data
title_full Machine learning for pattern and waveform recognitions in terahertz image data
title_fullStr Machine learning for pattern and waveform recognitions in terahertz image data
title_full_unstemmed Machine learning for pattern and waveform recognitions in terahertz image data
title_short Machine learning for pattern and waveform recognitions in terahertz image data
title_sort machine learning for pattern and waveform recognitions in terahertz image data
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7806755/
https://www.ncbi.nlm.nih.gov/pubmed/33441888
http://dx.doi.org/10.1038/s41598-020-80761-9
work_keys_str_mv AT bulgarevichdmitrys machinelearningforpatternandwaveformrecognitionsinterahertzimagedata
AT talaramiezel machinelearningforpatternandwaveformrecognitionsinterahertzimagedata
AT tanimasahiko machinelearningforpatternandwaveformrecognitionsinterahertzimagedata
AT watanabemakoto machinelearningforpatternandwaveformrecognitionsinterahertzimagedata