Cargando…

Machine Learning Modelling and Feature Engineering in Seismology Experiment

This article aims to discusses machine learning modelling using a dataset provided by the LANL (Los Alamos National Laboratory) earthquake prediction competition hosted by Kaggle. The data were obtained from a laboratory stick-slip friction experiment that mimics real earthquakes. Digitized acoustic...

Descripción completa

Detalles Bibliográficos
Autores principales: Brykov, Michail Nikolaevich, Petryshynets, Ivan, Pruncu, Catalin Iulian, Efremenko, Vasily Georgievich, Pimenov, Danil Yurievich, Giasin, Khaled, Sylenko, Serhii Anatolievich, Wojciechowski, Szymon
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7435601/
https://www.ncbi.nlm.nih.gov/pubmed/32751350
http://dx.doi.org/10.3390/s20154228
_version_ 1783572360119451648
author Brykov, Michail Nikolaevich
Petryshynets, Ivan
Pruncu, Catalin Iulian
Efremenko, Vasily Georgievich
Pimenov, Danil Yurievich
Giasin, Khaled
Sylenko, Serhii Anatolievich
Wojciechowski, Szymon
author_facet Brykov, Michail Nikolaevich
Petryshynets, Ivan
Pruncu, Catalin Iulian
Efremenko, Vasily Georgievich
Pimenov, Danil Yurievich
Giasin, Khaled
Sylenko, Serhii Anatolievich
Wojciechowski, Szymon
author_sort Brykov, Michail Nikolaevich
collection PubMed
description This article aims to discusses machine learning modelling using a dataset provided by the LANL (Los Alamos National Laboratory) earthquake prediction competition hosted by Kaggle. The data were obtained from a laboratory stick-slip friction experiment that mimics real earthquakes. Digitized acoustic signals were recorded against time to failure of a granular layer compressed between steel plates. In this work, machine learning was employed to develop models that could predict earthquakes. The aim is to highlight the importance and potential applicability of machine learning in seismology The XGBoost algorithm was used for modelling combined with 6-fold cross-validation and the mean absolute error (MAE) metric for model quality estimation. The backward feature elimination technique was used followed by the forward feature construction approach to find the best combination of features. The advantage of this feature engineering method is that it enables the best subset to be found from a relatively large set of features in a relatively short time. It was confirmed that the proper combination of statistical characteristics describing acoustic data can be used for effective prediction of time to failure. Additionally, statistical features based on the autocorrelation of acoustic data can also be used for further improvement of model quality. A total of 48 statistical features were considered. The best subset was determined as having 10 features. Its corresponding MAE was 1.913 s, which was stable to the third decimal point. The presented results can be used to develop artificial intelligence algorithms devoted to earthquake prediction.
format Online
Article
Text
id pubmed-7435601
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-74356012020-08-28 Machine Learning Modelling and Feature Engineering in Seismology Experiment Brykov, Michail Nikolaevich Petryshynets, Ivan Pruncu, Catalin Iulian Efremenko, Vasily Georgievich Pimenov, Danil Yurievich Giasin, Khaled Sylenko, Serhii Anatolievich Wojciechowski, Szymon Sensors (Basel) Article This article aims to discusses machine learning modelling using a dataset provided by the LANL (Los Alamos National Laboratory) earthquake prediction competition hosted by Kaggle. The data were obtained from a laboratory stick-slip friction experiment that mimics real earthquakes. Digitized acoustic signals were recorded against time to failure of a granular layer compressed between steel plates. In this work, machine learning was employed to develop models that could predict earthquakes. The aim is to highlight the importance and potential applicability of machine learning in seismology The XGBoost algorithm was used for modelling combined with 6-fold cross-validation and the mean absolute error (MAE) metric for model quality estimation. The backward feature elimination technique was used followed by the forward feature construction approach to find the best combination of features. The advantage of this feature engineering method is that it enables the best subset to be found from a relatively large set of features in a relatively short time. It was confirmed that the proper combination of statistical characteristics describing acoustic data can be used for effective prediction of time to failure. Additionally, statistical features based on the autocorrelation of acoustic data can also be used for further improvement of model quality. A total of 48 statistical features were considered. The best subset was determined as having 10 features. Its corresponding MAE was 1.913 s, which was stable to the third decimal point. The presented results can be used to develop artificial intelligence algorithms devoted to earthquake prediction. MDPI 2020-07-29 /pmc/articles/PMC7435601/ /pubmed/32751350 http://dx.doi.org/10.3390/s20154228 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Brykov, Michail Nikolaevich
Petryshynets, Ivan
Pruncu, Catalin Iulian
Efremenko, Vasily Georgievich
Pimenov, Danil Yurievich
Giasin, Khaled
Sylenko, Serhii Anatolievich
Wojciechowski, Szymon
Machine Learning Modelling and Feature Engineering in Seismology Experiment
title Machine Learning Modelling and Feature Engineering in Seismology Experiment
title_full Machine Learning Modelling and Feature Engineering in Seismology Experiment
title_fullStr Machine Learning Modelling and Feature Engineering in Seismology Experiment
title_full_unstemmed Machine Learning Modelling and Feature Engineering in Seismology Experiment
title_short Machine Learning Modelling and Feature Engineering in Seismology Experiment
title_sort machine learning modelling and feature engineering in seismology experiment
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7435601/
https://www.ncbi.nlm.nih.gov/pubmed/32751350
http://dx.doi.org/10.3390/s20154228
work_keys_str_mv AT brykovmichailnikolaevich machinelearningmodellingandfeatureengineeringinseismologyexperiment
AT petryshynetsivan machinelearningmodellingandfeatureengineeringinseismologyexperiment
AT pruncucataliniulian machinelearningmodellingandfeatureengineeringinseismologyexperiment
AT efremenkovasilygeorgievich machinelearningmodellingandfeatureengineeringinseismologyexperiment
AT pimenovdanilyurievich machinelearningmodellingandfeatureengineeringinseismologyexperiment
AT giasinkhaled machinelearningmodellingandfeatureengineeringinseismologyexperiment
AT sylenkoserhiianatolievich machinelearningmodellingandfeatureengineeringinseismologyexperiment
AT wojciechowskiszymon machinelearningmodellingandfeatureengineeringinseismologyexperiment