Cargando…

Multivariate time series dataset for space weather data analytics

We introduce and make openly accessible a comprehensive, multivariate time series (MVTS) dataset extracted from solar photospheric vector magnetograms in Spaceweather HMI Active Region Patch (SHARP) series. Our dataset also includes a cross-checked NOAA solar flare catalog that immediately facilitat...

Descripción completa

Detalles Bibliográficos
Autores principales: Angryk, Rafal A., Martens, Petrus C., Aydin, Berkay, Kempton, Dustin, Mahajan, Sushant S., Basodi, Sunitha, Ahmadzadeh, Azim, Cai, Xumin, Filali Boubrahimi, Soukaina, Hamdi, Shah Muhammad, Schuh, Michael A., Georgoulis, Manolis K.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7351763/
https://www.ncbi.nlm.nih.gov/pubmed/32651380
http://dx.doi.org/10.1038/s41597-020-0548-x
_version_ 1783557508731764736
author Angryk, Rafal A.
Martens, Petrus C.
Aydin, Berkay
Kempton, Dustin
Mahajan, Sushant S.
Basodi, Sunitha
Ahmadzadeh, Azim
Cai, Xumin
Filali Boubrahimi, Soukaina
Hamdi, Shah Muhammad
Schuh, Michael A.
Georgoulis, Manolis K.
author_facet Angryk, Rafal A.
Martens, Petrus C.
Aydin, Berkay
Kempton, Dustin
Mahajan, Sushant S.
Basodi, Sunitha
Ahmadzadeh, Azim
Cai, Xumin
Filali Boubrahimi, Soukaina
Hamdi, Shah Muhammad
Schuh, Michael A.
Georgoulis, Manolis K.
author_sort Angryk, Rafal A.
collection PubMed
description We introduce and make openly accessible a comprehensive, multivariate time series (MVTS) dataset extracted from solar photospheric vector magnetograms in Spaceweather HMI Active Region Patch (SHARP) series. Our dataset also includes a cross-checked NOAA solar flare catalog that immediately facilitates solar flare prediction efforts. We discuss methods used for data collection, cleaning and pre-processing of the solar active region and flare data, and we further describe a novel data integration and sampling methodology. Our dataset covers 4,098 MVTS data collections from active regions occurring between May 2010 and December 2018, includes 51 flare-predictive parameters, and integrates over 10,000 flare reports. Potential directions toward expansion of the time series, either “horizontally” – by adding more prediction-specific parameters, or “vertically” – by generalizing flare into integrated solar eruption prediction, are also explained. The immediate tasks enabled by the disseminated dataset include: optimization of solar flare prediction and detailed investigation for elusive flare predictors or precursors, with both operational (research-to-operations), and basic research (operations-to-research) benefits potentially following in the future.
format Online
Article
Text
id pubmed-7351763
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-73517632020-07-13 Multivariate time series dataset for space weather data analytics Angryk, Rafal A. Martens, Petrus C. Aydin, Berkay Kempton, Dustin Mahajan, Sushant S. Basodi, Sunitha Ahmadzadeh, Azim Cai, Xumin Filali Boubrahimi, Soukaina Hamdi, Shah Muhammad Schuh, Michael A. Georgoulis, Manolis K. Sci Data Data Descriptor We introduce and make openly accessible a comprehensive, multivariate time series (MVTS) dataset extracted from solar photospheric vector magnetograms in Spaceweather HMI Active Region Patch (SHARP) series. Our dataset also includes a cross-checked NOAA solar flare catalog that immediately facilitates solar flare prediction efforts. We discuss methods used for data collection, cleaning and pre-processing of the solar active region and flare data, and we further describe a novel data integration and sampling methodology. Our dataset covers 4,098 MVTS data collections from active regions occurring between May 2010 and December 2018, includes 51 flare-predictive parameters, and integrates over 10,000 flare reports. Potential directions toward expansion of the time series, either “horizontally” – by adding more prediction-specific parameters, or “vertically” – by generalizing flare into integrated solar eruption prediction, are also explained. The immediate tasks enabled by the disseminated dataset include: optimization of solar flare prediction and detailed investigation for elusive flare predictors or precursors, with both operational (research-to-operations), and basic research (operations-to-research) benefits potentially following in the future. Nature Publishing Group UK 2020-07-10 /pmc/articles/PMC7351763/ /pubmed/32651380 http://dx.doi.org/10.1038/s41597-020-0548-x Text en © The Author(s) 2020 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver http://creativecommons.org/publicdomain/zero/1.0/ applies to the metadata files associated with this article.
spellingShingle Data Descriptor
Angryk, Rafal A.
Martens, Petrus C.
Aydin, Berkay
Kempton, Dustin
Mahajan, Sushant S.
Basodi, Sunitha
Ahmadzadeh, Azim
Cai, Xumin
Filali Boubrahimi, Soukaina
Hamdi, Shah Muhammad
Schuh, Michael A.
Georgoulis, Manolis K.
Multivariate time series dataset for space weather data analytics
title Multivariate time series dataset for space weather data analytics
title_full Multivariate time series dataset for space weather data analytics
title_fullStr Multivariate time series dataset for space weather data analytics
title_full_unstemmed Multivariate time series dataset for space weather data analytics
title_short Multivariate time series dataset for space weather data analytics
title_sort multivariate time series dataset for space weather data analytics
topic Data Descriptor
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7351763/
https://www.ncbi.nlm.nih.gov/pubmed/32651380
http://dx.doi.org/10.1038/s41597-020-0548-x
work_keys_str_mv AT angrykrafala multivariatetimeseriesdatasetforspaceweatherdataanalytics
AT martenspetrusc multivariatetimeseriesdatasetforspaceweatherdataanalytics
AT aydinberkay multivariatetimeseriesdatasetforspaceweatherdataanalytics
AT kemptondustin multivariatetimeseriesdatasetforspaceweatherdataanalytics
AT mahajansushants multivariatetimeseriesdatasetforspaceweatherdataanalytics
AT basodisunitha multivariatetimeseriesdatasetforspaceweatherdataanalytics
AT ahmadzadehazim multivariatetimeseriesdatasetforspaceweatherdataanalytics
AT caixumin multivariatetimeseriesdatasetforspaceweatherdataanalytics
AT filaliboubrahimisoukaina multivariatetimeseriesdatasetforspaceweatherdataanalytics
AT hamdishahmuhammad multivariatetimeseriesdatasetforspaceweatherdataanalytics
AT schuhmichaela multivariatetimeseriesdatasetforspaceweatherdataanalytics
AT georgoulismanolisk multivariatetimeseriesdatasetforspaceweatherdataanalytics