Cargando…

A Methodology for Validating Diversity in Synthetic Time Series Generation()

In order for researchers to deliver robust evaluations of time series models, it often requires high volumes of data to ensure the appropriate level of rigor in testing. However, for many researchers, the lack of time series presents a barrier to a deeper evaluation. While researchers have developed...

Descripción completa

Detalles Bibliográficos
Autores principales: Bahrpeyma, Fouad, Roantree, Mark, Cappellari, Paolo, Scriney, Michael, McCarren, Andrew
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8374706/
https://www.ncbi.nlm.nih.gov/pubmed/34434865
http://dx.doi.org/10.1016/j.mex.2021.101459
_version_ 1783740173916307456
author Bahrpeyma, Fouad
Roantree, Mark
Cappellari, Paolo
Scriney, Michael
McCarren, Andrew
author_facet Bahrpeyma, Fouad
Roantree, Mark
Cappellari, Paolo
Scriney, Michael
McCarren, Andrew
author_sort Bahrpeyma, Fouad
collection PubMed
description In order for researchers to deliver robust evaluations of time series models, it often requires high volumes of data to ensure the appropriate level of rigor in testing. However, for many researchers, the lack of time series presents a barrier to a deeper evaluation. While researchers have developed and used synthetic datasets, the development of this data requires a methodological approach to testing the entire dataset against a set of metrics which capture the diversity of the dataset. Unless researchers are confident that their test datasets display a broad set of time series characteristics, it may favor one type of predictive model over another. This can have the effect of undermining the evaluation of new predictive methods. In this paper, we present a new approach to generating and evaluating a high number of time series data. The construction algorithm and validation framework are described in detail, together with an analysis of the level of diversity present in the synthetic dataset.
format Online
Article
Text
id pubmed-8374706
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-83747062021-08-24 A Methodology for Validating Diversity in Synthetic Time Series Generation() Bahrpeyma, Fouad Roantree, Mark Cappellari, Paolo Scriney, Michael McCarren, Andrew MethodsX Method Article In order for researchers to deliver robust evaluations of time series models, it often requires high volumes of data to ensure the appropriate level of rigor in testing. However, for many researchers, the lack of time series presents a barrier to a deeper evaluation. While researchers have developed and used synthetic datasets, the development of this data requires a methodological approach to testing the entire dataset against a set of metrics which capture the diversity of the dataset. Unless researchers are confident that their test datasets display a broad set of time series characteristics, it may favor one type of predictive model over another. This can have the effect of undermining the evaluation of new predictive methods. In this paper, we present a new approach to generating and evaluating a high number of time series data. The construction algorithm and validation framework are described in detail, together with an analysis of the level of diversity present in the synthetic dataset. Elsevier 2021-07-24 /pmc/articles/PMC8374706/ /pubmed/34434865 http://dx.doi.org/10.1016/j.mex.2021.101459 Text en © 2021 The Author(s) https://creativecommons.org/licenses/by/4.0/This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Method Article
Bahrpeyma, Fouad
Roantree, Mark
Cappellari, Paolo
Scriney, Michael
McCarren, Andrew
A Methodology for Validating Diversity in Synthetic Time Series Generation()
title A Methodology for Validating Diversity in Synthetic Time Series Generation()
title_full A Methodology for Validating Diversity in Synthetic Time Series Generation()
title_fullStr A Methodology for Validating Diversity in Synthetic Time Series Generation()
title_full_unstemmed A Methodology for Validating Diversity in Synthetic Time Series Generation()
title_short A Methodology for Validating Diversity in Synthetic Time Series Generation()
title_sort methodology for validating diversity in synthetic time series generation()
topic Method Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8374706/
https://www.ncbi.nlm.nih.gov/pubmed/34434865
http://dx.doi.org/10.1016/j.mex.2021.101459
work_keys_str_mv AT bahrpeymafouad amethodologyforvalidatingdiversityinsynthetictimeseriesgeneration
AT roantreemark amethodologyforvalidatingdiversityinsynthetictimeseriesgeneration
AT cappellaripaolo amethodologyforvalidatingdiversityinsynthetictimeseriesgeneration
AT scrineymichael amethodologyforvalidatingdiversityinsynthetictimeseriesgeneration
AT mccarrenandrew amethodologyforvalidatingdiversityinsynthetictimeseriesgeneration
AT bahrpeymafouad methodologyforvalidatingdiversityinsynthetictimeseriesgeneration
AT roantreemark methodologyforvalidatingdiversityinsynthetictimeseriesgeneration
AT cappellaripaolo methodologyforvalidatingdiversityinsynthetictimeseriesgeneration
AT scrineymichael methodologyforvalidatingdiversityinsynthetictimeseriesgeneration
AT mccarrenandrew methodologyforvalidatingdiversityinsynthetictimeseriesgeneration