Cargando…
A Methodology for Validating Diversity in Synthetic Time Series Generation()
In order for researchers to deliver robust evaluations of time series models, it often requires high volumes of data to ensure the appropriate level of rigor in testing. However, for many researchers, the lack of time series presents a barrier to a deeper evaluation. While researchers have developed...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Elsevier
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8374706/ https://www.ncbi.nlm.nih.gov/pubmed/34434865 http://dx.doi.org/10.1016/j.mex.2021.101459 |
_version_ | 1783740173916307456 |
---|---|
author | Bahrpeyma, Fouad Roantree, Mark Cappellari, Paolo Scriney, Michael McCarren, Andrew |
author_facet | Bahrpeyma, Fouad Roantree, Mark Cappellari, Paolo Scriney, Michael McCarren, Andrew |
author_sort | Bahrpeyma, Fouad |
collection | PubMed |
description | In order for researchers to deliver robust evaluations of time series models, it often requires high volumes of data to ensure the appropriate level of rigor in testing. However, for many researchers, the lack of time series presents a barrier to a deeper evaluation. While researchers have developed and used synthetic datasets, the development of this data requires a methodological approach to testing the entire dataset against a set of metrics which capture the diversity of the dataset. Unless researchers are confident that their test datasets display a broad set of time series characteristics, it may favor one type of predictive model over another. This can have the effect of undermining the evaluation of new predictive methods. In this paper, we present a new approach to generating and evaluating a high number of time series data. The construction algorithm and validation framework are described in detail, together with an analysis of the level of diversity present in the synthetic dataset. |
format | Online Article Text |
id | pubmed-8374706 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Elsevier |
record_format | MEDLINE/PubMed |
spelling | pubmed-83747062021-08-24 A Methodology for Validating Diversity in Synthetic Time Series Generation() Bahrpeyma, Fouad Roantree, Mark Cappellari, Paolo Scriney, Michael McCarren, Andrew MethodsX Method Article In order for researchers to deliver robust evaluations of time series models, it often requires high volumes of data to ensure the appropriate level of rigor in testing. However, for many researchers, the lack of time series presents a barrier to a deeper evaluation. While researchers have developed and used synthetic datasets, the development of this data requires a methodological approach to testing the entire dataset against a set of metrics which capture the diversity of the dataset. Unless researchers are confident that their test datasets display a broad set of time series characteristics, it may favor one type of predictive model over another. This can have the effect of undermining the evaluation of new predictive methods. In this paper, we present a new approach to generating and evaluating a high number of time series data. The construction algorithm and validation framework are described in detail, together with an analysis of the level of diversity present in the synthetic dataset. Elsevier 2021-07-24 /pmc/articles/PMC8374706/ /pubmed/34434865 http://dx.doi.org/10.1016/j.mex.2021.101459 Text en © 2021 The Author(s) https://creativecommons.org/licenses/by/4.0/This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Method Article Bahrpeyma, Fouad Roantree, Mark Cappellari, Paolo Scriney, Michael McCarren, Andrew A Methodology for Validating Diversity in Synthetic Time Series Generation() |
title | A Methodology for Validating Diversity in Synthetic Time Series Generation() |
title_full | A Methodology for Validating Diversity in Synthetic Time Series Generation() |
title_fullStr | A Methodology for Validating Diversity in Synthetic Time Series Generation() |
title_full_unstemmed | A Methodology for Validating Diversity in Synthetic Time Series Generation() |
title_short | A Methodology for Validating Diversity in Synthetic Time Series Generation() |
title_sort | methodology for validating diversity in synthetic time series generation() |
topic | Method Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8374706/ https://www.ncbi.nlm.nih.gov/pubmed/34434865 http://dx.doi.org/10.1016/j.mex.2021.101459 |
work_keys_str_mv | AT bahrpeymafouad amethodologyforvalidatingdiversityinsynthetictimeseriesgeneration AT roantreemark amethodologyforvalidatingdiversityinsynthetictimeseriesgeneration AT cappellaripaolo amethodologyforvalidatingdiversityinsynthetictimeseriesgeneration AT scrineymichael amethodologyforvalidatingdiversityinsynthetictimeseriesgeneration AT mccarrenandrew amethodologyforvalidatingdiversityinsynthetictimeseriesgeneration AT bahrpeymafouad methodologyforvalidatingdiversityinsynthetictimeseriesgeneration AT roantreemark methodologyforvalidatingdiversityinsynthetictimeseriesgeneration AT cappellaripaolo methodologyforvalidatingdiversityinsynthetictimeseriesgeneration AT scrineymichael methodologyforvalidatingdiversityinsynthetictimeseriesgeneration AT mccarrenandrew methodologyforvalidatingdiversityinsynthetictimeseriesgeneration |