Cargando…

A Methodology for Validating Diversity in Synthetic Time Series Generation()

In order for researchers to deliver robust evaluations of time series models, it often requires high volumes of data to ensure the appropriate level of rigor in testing. However, for many researchers, the lack of time series presents a barrier to a deeper evaluation. While researchers have developed...

Descripción completa

Detalles Bibliográficos
Autores principales:	Bahrpeyma, Fouad, Roantree, Mark, Cappellari, Paolo, Scriney, Michael, McCarren, Andrew
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Elsevier 2021
Materias:	Method Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8374706/ https://www.ncbi.nlm.nih.gov/pubmed/34434865 http://dx.doi.org/10.1016/j.mex.2021.101459

_version_	1783740173916307456
author	Bahrpeyma, Fouad Roantree, Mark Cappellari, Paolo Scriney, Michael McCarren, Andrew
author_facet	Bahrpeyma, Fouad Roantree, Mark Cappellari, Paolo Scriney, Michael McCarren, Andrew
author_sort	Bahrpeyma, Fouad
collection	PubMed
description	In order for researchers to deliver robust evaluations of time series models, it often requires high volumes of data to ensure the appropriate level of rigor in testing. However, for many researchers, the lack of time series presents a barrier to a deeper evaluation. While researchers have developed and used synthetic datasets, the development of this data requires a methodological approach to testing the entire dataset against a set of metrics which capture the diversity of the dataset. Unless researchers are confident that their test datasets display a broad set of time series characteristics, it may favor one type of predictive model over another. This can have the effect of undermining the evaluation of new predictive methods. In this paper, we present a new approach to generating and evaluating a high number of time series data. The construction algorithm and validation framework are described in detail, together with an analysis of the level of diversity present in the synthetic dataset.
format	Online Article Text
id	pubmed-8374706
institution	National Center for Biotechnology Information
language	English
publishDate	2021
publisher	Elsevier
record_format	MEDLINE/PubMed
spelling	pubmed-83747062021-08-24 A Methodology for Validating Diversity in Synthetic Time Series Generation() Bahrpeyma, Fouad Roantree, Mark Cappellari, Paolo Scriney, Michael McCarren, Andrew MethodsX Method Article In order for researchers to deliver robust evaluations of time series models, it often requires high volumes of data to ensure the appropriate level of rigor in testing. However, for many researchers, the lack of time series presents a barrier to a deeper evaluation. While researchers have developed and used synthetic datasets, the development of this data requires a methodological approach to testing the entire dataset against a set of metrics which capture the diversity of the dataset. Unless researchers are confident that their test datasets display a broad set of time series characteristics, it may favor one type of predictive model over another. This can have the effect of undermining the evaluation of new predictive methods. In this paper, we present a new approach to generating and evaluating a high number of time series data. The construction algorithm and validation framework are described in detail, together with an analysis of the level of diversity present in the synthetic dataset. Elsevier 2021-07-24 /pmc/articles/PMC8374706/ /pubmed/34434865 http://dx.doi.org/10.1016/j.mex.2021.101459 Text en © 2021 The Author(s) https://creativecommons.org/licenses/by/4.0/This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle	Method Article Bahrpeyma, Fouad Roantree, Mark Cappellari, Paolo Scriney, Michael McCarren, Andrew A Methodology for Validating Diversity in Synthetic Time Series Generation()
title	A Methodology for Validating Diversity in Synthetic Time Series Generation()
title_full	A Methodology for Validating Diversity in Synthetic Time Series Generation()
title_fullStr	A Methodology for Validating Diversity in Synthetic Time Series Generation()
title_full_unstemmed	A Methodology for Validating Diversity in Synthetic Time Series Generation()
title_short	A Methodology for Validating Diversity in Synthetic Time Series Generation()
title_sort	methodology for validating diversity in synthetic time series generation()
topic	Method Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8374706/ https://www.ncbi.nlm.nih.gov/pubmed/34434865 http://dx.doi.org/10.1016/j.mex.2021.101459
work_keys_str_mv	AT bahrpeymafouad amethodologyforvalidatingdiversityinsynthetictimeseriesgeneration AT roantreemark amethodologyforvalidatingdiversityinsynthetictimeseriesgeneration AT cappellaripaolo amethodologyforvalidatingdiversityinsynthetictimeseriesgeneration AT scrineymichael amethodologyforvalidatingdiversityinsynthetictimeseriesgeneration AT mccarrenandrew amethodologyforvalidatingdiversityinsynthetictimeseriesgeneration AT bahrpeymafouad methodologyforvalidatingdiversityinsynthetictimeseriesgeneration AT roantreemark methodologyforvalidatingdiversityinsynthetictimeseriesgeneration AT cappellaripaolo methodologyforvalidatingdiversityinsynthetictimeseriesgeneration AT scrineymichael methodologyforvalidatingdiversityinsynthetictimeseriesgeneration AT mccarrenandrew methodologyforvalidatingdiversityinsynthetictimeseriesgeneration

A Methodology for Validating Diversity in Synthetic Time Series Generation()

Ejemplares similares