Cargando…

Sequential linear regression with online standardized data

The present study addresses the problem of sequential least square multidimensional linear regression, particularly in the case of a data stream, using a stochastic approximation process. To avoid the phenomenon of numerical explosion which can be encountered and to reduce the computing time in orde...

Descripción completa

Detalles Bibliográficos
Autores principales: Duarte, Kévin, Monnez, Jean-Marie, Albuisson, Eliane
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5773231/
https://www.ncbi.nlm.nih.gov/pubmed/29346392
http://dx.doi.org/10.1371/journal.pone.0191186
_version_ 1783293531558772736
author Duarte, Kévin
Monnez, Jean-Marie
Albuisson, Eliane
author_facet Duarte, Kévin
Monnez, Jean-Marie
Albuisson, Eliane
author_sort Duarte, Kévin
collection PubMed
description The present study addresses the problem of sequential least square multidimensional linear regression, particularly in the case of a data stream, using a stochastic approximation process. To avoid the phenomenon of numerical explosion which can be encountered and to reduce the computing time in order to take into account a maximum of arriving data, we propose using a process with online standardized data instead of raw data and the use of several observations per step or all observations until the current step. Herein, we define and study the almost sure convergence of three processes with online standardized data: a classical process with a variable step-size and use of a varying number of observations per step, an averaged process with a constant step-size and use of a varying number of observations per step, and a process with a variable or constant step-size and use of all observations until the current step. Their convergence is obtained under more general assumptions than classical ones. These processes are compared to classical processes on 11 datasets for a fixed total number of observations used and thereafter for a fixed processing time. Analyses indicate that the third-defined process typically yields the best results.
format Online
Article
Text
id pubmed-5773231
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-57732312018-01-26 Sequential linear regression with online standardized data Duarte, Kévin Monnez, Jean-Marie Albuisson, Eliane PLoS One Research Article The present study addresses the problem of sequential least square multidimensional linear regression, particularly in the case of a data stream, using a stochastic approximation process. To avoid the phenomenon of numerical explosion which can be encountered and to reduce the computing time in order to take into account a maximum of arriving data, we propose using a process with online standardized data instead of raw data and the use of several observations per step or all observations until the current step. Herein, we define and study the almost sure convergence of three processes with online standardized data: a classical process with a variable step-size and use of a varying number of observations per step, an averaged process with a constant step-size and use of a varying number of observations per step, and a process with a variable or constant step-size and use of all observations until the current step. Their convergence is obtained under more general assumptions than classical ones. These processes are compared to classical processes on 11 datasets for a fixed total number of observations used and thereafter for a fixed processing time. Analyses indicate that the third-defined process typically yields the best results. Public Library of Science 2018-01-18 /pmc/articles/PMC5773231/ /pubmed/29346392 http://dx.doi.org/10.1371/journal.pone.0191186 Text en © 2018 Duarte et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Duarte, Kévin
Monnez, Jean-Marie
Albuisson, Eliane
Sequential linear regression with online standardized data
title Sequential linear regression with online standardized data
title_full Sequential linear regression with online standardized data
title_fullStr Sequential linear regression with online standardized data
title_full_unstemmed Sequential linear regression with online standardized data
title_short Sequential linear regression with online standardized data
title_sort sequential linear regression with online standardized data
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5773231/
https://www.ncbi.nlm.nih.gov/pubmed/29346392
http://dx.doi.org/10.1371/journal.pone.0191186
work_keys_str_mv AT duartekevin sequentiallinearregressionwithonlinestandardizeddata
AT monnezjeanmarie sequentiallinearregressionwithonlinestandardizeddata
AT albuissoneliane sequentiallinearregressionwithonlinestandardizeddata