Cargando…
Sequential linear regression with online standardized data
The present study addresses the problem of sequential least square multidimensional linear regression, particularly in the case of a data stream, using a stochastic approximation process. To avoid the phenomenon of numerical explosion which can be encountered and to reduce the computing time in orde...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5773231/ https://www.ncbi.nlm.nih.gov/pubmed/29346392 http://dx.doi.org/10.1371/journal.pone.0191186 |
_version_ | 1783293531558772736 |
---|---|
author | Duarte, Kévin Monnez, Jean-Marie Albuisson, Eliane |
author_facet | Duarte, Kévin Monnez, Jean-Marie Albuisson, Eliane |
author_sort | Duarte, Kévin |
collection | PubMed |
description | The present study addresses the problem of sequential least square multidimensional linear regression, particularly in the case of a data stream, using a stochastic approximation process. To avoid the phenomenon of numerical explosion which can be encountered and to reduce the computing time in order to take into account a maximum of arriving data, we propose using a process with online standardized data instead of raw data and the use of several observations per step or all observations until the current step. Herein, we define and study the almost sure convergence of three processes with online standardized data: a classical process with a variable step-size and use of a varying number of observations per step, an averaged process with a constant step-size and use of a varying number of observations per step, and a process with a variable or constant step-size and use of all observations until the current step. Their convergence is obtained under more general assumptions than classical ones. These processes are compared to classical processes on 11 datasets for a fixed total number of observations used and thereafter for a fixed processing time. Analyses indicate that the third-defined process typically yields the best results. |
format | Online Article Text |
id | pubmed-5773231 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-57732312018-01-26 Sequential linear regression with online standardized data Duarte, Kévin Monnez, Jean-Marie Albuisson, Eliane PLoS One Research Article The present study addresses the problem of sequential least square multidimensional linear regression, particularly in the case of a data stream, using a stochastic approximation process. To avoid the phenomenon of numerical explosion which can be encountered and to reduce the computing time in order to take into account a maximum of arriving data, we propose using a process with online standardized data instead of raw data and the use of several observations per step or all observations until the current step. Herein, we define and study the almost sure convergence of three processes with online standardized data: a classical process with a variable step-size and use of a varying number of observations per step, an averaged process with a constant step-size and use of a varying number of observations per step, and a process with a variable or constant step-size and use of all observations until the current step. Their convergence is obtained under more general assumptions than classical ones. These processes are compared to classical processes on 11 datasets for a fixed total number of observations used and thereafter for a fixed processing time. Analyses indicate that the third-defined process typically yields the best results. Public Library of Science 2018-01-18 /pmc/articles/PMC5773231/ /pubmed/29346392 http://dx.doi.org/10.1371/journal.pone.0191186 Text en © 2018 Duarte et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Duarte, Kévin Monnez, Jean-Marie Albuisson, Eliane Sequential linear regression with online standardized data |
title | Sequential linear regression with online standardized data |
title_full | Sequential linear regression with online standardized data |
title_fullStr | Sequential linear regression with online standardized data |
title_full_unstemmed | Sequential linear regression with online standardized data |
title_short | Sequential linear regression with online standardized data |
title_sort | sequential linear regression with online standardized data |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5773231/ https://www.ncbi.nlm.nih.gov/pubmed/29346392 http://dx.doi.org/10.1371/journal.pone.0191186 |
work_keys_str_mv | AT duartekevin sequentiallinearregressionwithonlinestandardizeddata AT monnezjeanmarie sequentiallinearregressionwithonlinestandardizeddata AT albuissoneliane sequentiallinearregressionwithonlinestandardizeddata |