Cargando…

Outlier Detection Method in Linear Regression Based on Sum of Arithmetic Progression

We introduce a new nonparametric outlier detection method for linear series, which requires no missing or removed data imputation. For an arithmetic progression (a series without outliers) with n elements, the ratio (R) of the sum of the minimum and the maximum elements and the sum of all elements i...

Descripción completa

Detalles Bibliográficos
Autores principales: Adikaram, K. K. L. B., Hussein, M. A., Effenberger, M., Becker, T.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi Publishing Corporation 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4121229/
https://www.ncbi.nlm.nih.gov/pubmed/25121139
http://dx.doi.org/10.1155/2014/821623
_version_ 1782329197355597824
author Adikaram, K. K. L. B.
Hussein, M. A.
Effenberger, M.
Becker, T.
author_facet Adikaram, K. K. L. B.
Hussein, M. A.
Effenberger, M.
Becker, T.
author_sort Adikaram, K. K. L. B.
collection PubMed
description We introduce a new nonparametric outlier detection method for linear series, which requires no missing or removed data imputation. For an arithmetic progression (a series without outliers) with n elements, the ratio (R) of the sum of the minimum and the maximum elements and the sum of all elements is always 2/n : (0,1]. R ≠ 2/n always implies the existence of outliers. Usually, R < 2/n implies that the minimum is an outlier, and R > 2/n implies that the maximum is an outlier. Based upon this, we derived a new method for identifying significant and nonsignificant outliers, separately. Two different techniques were used to manage missing data and removed outliers: (1) recalculate the terms after (or before) the removed or missing element while maintaining the initial angle in relation to a certain point or (2) transform data into a constant value, which is not affected by missing or removed elements. With a reference element, which was not an outlier, the method detected all outliers from data sets with 6 to 1000 elements containing 50% outliers which deviated by a factor of ±1.0e − 2 to ±1.0e + 2 from the correct value.
format Online
Article
Text
id pubmed-4121229
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Hindawi Publishing Corporation
record_format MEDLINE/PubMed
spelling pubmed-41212292014-08-12 Outlier Detection Method in Linear Regression Based on Sum of Arithmetic Progression Adikaram, K. K. L. B. Hussein, M. A. Effenberger, M. Becker, T. ScientificWorldJournal Research Article We introduce a new nonparametric outlier detection method for linear series, which requires no missing or removed data imputation. For an arithmetic progression (a series without outliers) with n elements, the ratio (R) of the sum of the minimum and the maximum elements and the sum of all elements is always 2/n : (0,1]. R ≠ 2/n always implies the existence of outliers. Usually, R < 2/n implies that the minimum is an outlier, and R > 2/n implies that the maximum is an outlier. Based upon this, we derived a new method for identifying significant and nonsignificant outliers, separately. Two different techniques were used to manage missing data and removed outliers: (1) recalculate the terms after (or before) the removed or missing element while maintaining the initial angle in relation to a certain point or (2) transform data into a constant value, which is not affected by missing or removed elements. With a reference element, which was not an outlier, the method detected all outliers from data sets with 6 to 1000 elements containing 50% outliers which deviated by a factor of ±1.0e − 2 to ±1.0e + 2 from the correct value. Hindawi Publishing Corporation 2014 2014-07-10 /pmc/articles/PMC4121229/ /pubmed/25121139 http://dx.doi.org/10.1155/2014/821623 Text en Copyright © 2014 K. K. L. B. Adikaram et al. https://creativecommons.org/licenses/by/3.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Adikaram, K. K. L. B.
Hussein, M. A.
Effenberger, M.
Becker, T.
Outlier Detection Method in Linear Regression Based on Sum of Arithmetic Progression
title Outlier Detection Method in Linear Regression Based on Sum of Arithmetic Progression
title_full Outlier Detection Method in Linear Regression Based on Sum of Arithmetic Progression
title_fullStr Outlier Detection Method in Linear Regression Based on Sum of Arithmetic Progression
title_full_unstemmed Outlier Detection Method in Linear Regression Based on Sum of Arithmetic Progression
title_short Outlier Detection Method in Linear Regression Based on Sum of Arithmetic Progression
title_sort outlier detection method in linear regression based on sum of arithmetic progression
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4121229/
https://www.ncbi.nlm.nih.gov/pubmed/25121139
http://dx.doi.org/10.1155/2014/821623
work_keys_str_mv AT adikaramkklb outlierdetectionmethodinlinearregressionbasedonsumofarithmeticprogression
AT husseinma outlierdetectionmethodinlinearregressionbasedonsumofarithmeticprogression
AT effenbergerm outlierdetectionmethodinlinearregressionbasedonsumofarithmeticprogression
AT beckert outlierdetectionmethodinlinearregressionbasedonsumofarithmeticprogression