Cargando…
Outlier Detection Method in Linear Regression Based on Sum of Arithmetic Progression
We introduce a new nonparametric outlier detection method for linear series, which requires no missing or removed data imputation. For an arithmetic progression (a series without outliers) with n elements, the ratio (R) of the sum of the minimum and the maximum elements and the sum of all elements i...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Hindawi Publishing Corporation
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4121229/ https://www.ncbi.nlm.nih.gov/pubmed/25121139 http://dx.doi.org/10.1155/2014/821623 |
_version_ | 1782329197355597824 |
---|---|
author | Adikaram, K. K. L. B. Hussein, M. A. Effenberger, M. Becker, T. |
author_facet | Adikaram, K. K. L. B. Hussein, M. A. Effenberger, M. Becker, T. |
author_sort | Adikaram, K. K. L. B. |
collection | PubMed |
description | We introduce a new nonparametric outlier detection method for linear series, which requires no missing or removed data imputation. For an arithmetic progression (a series without outliers) with n elements, the ratio (R) of the sum of the minimum and the maximum elements and the sum of all elements is always 2/n : (0,1]. R ≠ 2/n always implies the existence of outliers. Usually, R < 2/n implies that the minimum is an outlier, and R > 2/n implies that the maximum is an outlier. Based upon this, we derived a new method for identifying significant and nonsignificant outliers, separately. Two different techniques were used to manage missing data and removed outliers: (1) recalculate the terms after (or before) the removed or missing element while maintaining the initial angle in relation to a certain point or (2) transform data into a constant value, which is not affected by missing or removed elements. With a reference element, which was not an outlier, the method detected all outliers from data sets with 6 to 1000 elements containing 50% outliers which deviated by a factor of ±1.0e − 2 to ±1.0e + 2 from the correct value. |
format | Online Article Text |
id | pubmed-4121229 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | Hindawi Publishing Corporation |
record_format | MEDLINE/PubMed |
spelling | pubmed-41212292014-08-12 Outlier Detection Method in Linear Regression Based on Sum of Arithmetic Progression Adikaram, K. K. L. B. Hussein, M. A. Effenberger, M. Becker, T. ScientificWorldJournal Research Article We introduce a new nonparametric outlier detection method for linear series, which requires no missing or removed data imputation. For an arithmetic progression (a series without outliers) with n elements, the ratio (R) of the sum of the minimum and the maximum elements and the sum of all elements is always 2/n : (0,1]. R ≠ 2/n always implies the existence of outliers. Usually, R < 2/n implies that the minimum is an outlier, and R > 2/n implies that the maximum is an outlier. Based upon this, we derived a new method for identifying significant and nonsignificant outliers, separately. Two different techniques were used to manage missing data and removed outliers: (1) recalculate the terms after (or before) the removed or missing element while maintaining the initial angle in relation to a certain point or (2) transform data into a constant value, which is not affected by missing or removed elements. With a reference element, which was not an outlier, the method detected all outliers from data sets with 6 to 1000 elements containing 50% outliers which deviated by a factor of ±1.0e − 2 to ±1.0e + 2 from the correct value. Hindawi Publishing Corporation 2014 2014-07-10 /pmc/articles/PMC4121229/ /pubmed/25121139 http://dx.doi.org/10.1155/2014/821623 Text en Copyright © 2014 K. K. L. B. Adikaram et al. https://creativecommons.org/licenses/by/3.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Adikaram, K. K. L. B. Hussein, M. A. Effenberger, M. Becker, T. Outlier Detection Method in Linear Regression Based on Sum of Arithmetic Progression |
title | Outlier Detection Method in Linear Regression Based on Sum of Arithmetic Progression |
title_full | Outlier Detection Method in Linear Regression Based on Sum of Arithmetic Progression |
title_fullStr | Outlier Detection Method in Linear Regression Based on Sum of Arithmetic Progression |
title_full_unstemmed | Outlier Detection Method in Linear Regression Based on Sum of Arithmetic Progression |
title_short | Outlier Detection Method in Linear Regression Based on Sum of Arithmetic Progression |
title_sort | outlier detection method in linear regression based on sum of arithmetic progression |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4121229/ https://www.ncbi.nlm.nih.gov/pubmed/25121139 http://dx.doi.org/10.1155/2014/821623 |
work_keys_str_mv | AT adikaramkklb outlierdetectionmethodinlinearregressionbasedonsumofarithmeticprogression AT husseinma outlierdetectionmethodinlinearregressionbasedonsumofarithmeticprogression AT effenbergerm outlierdetectionmethodinlinearregressionbasedonsumofarithmeticprogression AT beckert outlierdetectionmethodinlinearregressionbasedonsumofarithmeticprogression |