Cargando…
Outlier detection using iterative adaptive mini-minimum spanning tree generation with applications on medical data
As an important technique for data pre-processing, outlier detection plays a crucial role in various real applications and has gained substantial attention, especially in medical fields. Despite the importance of outlier detection, many existing methods are vulnerable to the distribution of outliers...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10613083/ https://www.ncbi.nlm.nih.gov/pubmed/37900945 http://dx.doi.org/10.3389/fphys.2023.1233341 |
_version_ | 1785128749723811840 |
---|---|
author | Li, Jia Li, Jiangwei Wang, Chenxu Verbeek, Fons J. Schultz, Tanja Liu, Hui |
author_facet | Li, Jia Li, Jiangwei Wang, Chenxu Verbeek, Fons J. Schultz, Tanja Liu, Hui |
author_sort | Li, Jia |
collection | PubMed |
description | As an important technique for data pre-processing, outlier detection plays a crucial role in various real applications and has gained substantial attention, especially in medical fields. Despite the importance of outlier detection, many existing methods are vulnerable to the distribution of outliers and require prior knowledge, such as the outlier proportion. To address this problem to some extent, this article proposes an adaptive mini-minimum spanning tree-based outlier detection (MMOD) method, which utilizes a novel distance measure by scaling the Euclidean distance. For datasets containing different densities and taking on different shapes, our method can identify outliers without prior knowledge of outlier percentages. The results on both real-world medical data corpora and intuitive synthetic datasets demonstrate the effectiveness of the proposed method compared to state-of-the-art methods. |
format | Online Article Text |
id | pubmed-10613083 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-106130832023-10-29 Outlier detection using iterative adaptive mini-minimum spanning tree generation with applications on medical data Li, Jia Li, Jiangwei Wang, Chenxu Verbeek, Fons J. Schultz, Tanja Liu, Hui Front Physiol Physiology As an important technique for data pre-processing, outlier detection plays a crucial role in various real applications and has gained substantial attention, especially in medical fields. Despite the importance of outlier detection, many existing methods are vulnerable to the distribution of outliers and require prior knowledge, such as the outlier proportion. To address this problem to some extent, this article proposes an adaptive mini-minimum spanning tree-based outlier detection (MMOD) method, which utilizes a novel distance measure by scaling the Euclidean distance. For datasets containing different densities and taking on different shapes, our method can identify outliers without prior knowledge of outlier percentages. The results on both real-world medical data corpora and intuitive synthetic datasets demonstrate the effectiveness of the proposed method compared to state-of-the-art methods. Frontiers Media S.A. 2023-10-13 /pmc/articles/PMC10613083/ /pubmed/37900945 http://dx.doi.org/10.3389/fphys.2023.1233341 Text en Copyright © 2023 Li, Li, Wang, Verbeek, Schultz and Liu. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Physiology Li, Jia Li, Jiangwei Wang, Chenxu Verbeek, Fons J. Schultz, Tanja Liu, Hui Outlier detection using iterative adaptive mini-minimum spanning tree generation with applications on medical data |
title | Outlier detection using iterative adaptive mini-minimum spanning tree generation with applications on medical data |
title_full | Outlier detection using iterative adaptive mini-minimum spanning tree generation with applications on medical data |
title_fullStr | Outlier detection using iterative adaptive mini-minimum spanning tree generation with applications on medical data |
title_full_unstemmed | Outlier detection using iterative adaptive mini-minimum spanning tree generation with applications on medical data |
title_short | Outlier detection using iterative adaptive mini-minimum spanning tree generation with applications on medical data |
title_sort | outlier detection using iterative adaptive mini-minimum spanning tree generation with applications on medical data |
topic | Physiology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10613083/ https://www.ncbi.nlm.nih.gov/pubmed/37900945 http://dx.doi.org/10.3389/fphys.2023.1233341 |
work_keys_str_mv | AT lijia outlierdetectionusingiterativeadaptiveminiminimumspanningtreegenerationwithapplicationsonmedicaldata AT lijiangwei outlierdetectionusingiterativeadaptiveminiminimumspanningtreegenerationwithapplicationsonmedicaldata AT wangchenxu outlierdetectionusingiterativeadaptiveminiminimumspanningtreegenerationwithapplicationsonmedicaldata AT verbeekfonsj outlierdetectionusingiterativeadaptiveminiminimumspanningtreegenerationwithapplicationsonmedicaldata AT schultztanja outlierdetectionusingiterativeadaptiveminiminimumspanningtreegenerationwithapplicationsonmedicaldata AT liuhui outlierdetectionusingiterativeadaptiveminiminimumspanningtreegenerationwithapplicationsonmedicaldata |