Cargando…

Outlier detection using iterative adaptive mini-minimum spanning tree generation with applications on medical data

As an important technique for data pre-processing, outlier detection plays a crucial role in various real applications and has gained substantial attention, especially in medical fields. Despite the importance of outlier detection, many existing methods are vulnerable to the distribution of outliers...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Jia, Li, Jiangwei, Wang, Chenxu, Verbeek, Fons J., Schultz, Tanja, Liu, Hui
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10613083/
https://www.ncbi.nlm.nih.gov/pubmed/37900945
http://dx.doi.org/10.3389/fphys.2023.1233341
_version_ 1785128749723811840
author Li, Jia
Li, Jiangwei
Wang, Chenxu
Verbeek, Fons J.
Schultz, Tanja
Liu, Hui
author_facet Li, Jia
Li, Jiangwei
Wang, Chenxu
Verbeek, Fons J.
Schultz, Tanja
Liu, Hui
author_sort Li, Jia
collection PubMed
description As an important technique for data pre-processing, outlier detection plays a crucial role in various real applications and has gained substantial attention, especially in medical fields. Despite the importance of outlier detection, many existing methods are vulnerable to the distribution of outliers and require prior knowledge, such as the outlier proportion. To address this problem to some extent, this article proposes an adaptive mini-minimum spanning tree-based outlier detection (MMOD) method, which utilizes a novel distance measure by scaling the Euclidean distance. For datasets containing different densities and taking on different shapes, our method can identify outliers without prior knowledge of outlier percentages. The results on both real-world medical data corpora and intuitive synthetic datasets demonstrate the effectiveness of the proposed method compared to state-of-the-art methods.
format Online
Article
Text
id pubmed-10613083
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-106130832023-10-29 Outlier detection using iterative adaptive mini-minimum spanning tree generation with applications on medical data Li, Jia Li, Jiangwei Wang, Chenxu Verbeek, Fons J. Schultz, Tanja Liu, Hui Front Physiol Physiology As an important technique for data pre-processing, outlier detection plays a crucial role in various real applications and has gained substantial attention, especially in medical fields. Despite the importance of outlier detection, many existing methods are vulnerable to the distribution of outliers and require prior knowledge, such as the outlier proportion. To address this problem to some extent, this article proposes an adaptive mini-minimum spanning tree-based outlier detection (MMOD) method, which utilizes a novel distance measure by scaling the Euclidean distance. For datasets containing different densities and taking on different shapes, our method can identify outliers without prior knowledge of outlier percentages. The results on both real-world medical data corpora and intuitive synthetic datasets demonstrate the effectiveness of the proposed method compared to state-of-the-art methods. Frontiers Media S.A. 2023-10-13 /pmc/articles/PMC10613083/ /pubmed/37900945 http://dx.doi.org/10.3389/fphys.2023.1233341 Text en Copyright © 2023 Li, Li, Wang, Verbeek, Schultz and Liu. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Physiology
Li, Jia
Li, Jiangwei
Wang, Chenxu
Verbeek, Fons J.
Schultz, Tanja
Liu, Hui
Outlier detection using iterative adaptive mini-minimum spanning tree generation with applications on medical data
title Outlier detection using iterative adaptive mini-minimum spanning tree generation with applications on medical data
title_full Outlier detection using iterative adaptive mini-minimum spanning tree generation with applications on medical data
title_fullStr Outlier detection using iterative adaptive mini-minimum spanning tree generation with applications on medical data
title_full_unstemmed Outlier detection using iterative adaptive mini-minimum spanning tree generation with applications on medical data
title_short Outlier detection using iterative adaptive mini-minimum spanning tree generation with applications on medical data
title_sort outlier detection using iterative adaptive mini-minimum spanning tree generation with applications on medical data
topic Physiology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10613083/
https://www.ncbi.nlm.nih.gov/pubmed/37900945
http://dx.doi.org/10.3389/fphys.2023.1233341
work_keys_str_mv AT lijia outlierdetectionusingiterativeadaptiveminiminimumspanningtreegenerationwithapplicationsonmedicaldata
AT lijiangwei outlierdetectionusingiterativeadaptiveminiminimumspanningtreegenerationwithapplicationsonmedicaldata
AT wangchenxu outlierdetectionusingiterativeadaptiveminiminimumspanningtreegenerationwithapplicationsonmedicaldata
AT verbeekfonsj outlierdetectionusingiterativeadaptiveminiminimumspanningtreegenerationwithapplicationsonmedicaldata
AT schultztanja outlierdetectionusingiterativeadaptiveminiminimumspanningtreegenerationwithapplicationsonmedicaldata
AT liuhui outlierdetectionusingiterativeadaptiveminiminimumspanningtreegenerationwithapplicationsonmedicaldata