Cargando…
pyDRMetrics - A Python toolkit for dimensionality reduction quality assessment
High-dimensional data are pervasive in this bigdata era. To avoid the curse of the dimensionality problem, various dimensionality reduction (DR) algorithms have been proposed. To facilitate systematic DR quality comparison and assessment, this paper reviews related metrics and develops an open-sourc...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Elsevier
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7887408/ https://www.ncbi.nlm.nih.gov/pubmed/33644472 http://dx.doi.org/10.1016/j.heliyon.2021.e06199 |
_version_ | 1783651976192458752 |
---|---|
author | Zhang, Yinsheng Shang, Qian Zhang, Guoming |
author_facet | Zhang, Yinsheng Shang, Qian Zhang, Guoming |
author_sort | Zhang, Yinsheng |
collection | PubMed |
description | High-dimensional data are pervasive in this bigdata era. To avoid the curse of the dimensionality problem, various dimensionality reduction (DR) algorithms have been proposed. To facilitate systematic DR quality comparison and assessment, this paper reviews related metrics and develops an open-source Python package pyDRMetrics. Supported metrics include reconstruction error, distance matrix, residual variance, ranking matrix, co-ranking matrix, trustworthiness, continuity, co-k-nearest neighbor size, LCMC (local continuity meta criterion), and rank-based local/global properties. pyDRMetrics provides a native Python class and a web-oriented API. A case study of mass spectra is conducted to demonstrate the package functions. A web GUI wrapper is also published to support user-friendly B/S applications. |
format | Online Article Text |
id | pubmed-7887408 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Elsevier |
record_format | MEDLINE/PubMed |
spelling | pubmed-78874082021-02-26 pyDRMetrics - A Python toolkit for dimensionality reduction quality assessment Zhang, Yinsheng Shang, Qian Zhang, Guoming Heliyon Research Article High-dimensional data are pervasive in this bigdata era. To avoid the curse of the dimensionality problem, various dimensionality reduction (DR) algorithms have been proposed. To facilitate systematic DR quality comparison and assessment, this paper reviews related metrics and develops an open-source Python package pyDRMetrics. Supported metrics include reconstruction error, distance matrix, residual variance, ranking matrix, co-ranking matrix, trustworthiness, continuity, co-k-nearest neighbor size, LCMC (local continuity meta criterion), and rank-based local/global properties. pyDRMetrics provides a native Python class and a web-oriented API. A case study of mass spectra is conducted to demonstrate the package functions. A web GUI wrapper is also published to support user-friendly B/S applications. Elsevier 2021-02-06 /pmc/articles/PMC7887408/ /pubmed/33644472 http://dx.doi.org/10.1016/j.heliyon.2021.e06199 Text en © 2021 The Author(s) http://creativecommons.org/licenses/by-nc-nd/4.0/ This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/). |
spellingShingle | Research Article Zhang, Yinsheng Shang, Qian Zhang, Guoming pyDRMetrics - A Python toolkit for dimensionality reduction quality assessment |
title | pyDRMetrics - A Python toolkit for dimensionality reduction quality assessment |
title_full | pyDRMetrics - A Python toolkit for dimensionality reduction quality assessment |
title_fullStr | pyDRMetrics - A Python toolkit for dimensionality reduction quality assessment |
title_full_unstemmed | pyDRMetrics - A Python toolkit for dimensionality reduction quality assessment |
title_short | pyDRMetrics - A Python toolkit for dimensionality reduction quality assessment |
title_sort | pydrmetrics - a python toolkit for dimensionality reduction quality assessment |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7887408/ https://www.ncbi.nlm.nih.gov/pubmed/33644472 http://dx.doi.org/10.1016/j.heliyon.2021.e06199 |
work_keys_str_mv | AT zhangyinsheng pydrmetricsapythontoolkitfordimensionalityreductionqualityassessment AT shangqian pydrmetricsapythontoolkitfordimensionalityreductionqualityassessment AT zhangguoming pydrmetricsapythontoolkitfordimensionalityreductionqualityassessment |