
Performance comparison of three scaling algorithms in NMR-based metabolomics analysis


Bibliographic Details
Main Authors: Liu, Xia; Fang, Yiqun; Ma, Haifeng; Zhang, Naixia; Li, Ci
Format: Online Article Text
Language: English
Published: De Gruyter 2023
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10044292/
https://www.ncbi.nlm.nih.gov/pubmed/36998512
http://dx.doi.org/10.1515/biol-2022-0556
Description
Summary: Unit variance (UV) scaling, mean centering (CTR) scaling, and Pareto (Par) scaling are three commonly used algorithms in the preprocessing of metabolomics data. Based on our NMR-based metabolomics studies, we found that the clustering identification performances of these three scaling methods differed dramatically when tested on the spectral data of 48 young athletes’ urine samples and of spleen tissue (from mice), serum (from mice), and cell (from Staphylococcus aureus) samples. Our data suggested that, for the extraction of clustering information, UV scaling could serve as a robust approach for NMR metabolomics data, even in the presence of technical errors. However, for the identification of discriminative metabolites, UV scaling, CTR scaling, and Par scaling could extract discriminative metabolites equally efficiently based on the coefficient values. Based on the data presented in this study, we propose an optimal working pipeline for the selection of scaling algorithms in NMR-based metabolomics analysis, which has the potential to serve as guidance for junior researchers working in the NMR-based metabolomics research field.
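For readers unfamiliar with the three scaling methods named in the summary, the following is a minimal sketch of their commonly used textbook definitions, not the authors' pipeline. It assumes a samples-by-variables table, per-column scaling, and the sample standard deviation (n - 1 denominator); the function name `scale_columns` and the toy data are hypothetical.

```python
import math
import statistics


def scale_columns(rows, method):
    """Scale each column (variable) of a samples-by-variables table.

    method: "ctr" (mean centering), "uv" (unit variance), or "par" (Pareto).
    """
    scaled_columns = []
    for col in zip(*rows):  # iterate over variables (e.g. spectral bins)
        mean = statistics.fmean(col)
        sd = statistics.stdev(col)  # assumption: sample SD (n - 1 denominator)
        if method == "ctr":
            divisor = 1.0            # subtract the mean only
        elif method == "uv":
            divisor = sd             # divide by the standard deviation
        elif method == "par":
            divisor = math.sqrt(sd)  # divide by the square root of the SD
        else:
            raise ValueError(f"unknown method: {method!r}")
        if divisor == 0:
            raise ValueError("a constant column cannot be UV- or Pareto-scaled")
        scaled_columns.append([(x - mean) / divisor for x in col])
    # Transpose back to samples-by-variables order.
    return [list(row) for row in zip(*scaled_columns)]


# Hypothetical toy data: 3 samples x 2 variables with very different magnitudes.
toy = [[1.0, 100.0], [2.0, 300.0], [3.0, 500.0]]
ctr = scale_columns(toy, "ctr")
uv = scale_columns(toy, "uv")
par = scale_columns(toy, "par")
```

In this sketch, mean centering leaves the large-magnitude variable dominant, UV scaling gives both variables equal spread, and Pareto scaling falls in between, which is the usual motivation for comparing the three.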