Cargando…

A Coding Basis and Three-in-One Integrated Data Visualization Method ‘Ana’ for the Rapid Analysis of Multidimensional Omics Dataset

With innovations and advancements in analytical instruments and computer technology, omics studies based on statistical analysis, such as phytochemical omics, oilomics/lipidomics, proteomics, metabolomics, and glycomics, are increasingly popular in the areas of food chemistry and nutrition science....

Descripción completa

Detalles Bibliográficos
Autores principales: Zhao, Hefei, Wang, Selina C.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9698950/
https://www.ncbi.nlm.nih.gov/pubmed/36430999
http://dx.doi.org/10.3390/life12111864
_version_ 1784838947870867456
author Zhao, Hefei
Wang, Selina C.
author_facet Zhao, Hefei
Wang, Selina C.
author_sort Zhao, Hefei
collection PubMed
description With innovations and advancements in analytical instruments and computer technology, omics studies based on statistical analysis, such as phytochemical omics, oilomics/lipidomics, proteomics, metabolomics, and glycomics, are increasingly popular in the areas of food chemistry and nutrition science. However, a remaining hurdle is the labor-intensive data process because learning coding skills and software operations are usually time-consuming for researchers without coding backgrounds. A MATLAB(®) coding basis and three-in-one integrated method, ‘Ana’, was created for data visualizations and statistical analysis in this work. The program loaded and analyzed an omics dataset from an Excel(®) file with 7 samples * 22 compounds as an example, and output six figures for three types of data visualization, including a 3D heatmap, heatmap hierarchical clustering analysis, and principal component analysis (PCA), in 18 s on a personal computer (PC) with a Windows 10 system and in 20 s on a Mac with a MacOS Monterey system. The code is rapid and efficient to print out high-quality figures up to 150 or 300 dpi. The output figures provide enough contrast to differentiate the omics dataset by both color code and bar size adjustments per their higher or lower values, allowing the figures to be qualified for publication and presentation purposes. It provides a rapid analysis method that would liberate researchers from labor-intensive and time-consuming manual or coding basis data analysis. A coding example with proper code annotations and completed user guidance is provided for undergraduate and postgraduate students to learn coding basis statistical data analysis and to help them utilize such techniques for their future research.
format Online
Article
Text
id pubmed-9698950
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-96989502022-11-26 A Coding Basis and Three-in-One Integrated Data Visualization Method ‘Ana’ for the Rapid Analysis of Multidimensional Omics Dataset Zhao, Hefei Wang, Selina C. Life (Basel) Technical Note With innovations and advancements in analytical instruments and computer technology, omics studies based on statistical analysis, such as phytochemical omics, oilomics/lipidomics, proteomics, metabolomics, and glycomics, are increasingly popular in the areas of food chemistry and nutrition science. However, a remaining hurdle is the labor-intensive data process because learning coding skills and software operations are usually time-consuming for researchers without coding backgrounds. A MATLAB(®) coding basis and three-in-one integrated method, ‘Ana’, was created for data visualizations and statistical analysis in this work. The program loaded and analyzed an omics dataset from an Excel(®) file with 7 samples * 22 compounds as an example, and output six figures for three types of data visualization, including a 3D heatmap, heatmap hierarchical clustering analysis, and principal component analysis (PCA), in 18 s on a personal computer (PC) with a Windows 10 system and in 20 s on a Mac with a MacOS Monterey system. The code is rapid and efficient to print out high-quality figures up to 150 or 300 dpi. The output figures provide enough contrast to differentiate the omics dataset by both color code and bar size adjustments per their higher or lower values, allowing the figures to be qualified for publication and presentation purposes. It provides a rapid analysis method that would liberate researchers from labor-intensive and time-consuming manual or coding basis data analysis. A coding example with proper code annotations and completed user guidance is provided for undergraduate and postgraduate students to learn coding basis statistical data analysis and to help them utilize such techniques for their future research. MDPI 2022-11-12 /pmc/articles/PMC9698950/ /pubmed/36430999 http://dx.doi.org/10.3390/life12111864 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Technical Note
Zhao, Hefei
Wang, Selina C.
A Coding Basis and Three-in-One Integrated Data Visualization Method ‘Ana’ for the Rapid Analysis of Multidimensional Omics Dataset
title A Coding Basis and Three-in-One Integrated Data Visualization Method ‘Ana’ for the Rapid Analysis of Multidimensional Omics Dataset
title_full A Coding Basis and Three-in-One Integrated Data Visualization Method ‘Ana’ for the Rapid Analysis of Multidimensional Omics Dataset
title_fullStr A Coding Basis and Three-in-One Integrated Data Visualization Method ‘Ana’ for the Rapid Analysis of Multidimensional Omics Dataset
title_full_unstemmed A Coding Basis and Three-in-One Integrated Data Visualization Method ‘Ana’ for the Rapid Analysis of Multidimensional Omics Dataset
title_short A Coding Basis and Three-in-One Integrated Data Visualization Method ‘Ana’ for the Rapid Analysis of Multidimensional Omics Dataset
title_sort coding basis and three-in-one integrated data visualization method ‘ana’ for the rapid analysis of multidimensional omics dataset
topic Technical Note
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9698950/
https://www.ncbi.nlm.nih.gov/pubmed/36430999
http://dx.doi.org/10.3390/life12111864
work_keys_str_mv AT zhaohefei acodingbasisandthreeinoneintegrateddatavisualizationmethodanafortherapidanalysisofmultidimensionalomicsdataset
AT wangselinac acodingbasisandthreeinoneintegrateddatavisualizationmethodanafortherapidanalysisofmultidimensionalomicsdataset
AT zhaohefei codingbasisandthreeinoneintegrateddatavisualizationmethodanafortherapidanalysisofmultidimensionalomicsdataset
AT wangselinac codingbasisandthreeinoneintegrateddatavisualizationmethodanafortherapidanalysisofmultidimensionalomicsdataset