Cargando…
A Coding Basis and Three-in-One Integrated Data Visualization Method ‘Ana’ for the Rapid Analysis of Multidimensional Omics Dataset
With innovations and advancements in analytical instruments and computer technology, omics studies based on statistical analysis, such as phytochemical omics, oilomics/lipidomics, proteomics, metabolomics, and glycomics, are increasingly popular in the areas of food chemistry and nutrition science....
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9698950/ https://www.ncbi.nlm.nih.gov/pubmed/36430999 http://dx.doi.org/10.3390/life12111864 |
_version_ | 1784838947870867456 |
---|---|
author | Zhao, Hefei Wang, Selina C. |
author_facet | Zhao, Hefei Wang, Selina C. |
author_sort | Zhao, Hefei |
collection | PubMed |
description | With innovations and advancements in analytical instruments and computer technology, omics studies based on statistical analysis, such as phytochemical omics, oilomics/lipidomics, proteomics, metabolomics, and glycomics, are increasingly popular in the areas of food chemistry and nutrition science. However, a remaining hurdle is the labor-intensive data process because learning coding skills and software operations are usually time-consuming for researchers without coding backgrounds. A MATLAB(®) coding basis and three-in-one integrated method, ‘Ana’, was created for data visualizations and statistical analysis in this work. The program loaded and analyzed an omics dataset from an Excel(®) file with 7 samples * 22 compounds as an example, and output six figures for three types of data visualization, including a 3D heatmap, heatmap hierarchical clustering analysis, and principal component analysis (PCA), in 18 s on a personal computer (PC) with a Windows 10 system and in 20 s on a Mac with a MacOS Monterey system. The code is rapid and efficient to print out high-quality figures up to 150 or 300 dpi. The output figures provide enough contrast to differentiate the omics dataset by both color code and bar size adjustments per their higher or lower values, allowing the figures to be qualified for publication and presentation purposes. It provides a rapid analysis method that would liberate researchers from labor-intensive and time-consuming manual or coding basis data analysis. A coding example with proper code annotations and completed user guidance is provided for undergraduate and postgraduate students to learn coding basis statistical data analysis and to help them utilize such techniques for their future research. |
format | Online Article Text |
id | pubmed-9698950 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-96989502022-11-26 A Coding Basis and Three-in-One Integrated Data Visualization Method ‘Ana’ for the Rapid Analysis of Multidimensional Omics Dataset Zhao, Hefei Wang, Selina C. Life (Basel) Technical Note With innovations and advancements in analytical instruments and computer technology, omics studies based on statistical analysis, such as phytochemical omics, oilomics/lipidomics, proteomics, metabolomics, and glycomics, are increasingly popular in the areas of food chemistry and nutrition science. However, a remaining hurdle is the labor-intensive data process because learning coding skills and software operations are usually time-consuming for researchers without coding backgrounds. A MATLAB(®) coding basis and three-in-one integrated method, ‘Ana’, was created for data visualizations and statistical analysis in this work. The program loaded and analyzed an omics dataset from an Excel(®) file with 7 samples * 22 compounds as an example, and output six figures for three types of data visualization, including a 3D heatmap, heatmap hierarchical clustering analysis, and principal component analysis (PCA), in 18 s on a personal computer (PC) with a Windows 10 system and in 20 s on a Mac with a MacOS Monterey system. The code is rapid and efficient to print out high-quality figures up to 150 or 300 dpi. The output figures provide enough contrast to differentiate the omics dataset by both color code and bar size adjustments per their higher or lower values, allowing the figures to be qualified for publication and presentation purposes. It provides a rapid analysis method that would liberate researchers from labor-intensive and time-consuming manual or coding basis data analysis. A coding example with proper code annotations and completed user guidance is provided for undergraduate and postgraduate students to learn coding basis statistical data analysis and to help them utilize such techniques for their future research. MDPI 2022-11-12 /pmc/articles/PMC9698950/ /pubmed/36430999 http://dx.doi.org/10.3390/life12111864 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Technical Note Zhao, Hefei Wang, Selina C. A Coding Basis and Three-in-One Integrated Data Visualization Method ‘Ana’ for the Rapid Analysis of Multidimensional Omics Dataset |
title | A Coding Basis and Three-in-One Integrated Data Visualization Method ‘Ana’ for the Rapid Analysis of Multidimensional Omics Dataset |
title_full | A Coding Basis and Three-in-One Integrated Data Visualization Method ‘Ana’ for the Rapid Analysis of Multidimensional Omics Dataset |
title_fullStr | A Coding Basis and Three-in-One Integrated Data Visualization Method ‘Ana’ for the Rapid Analysis of Multidimensional Omics Dataset |
title_full_unstemmed | A Coding Basis and Three-in-One Integrated Data Visualization Method ‘Ana’ for the Rapid Analysis of Multidimensional Omics Dataset |
title_short | A Coding Basis and Three-in-One Integrated Data Visualization Method ‘Ana’ for the Rapid Analysis of Multidimensional Omics Dataset |
title_sort | coding basis and three-in-one integrated data visualization method ‘ana’ for the rapid analysis of multidimensional omics dataset |
topic | Technical Note |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9698950/ https://www.ncbi.nlm.nih.gov/pubmed/36430999 http://dx.doi.org/10.3390/life12111864 |
work_keys_str_mv | AT zhaohefei acodingbasisandthreeinoneintegrateddatavisualizationmethodanafortherapidanalysisofmultidimensionalomicsdataset AT wangselinac acodingbasisandthreeinoneintegrateddatavisualizationmethodanafortherapidanalysisofmultidimensionalomicsdataset AT zhaohefei codingbasisandthreeinoneintegrateddatavisualizationmethodanafortherapidanalysisofmultidimensionalomicsdataset AT wangselinac codingbasisandthreeinoneintegrateddatavisualizationmethodanafortherapidanalysisofmultidimensionalomicsdataset |