Cargando…

Interpretation of omics data analyses

Omics studies attempt to extract meaningful messages from large-scale and high-dimensional data sets by treating the data sets as a whole. The concept of treating data sets as a whole is important in every step of the data-handling procedures: the pre-processing step of data records, the step of sta...

Descripción completa

Detalles Bibliográficos
Autores principales: Yamada, Ryo, Okada, Daigo, Wang, Juan, Basak, Tapati, Koyama, Satoshi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer Singapore 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7728595/
https://www.ncbi.nlm.nih.gov/pubmed/32385339
http://dx.doi.org/10.1038/s10038-020-0763-5
_version_ 1783621307492990976
author Yamada, Ryo
Okada, Daigo
Wang, Juan
Basak, Tapati
Koyama, Satoshi
author_facet Yamada, Ryo
Okada, Daigo
Wang, Juan
Basak, Tapati
Koyama, Satoshi
author_sort Yamada, Ryo
collection PubMed
description Omics studies attempt to extract meaningful messages from large-scale and high-dimensional data sets by treating the data sets as a whole. The concept of treating data sets as a whole is important in every step of the data-handling procedures: the pre-processing step of data records, the step of statistical analyses and machine learning, translation of the outputs into human natural perceptions, and acceptance of the messages with uncertainty. In the pre-processing, the method by which to control the data quality and batch effects are discussed. For the main analyses, the approaches are divided into two types and their basic concepts are discussed. The first type is the evaluation of many items individually, followed by interpretation of individual items in the context of multiple testing and combination. The second type is the extraction of fewer important aspects from the whole data records. The outputs of the main analyses are translated into natural languages with techniques, such as annotation and ontology. The other technique for making the outputs perceptible is visualization. At the end of this review, one of the most important issues in the interpretation of omics data analyses is discussed. Omics studies have a large amount of information in their data sets, and every approach reveals only a very restricted aspect of the whole data sets. The understandable messages from these studies have unavoidable uncertainty.
format Online
Article
Text
id pubmed-7728595
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Springer Singapore
record_format MEDLINE/PubMed
spelling pubmed-77285952020-12-17 Interpretation of omics data analyses Yamada, Ryo Okada, Daigo Wang, Juan Basak, Tapati Koyama, Satoshi J Hum Genet Review Article Omics studies attempt to extract meaningful messages from large-scale and high-dimensional data sets by treating the data sets as a whole. The concept of treating data sets as a whole is important in every step of the data-handling procedures: the pre-processing step of data records, the step of statistical analyses and machine learning, translation of the outputs into human natural perceptions, and acceptance of the messages with uncertainty. In the pre-processing, the method by which to control the data quality and batch effects are discussed. For the main analyses, the approaches are divided into two types and their basic concepts are discussed. The first type is the evaluation of many items individually, followed by interpretation of individual items in the context of multiple testing and combination. The second type is the extraction of fewer important aspects from the whole data records. The outputs of the main analyses are translated into natural languages with techniques, such as annotation and ontology. The other technique for making the outputs perceptible is visualization. At the end of this review, one of the most important issues in the interpretation of omics data analyses is discussed. Omics studies have a large amount of information in their data sets, and every approach reveals only a very restricted aspect of the whole data sets. The understandable messages from these studies have unavoidable uncertainty. Springer Singapore 2020-05-08 2021 /pmc/articles/PMC7728595/ /pubmed/32385339 http://dx.doi.org/10.1038/s10038-020-0763-5 Text en © The Author(s) 2020 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Review Article
Yamada, Ryo
Okada, Daigo
Wang, Juan
Basak, Tapati
Koyama, Satoshi
Interpretation of omics data analyses
title Interpretation of omics data analyses
title_full Interpretation of omics data analyses
title_fullStr Interpretation of omics data analyses
title_full_unstemmed Interpretation of omics data analyses
title_short Interpretation of omics data analyses
title_sort interpretation of omics data analyses
topic Review Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7728595/
https://www.ncbi.nlm.nih.gov/pubmed/32385339
http://dx.doi.org/10.1038/s10038-020-0763-5
work_keys_str_mv AT yamadaryo interpretationofomicsdataanalyses
AT okadadaigo interpretationofomicsdataanalyses
AT wangjuan interpretationofomicsdataanalyses
AT basaktapati interpretationofomicsdataanalyses
AT koyamasatoshi interpretationofomicsdataanalyses