Cargando…
Interpretation of omics data analyses
Omics studies attempt to extract meaningful messages from large-scale and high-dimensional data sets by treating the data sets as a whole. The concept of treating data sets as a whole is important in every step of the data-handling procedures: the pre-processing step of data records, the step of sta...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Springer Singapore
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7728595/ https://www.ncbi.nlm.nih.gov/pubmed/32385339 http://dx.doi.org/10.1038/s10038-020-0763-5 |
_version_ | 1783621307492990976 |
---|---|
author | Yamada, Ryo Okada, Daigo Wang, Juan Basak, Tapati Koyama, Satoshi |
author_facet | Yamada, Ryo Okada, Daigo Wang, Juan Basak, Tapati Koyama, Satoshi |
author_sort | Yamada, Ryo |
collection | PubMed |
description | Omics studies attempt to extract meaningful messages from large-scale and high-dimensional data sets by treating the data sets as a whole. The concept of treating data sets as a whole is important in every step of the data-handling procedures: the pre-processing step of data records, the step of statistical analyses and machine learning, translation of the outputs into human natural perceptions, and acceptance of the messages with uncertainty. In the pre-processing, the method by which to control the data quality and batch effects are discussed. For the main analyses, the approaches are divided into two types and their basic concepts are discussed. The first type is the evaluation of many items individually, followed by interpretation of individual items in the context of multiple testing and combination. The second type is the extraction of fewer important aspects from the whole data records. The outputs of the main analyses are translated into natural languages with techniques, such as annotation and ontology. The other technique for making the outputs perceptible is visualization. At the end of this review, one of the most important issues in the interpretation of omics data analyses is discussed. Omics studies have a large amount of information in their data sets, and every approach reveals only a very restricted aspect of the whole data sets. The understandable messages from these studies have unavoidable uncertainty. |
format | Online Article Text |
id | pubmed-7728595 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Springer Singapore |
record_format | MEDLINE/PubMed |
spelling | pubmed-77285952020-12-17 Interpretation of omics data analyses Yamada, Ryo Okada, Daigo Wang, Juan Basak, Tapati Koyama, Satoshi J Hum Genet Review Article Omics studies attempt to extract meaningful messages from large-scale and high-dimensional data sets by treating the data sets as a whole. The concept of treating data sets as a whole is important in every step of the data-handling procedures: the pre-processing step of data records, the step of statistical analyses and machine learning, translation of the outputs into human natural perceptions, and acceptance of the messages with uncertainty. In the pre-processing, the method by which to control the data quality and batch effects are discussed. For the main analyses, the approaches are divided into two types and their basic concepts are discussed. The first type is the evaluation of many items individually, followed by interpretation of individual items in the context of multiple testing and combination. The second type is the extraction of fewer important aspects from the whole data records. The outputs of the main analyses are translated into natural languages with techniques, such as annotation and ontology. The other technique for making the outputs perceptible is visualization. At the end of this review, one of the most important issues in the interpretation of omics data analyses is discussed. Omics studies have a large amount of information in their data sets, and every approach reveals only a very restricted aspect of the whole data sets. The understandable messages from these studies have unavoidable uncertainty. Springer Singapore 2020-05-08 2021 /pmc/articles/PMC7728595/ /pubmed/32385339 http://dx.doi.org/10.1038/s10038-020-0763-5 Text en © The Author(s) 2020 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/. |
spellingShingle | Review Article Yamada, Ryo Okada, Daigo Wang, Juan Basak, Tapati Koyama, Satoshi Interpretation of omics data analyses |
title | Interpretation of omics data analyses |
title_full | Interpretation of omics data analyses |
title_fullStr | Interpretation of omics data analyses |
title_full_unstemmed | Interpretation of omics data analyses |
title_short | Interpretation of omics data analyses |
title_sort | interpretation of omics data analyses |
topic | Review Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7728595/ https://www.ncbi.nlm.nih.gov/pubmed/32385339 http://dx.doi.org/10.1038/s10038-020-0763-5 |
work_keys_str_mv | AT yamadaryo interpretationofomicsdataanalyses AT okadadaigo interpretationofomicsdataanalyses AT wangjuan interpretationofomicsdataanalyses AT basaktapati interpretationofomicsdataanalyses AT koyamasatoshi interpretationofomicsdataanalyses |