Cargando…

Regularization and grouping -omics data by GCA method: A transcriptomic case

The paper presents the application of Grade Correspondence Analysis (GCA) and Grade Correspondence Cluster Analysis (GCCA) for ordering and grouping -omics datasets, using transcriptomic data as an example. Based on gene expression data describing 256 patients with Multiple Myeloma it was shown that...

Descripción completa

Detalles Bibliográficos
Autores principales: Piwowar, Monika, Kocemba-Pilarczyk, Kinga A., Piwowar, Piotr
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6211732/
https://www.ncbi.nlm.nih.gov/pubmed/30383819
http://dx.doi.org/10.1371/journal.pone.0206608
_version_ 1783367397601705984
author Piwowar, Monika
Kocemba-Pilarczyk, Kinga A.
Piwowar, Piotr
author_facet Piwowar, Monika
Kocemba-Pilarczyk, Kinga A.
Piwowar, Piotr
author_sort Piwowar, Monika
collection PubMed
description The paper presents the application of Grade Correspondence Analysis (GCA) and Grade Correspondence Cluster Analysis (GCCA) for ordering and grouping -omics datasets, using transcriptomic data as an example. Based on gene expression data describing 256 patients with Multiple Myeloma it was shown that the GCA method could be used to find regularities in the analyzed collections and to create characteristic gene expression profiles for individual groups of patients. GCA iteratively permutes rows and columns to maximize the tau-Kendall or rho-Spearman coefficients, which makes it possible to arrange rows and columns in such a way that the most similar ones remain in each other’s neighbourhood. In this way, the GCA algorithm highlights regularities in the data matrix. The ranked data can then be grouped using the GCCA method, and after that aggregated in clusters, providing a representation that is easier to analyze–especially in the case of large sets of gene expression profiles. Regularization of transcriptomic data, which is presented in this manuscript, has enabled division of the data set into column clusters (representing genes) and row clusters (representing patients). Subsequently, rows were aggregated (based on medians) to visualise the gene expression profiles for patients with Multiple Myeloma in each collection. The presented analysis became the starting point for characterisation of differentiated genes and biochemical processes in which they are involved. GCA analysis may provide an alternative analytical method to support differentiation and analysis of gene expression profiles characterising individual groups of patients.
format Online
Article
Text
id pubmed-6211732
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-62117322018-11-19 Regularization and grouping -omics data by GCA method: A transcriptomic case Piwowar, Monika Kocemba-Pilarczyk, Kinga A. Piwowar, Piotr PLoS One Research Article The paper presents the application of Grade Correspondence Analysis (GCA) and Grade Correspondence Cluster Analysis (GCCA) for ordering and grouping -omics datasets, using transcriptomic data as an example. Based on gene expression data describing 256 patients with Multiple Myeloma it was shown that the GCA method could be used to find regularities in the analyzed collections and to create characteristic gene expression profiles for individual groups of patients. GCA iteratively permutes rows and columns to maximize the tau-Kendall or rho-Spearman coefficients, which makes it possible to arrange rows and columns in such a way that the most similar ones remain in each other’s neighbourhood. In this way, the GCA algorithm highlights regularities in the data matrix. The ranked data can then be grouped using the GCCA method, and after that aggregated in clusters, providing a representation that is easier to analyze–especially in the case of large sets of gene expression profiles. Regularization of transcriptomic data, which is presented in this manuscript, has enabled division of the data set into column clusters (representing genes) and row clusters (representing patients). Subsequently, rows were aggregated (based on medians) to visualise the gene expression profiles for patients with Multiple Myeloma in each collection. The presented analysis became the starting point for characterisation of differentiated genes and biochemical processes in which they are involved. GCA analysis may provide an alternative analytical method to support differentiation and analysis of gene expression profiles characterising individual groups of patients. Public Library of Science 2018-11-01 /pmc/articles/PMC6211732/ /pubmed/30383819 http://dx.doi.org/10.1371/journal.pone.0206608 Text en © 2018 Piwowar et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Piwowar, Monika
Kocemba-Pilarczyk, Kinga A.
Piwowar, Piotr
Regularization and grouping -omics data by GCA method: A transcriptomic case
title Regularization and grouping -omics data by GCA method: A transcriptomic case
title_full Regularization and grouping -omics data by GCA method: A transcriptomic case
title_fullStr Regularization and grouping -omics data by GCA method: A transcriptomic case
title_full_unstemmed Regularization and grouping -omics data by GCA method: A transcriptomic case
title_short Regularization and grouping -omics data by GCA method: A transcriptomic case
title_sort regularization and grouping -omics data by gca method: a transcriptomic case
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6211732/
https://www.ncbi.nlm.nih.gov/pubmed/30383819
http://dx.doi.org/10.1371/journal.pone.0206608
work_keys_str_mv AT piwowarmonika regularizationandgroupingomicsdatabygcamethodatranscriptomiccase
AT kocembapilarczykkingaa regularizationandgroupingomicsdatabygcamethodatranscriptomiccase
AT piwowarpiotr regularizationandgroupingomicsdatabygcamethodatranscriptomiccase