Cargando…

Directional Association Measurement in Contingency Tables: Genomic Case

Analysis of large data sets is currently a major challenge. Strong efforts are being undertaken to tackle this problem by developing new methods or modifying existing ones. The Z association method is a new method for describing directional association in contingency tables. It allows to arbitrarily...

Descripción completa

Detalles Bibliográficos
Autores principales: Piwowar, Monika, KuŁaga, Tomasz
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Mary Ann Liebert, Inc., publishers 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6441286/
https://www.ncbi.nlm.nih.gov/pubmed/30562062
http://dx.doi.org/10.1089/cmb.2018.0202
_version_ 1783407527352860672
author Piwowar, Monika
KuŁaga, Tomasz
author_facet Piwowar, Monika
KuŁaga, Tomasz
author_sort Piwowar, Monika
collection PubMed
description Analysis of large data sets is currently a major challenge. Strong efforts are being undertaken to tackle this problem by developing new methods or modifying existing ones. The Z association method is a new method for describing directional association in contingency tables. It allows to arbitrarily group categories for each of the two variables, for which the contingency table is analyzed. The Z coefficient was calculated on a sample data set with gene mutations in different cancer types. Results showed some association with both gene mutations and annotation groups. Detailed results obtained for particular cancer types versus particular genes and annotation groups were in line with well-known facts in cancer genomics. The “MEUSassociation” R library allows to analyze the directional association between two categorical variables, and the mutual relationship is summarized in a contingency table, by means of the Z association coefficient. The method implemented in the library allows to compute the standard Z coefficient and to apply it in a case, where all possible singular coefficients Z(A:B) are computed at the same time, giving information of association between particular rows and columns. Investigating the ranked list of the highest singular coefficients allows to reduce the complexity of a large-scale data set. Both the Z coefficient and its R implementation are important tools in categorical data analysis.
format Online
Article
Text
id pubmed-6441286
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Mary Ann Liebert, Inc., publishers
record_format MEDLINE/PubMed
spelling pubmed-64412862019-04-01 Directional Association Measurement in Contingency Tables: Genomic Case Piwowar, Monika KuŁaga, Tomasz J Comput Biol Research Articles Analysis of large data sets is currently a major challenge. Strong efforts are being undertaken to tackle this problem by developing new methods or modifying existing ones. The Z association method is a new method for describing directional association in contingency tables. It allows to arbitrarily group categories for each of the two variables, for which the contingency table is analyzed. The Z coefficient was calculated on a sample data set with gene mutations in different cancer types. Results showed some association with both gene mutations and annotation groups. Detailed results obtained for particular cancer types versus particular genes and annotation groups were in line with well-known facts in cancer genomics. The “MEUSassociation” R library allows to analyze the directional association between two categorical variables, and the mutual relationship is summarized in a contingency table, by means of the Z association coefficient. The method implemented in the library allows to compute the standard Z coefficient and to apply it in a case, where all possible singular coefficients Z(A:B) are computed at the same time, giving information of association between particular rows and columns. Investigating the ranked list of the highest singular coefficients allows to reduce the complexity of a large-scale data set. Both the Z coefficient and its R implementation are important tools in categorical data analysis. Mary Ann Liebert, Inc., publishers 2019-03-01 2019-03-06 /pmc/articles/PMC6441286/ /pubmed/30562062 http://dx.doi.org/10.1089/cmb.2018.0202 Text en © Monika Piwowar and Tomasz Kulaga, 2018. Published by Mary Ann Liebert, Inc. This Open Access article is distributed under the terms of the Creative Commons License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited.
spellingShingle Research Articles
Piwowar, Monika
KuŁaga, Tomasz
Directional Association Measurement in Contingency Tables: Genomic Case
title Directional Association Measurement in Contingency Tables: Genomic Case
title_full Directional Association Measurement in Contingency Tables: Genomic Case
title_fullStr Directional Association Measurement in Contingency Tables: Genomic Case
title_full_unstemmed Directional Association Measurement in Contingency Tables: Genomic Case
title_short Directional Association Measurement in Contingency Tables: Genomic Case
title_sort directional association measurement in contingency tables: genomic case
topic Research Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6441286/
https://www.ncbi.nlm.nih.gov/pubmed/30562062
http://dx.doi.org/10.1089/cmb.2018.0202
work_keys_str_mv AT piwowarmonika directionalassociationmeasurementincontingencytablesgenomiccase
AT kułagatomasz directionalassociationmeasurementincontingencytablesgenomiccase