Cargando…
Directional Association Measurement in Contingency Tables: Genomic Case
Analysis of large data sets is currently a major challenge. Strong efforts are being undertaken to tackle this problem by developing new methods or modifying existing ones. The Z association method is a new method for describing directional association in contingency tables. It allows to arbitrarily...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Mary Ann Liebert, Inc., publishers
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6441286/ https://www.ncbi.nlm.nih.gov/pubmed/30562062 http://dx.doi.org/10.1089/cmb.2018.0202 |
_version_ | 1783407527352860672 |
---|---|
author | Piwowar, Monika KuŁaga, Tomasz |
author_facet | Piwowar, Monika KuŁaga, Tomasz |
author_sort | Piwowar, Monika |
collection | PubMed |
description | Analysis of large data sets is currently a major challenge. Strong efforts are being undertaken to tackle this problem by developing new methods or modifying existing ones. The Z association method is a new method for describing directional association in contingency tables. It allows to arbitrarily group categories for each of the two variables, for which the contingency table is analyzed. The Z coefficient was calculated on a sample data set with gene mutations in different cancer types. Results showed some association with both gene mutations and annotation groups. Detailed results obtained for particular cancer types versus particular genes and annotation groups were in line with well-known facts in cancer genomics. The “MEUSassociation” R library allows to analyze the directional association between two categorical variables, and the mutual relationship is summarized in a contingency table, by means of the Z association coefficient. The method implemented in the library allows to compute the standard Z coefficient and to apply it in a case, where all possible singular coefficients Z(A:B) are computed at the same time, giving information of association between particular rows and columns. Investigating the ranked list of the highest singular coefficients allows to reduce the complexity of a large-scale data set. Both the Z coefficient and its R implementation are important tools in categorical data analysis. |
format | Online Article Text |
id | pubmed-6441286 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | Mary Ann Liebert, Inc., publishers |
record_format | MEDLINE/PubMed |
spelling | pubmed-64412862019-04-01 Directional Association Measurement in Contingency Tables: Genomic Case Piwowar, Monika KuŁaga, Tomasz J Comput Biol Research Articles Analysis of large data sets is currently a major challenge. Strong efforts are being undertaken to tackle this problem by developing new methods or modifying existing ones. The Z association method is a new method for describing directional association in contingency tables. It allows to arbitrarily group categories for each of the two variables, for which the contingency table is analyzed. The Z coefficient was calculated on a sample data set with gene mutations in different cancer types. Results showed some association with both gene mutations and annotation groups. Detailed results obtained for particular cancer types versus particular genes and annotation groups were in line with well-known facts in cancer genomics. The “MEUSassociation” R library allows to analyze the directional association between two categorical variables, and the mutual relationship is summarized in a contingency table, by means of the Z association coefficient. The method implemented in the library allows to compute the standard Z coefficient and to apply it in a case, where all possible singular coefficients Z(A:B) are computed at the same time, giving information of association between particular rows and columns. Investigating the ranked list of the highest singular coefficients allows to reduce the complexity of a large-scale data set. Both the Z coefficient and its R implementation are important tools in categorical data analysis. Mary Ann Liebert, Inc., publishers 2019-03-01 2019-03-06 /pmc/articles/PMC6441286/ /pubmed/30562062 http://dx.doi.org/10.1089/cmb.2018.0202 Text en © Monika Piwowar and Tomasz Kulaga, 2018. Published by Mary Ann Liebert, Inc. This Open Access article is distributed under the terms of the Creative Commons License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. |
spellingShingle | Research Articles Piwowar, Monika KuŁaga, Tomasz Directional Association Measurement in Contingency Tables: Genomic Case |
title | Directional Association Measurement in Contingency Tables: Genomic Case |
title_full | Directional Association Measurement in Contingency Tables: Genomic Case |
title_fullStr | Directional Association Measurement in Contingency Tables: Genomic Case |
title_full_unstemmed | Directional Association Measurement in Contingency Tables: Genomic Case |
title_short | Directional Association Measurement in Contingency Tables: Genomic Case |
title_sort | directional association measurement in contingency tables: genomic case |
topic | Research Articles |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6441286/ https://www.ncbi.nlm.nih.gov/pubmed/30562062 http://dx.doi.org/10.1089/cmb.2018.0202 |
work_keys_str_mv | AT piwowarmonika directionalassociationmeasurementincontingencytablesgenomiccase AT kułagatomasz directionalassociationmeasurementincontingencytablesgenomiccase |