Cargando…

Density distribution of gene expression profiles and evaluation of using maximal information coefficient to identify differentially expressed genes

The hypothesis of data probability density distributions has many effects on the design of a new statistical method. Based on the analysis of a group of real gene expression profiles, this study reveal that the primary density distributions of the real profiles are normal/log-normal and t distributi...

Descripción completa

Detalles Bibliográficos
Autores principales: Liu, Han-Ming, Yang, Dan, Liu, Zhao-Fa, Hu, Sheng-Zhou, Yan, Shen-Hai, He, Xian-Wen
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6636747/
https://www.ncbi.nlm.nih.gov/pubmed/31314810
http://dx.doi.org/10.1371/journal.pone.0219551
Descripción
Sumario:The hypothesis of data probability density distributions has many effects on the design of a new statistical method. Based on the analysis of a group of real gene expression profiles, this study reveal that the primary density distributions of the real profiles are normal/log-normal and t distributions, accounting for 80% and 19% respectively. According to these distributions, we generated a series of simulation data to make a more comprehensive assessment for a novel statistical method, maximal information coefficient (MIC). The results show that MIC is not only in the top tier in the overall performance of identifying differentially expressed genes, but also exhibits a better adaptability and an excellent noise immunity in comparison with the existing methods.