Cargando…

Density distribution of gene expression profiles and evaluation of using maximal information coefficient to identify differentially expressed genes

The hypothesis of data probability density distributions has many effects on the design of a new statistical method. Based on the analysis of a group of real gene expression profiles, this study reveal that the primary density distributions of the real profiles are normal/log-normal and t distributi...

Descripción completa

Detalles Bibliográficos
Autores principales: Liu, Han-Ming, Yang, Dan, Liu, Zhao-Fa, Hu, Sheng-Zhou, Yan, Shen-Hai, He, Xian-Wen
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6636747/
https://www.ncbi.nlm.nih.gov/pubmed/31314810
http://dx.doi.org/10.1371/journal.pone.0219551
_version_ 1783436118162669568
author Liu, Han-Ming
Yang, Dan
Liu, Zhao-Fa
Hu, Sheng-Zhou
Yan, Shen-Hai
He, Xian-Wen
author_facet Liu, Han-Ming
Yang, Dan
Liu, Zhao-Fa
Hu, Sheng-Zhou
Yan, Shen-Hai
He, Xian-Wen
author_sort Liu, Han-Ming
collection PubMed
description The hypothesis of data probability density distributions has many effects on the design of a new statistical method. Based on the analysis of a group of real gene expression profiles, this study reveal that the primary density distributions of the real profiles are normal/log-normal and t distributions, accounting for 80% and 19% respectively. According to these distributions, we generated a series of simulation data to make a more comprehensive assessment for a novel statistical method, maximal information coefficient (MIC). The results show that MIC is not only in the top tier in the overall performance of identifying differentially expressed genes, but also exhibits a better adaptability and an excellent noise immunity in comparison with the existing methods.
format Online
Article
Text
id pubmed-6636747
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-66367472019-07-25 Density distribution of gene expression profiles and evaluation of using maximal information coefficient to identify differentially expressed genes Liu, Han-Ming Yang, Dan Liu, Zhao-Fa Hu, Sheng-Zhou Yan, Shen-Hai He, Xian-Wen PLoS One Research Article The hypothesis of data probability density distributions has many effects on the design of a new statistical method. Based on the analysis of a group of real gene expression profiles, this study reveal that the primary density distributions of the real profiles are normal/log-normal and t distributions, accounting for 80% and 19% respectively. According to these distributions, we generated a series of simulation data to make a more comprehensive assessment for a novel statistical method, maximal information coefficient (MIC). The results show that MIC is not only in the top tier in the overall performance of identifying differentially expressed genes, but also exhibits a better adaptability and an excellent noise immunity in comparison with the existing methods. Public Library of Science 2019-07-17 /pmc/articles/PMC6636747/ /pubmed/31314810 http://dx.doi.org/10.1371/journal.pone.0219551 Text en © 2019 Liu et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Liu, Han-Ming
Yang, Dan
Liu, Zhao-Fa
Hu, Sheng-Zhou
Yan, Shen-Hai
He, Xian-Wen
Density distribution of gene expression profiles and evaluation of using maximal information coefficient to identify differentially expressed genes
title Density distribution of gene expression profiles and evaluation of using maximal information coefficient to identify differentially expressed genes
title_full Density distribution of gene expression profiles and evaluation of using maximal information coefficient to identify differentially expressed genes
title_fullStr Density distribution of gene expression profiles and evaluation of using maximal information coefficient to identify differentially expressed genes
title_full_unstemmed Density distribution of gene expression profiles and evaluation of using maximal information coefficient to identify differentially expressed genes
title_short Density distribution of gene expression profiles and evaluation of using maximal information coefficient to identify differentially expressed genes
title_sort density distribution of gene expression profiles and evaluation of using maximal information coefficient to identify differentially expressed genes
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6636747/
https://www.ncbi.nlm.nih.gov/pubmed/31314810
http://dx.doi.org/10.1371/journal.pone.0219551
work_keys_str_mv AT liuhanming densitydistributionofgeneexpressionprofilesandevaluationofusingmaximalinformationcoefficienttoidentifydifferentiallyexpressedgenes
AT yangdan densitydistributionofgeneexpressionprofilesandevaluationofusingmaximalinformationcoefficienttoidentifydifferentiallyexpressedgenes
AT liuzhaofa densitydistributionofgeneexpressionprofilesandevaluationofusingmaximalinformationcoefficienttoidentifydifferentiallyexpressedgenes
AT hushengzhou densitydistributionofgeneexpressionprofilesandevaluationofusingmaximalinformationcoefficienttoidentifydifferentiallyexpressedgenes
AT yanshenhai densitydistributionofgeneexpressionprofilesandevaluationofusingmaximalinformationcoefficienttoidentifydifferentiallyexpressedgenes
AT hexianwen densitydistributionofgeneexpressionprofilesandevaluationofusingmaximalinformationcoefficienttoidentifydifferentiallyexpressedgenes