Cargando…

A New Distribution Family for Microarray Data †

The traditional approach with microarray data has been to apply transformations that approximately normalize them, with the drawback of losing the original scale. The alternative standpoint taken here is to search for models that fit the data, characterized by the presence of negative values, preser...

Descripción completa

Detalles Bibliográficos
Autores principales: Kelmansky, Diana Mabel, Ricci, Lila
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5374365/
https://www.ncbi.nlm.nih.gov/pubmed/28208652
http://dx.doi.org/10.3390/microarrays6010005
_version_ 1782518874212663296
author Kelmansky, Diana Mabel
Ricci, Lila
author_facet Kelmansky, Diana Mabel
Ricci, Lila
author_sort Kelmansky, Diana Mabel
collection PubMed
description The traditional approach with microarray data has been to apply transformations that approximately normalize them, with the drawback of losing the original scale. The alternative standpoint taken here is to search for models that fit the data, characterized by the presence of negative values, preserving their scale; one advantage of this strategy is that it facilitates a direct interpretation of the results. A new family of distributions named gpower-normal indexed by [Formula: see text] is introduced and it is proven that these variables become normal or truncated normal when a suitable gpower transformation is applied. Expressions are given for moments and quantiles, in terms of the truncated normal density. This new family can be used to model asymmetric data that include non-positive values, as required for microarray analysis. Moreover, it has been proven that the gpower-normal family is a special case of pseudo-dispersion models, inheriting all the good properties of these models, such as asymptotic normality for small variances. A combined maximum likelihood method is proposed to estimate the model parameters, and it is applied to microarray and contamination data. R codes are available from the authors upon request.
format Online
Article
Text
id pubmed-5374365
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-53743652017-04-10 A New Distribution Family for Microarray Data † Kelmansky, Diana Mabel Ricci, Lila Microarrays (Basel) Article The traditional approach with microarray data has been to apply transformations that approximately normalize them, with the drawback of losing the original scale. The alternative standpoint taken here is to search for models that fit the data, characterized by the presence of negative values, preserving their scale; one advantage of this strategy is that it facilitates a direct interpretation of the results. A new family of distributions named gpower-normal indexed by [Formula: see text] is introduced and it is proven that these variables become normal or truncated normal when a suitable gpower transformation is applied. Expressions are given for moments and quantiles, in terms of the truncated normal density. This new family can be used to model asymmetric data that include non-positive values, as required for microarray analysis. Moreover, it has been proven that the gpower-normal family is a special case of pseudo-dispersion models, inheriting all the good properties of these models, such as asymptotic normality for small variances. A combined maximum likelihood method is proposed to estimate the model parameters, and it is applied to microarray and contamination data. R codes are available from the authors upon request. MDPI 2017-02-10 /pmc/articles/PMC5374365/ /pubmed/28208652 http://dx.doi.org/10.3390/microarrays6010005 Text en © 2017 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Kelmansky, Diana Mabel
Ricci, Lila
A New Distribution Family for Microarray Data †
title A New Distribution Family for Microarray Data †
title_full A New Distribution Family for Microarray Data †
title_fullStr A New Distribution Family for Microarray Data †
title_full_unstemmed A New Distribution Family for Microarray Data †
title_short A New Distribution Family for Microarray Data †
title_sort new distribution family for microarray data †
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5374365/
https://www.ncbi.nlm.nih.gov/pubmed/28208652
http://dx.doi.org/10.3390/microarrays6010005
work_keys_str_mv AT kelmanskydianamabel anewdistributionfamilyformicroarraydata
AT riccilila anewdistributionfamilyformicroarraydata
AT kelmanskydianamabel newdistributionfamilyformicroarraydata
AT riccilila newdistributionfamilyformicroarraydata