Cargando…

Evaluation of fecal mRNA reproducibility via a marginal transformed mixture modeling approach

BACKGROUND: Developing and evaluating new technology that enables researchers to recover gene-expression levels of colonic cells from fecal samples could be key to a non-invasive screening tool for early detection of colon cancer. The current study, to the best of our knowledge, is the first to inve...

Descripción completa

Detalles Bibliográficos
Autores principales: George, Nysia I, Lupton, Joanne R, Turner, Nancy D, Chapkin, Robert S, Davidson, Laurie A, Wang, Naisyin
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2827371/
https://www.ncbi.nlm.nih.gov/pubmed/20055994
http://dx.doi.org/10.1186/1471-2105-11-13
_version_ 1782177928660910080
author George, Nysia I
Lupton, Joanne R
Turner, Nancy D
Chapkin, Robert S
Davidson, Laurie A
Wang, Naisyin
author_facet George, Nysia I
Lupton, Joanne R
Turner, Nancy D
Chapkin, Robert S
Davidson, Laurie A
Wang, Naisyin
author_sort George, Nysia I
collection PubMed
description BACKGROUND: Developing and evaluating new technology that enables researchers to recover gene-expression levels of colonic cells from fecal samples could be key to a non-invasive screening tool for early detection of colon cancer. The current study, to the best of our knowledge, is the first to investigate and report the reproducibility of fecal microarray data. Using the intraclass correlation coefficient (ICC) as a measure of reproducibility and the preliminary analysis of fecal and mucosal data, we assessed the reliability of mixture density estimation and the reproducibility of fecal microarray data. Using Monte Carlo-based methods, we explored whether ICC values should be modeled as a beta-mixture or transformed first and fitted with a normal-mixture. We used outcomes from bootstrapped goodness-of-fit tests to determine which approach is less sensitive toward potential violation of distributional assumptions. RESULTS: The graphical examination of both the distributions of ICC and probit-transformed ICC (PT-ICC) clearly shows that there are two components in the distributions. For ICC measurements, which are between 0 and 1, the practice in literature has been to assume that the data points are from a beta-mixture distribution. Nevertheless, in our study we show that the use of a normal-mixture modeling approach on PT-ICC could provide superior performance. CONCLUSIONS: When modeling ICC values of gene expression levels, using mixture of normals in the probit-transformed (PT) scale is less sensitive toward model mis-specification than using mixture of betas. We show that a biased conclusion could be made if we follow the traditional approach and model the two sets of ICC values using the mixture of betas directly. The problematic estimation arises from the sensitivity of beta-mixtures toward model mis-specification, particularly when there are observations in the neighborhood of the the boundary points, 0 or 1. Since beta-mixture modeling is commonly used in approximating the distribution of measurements between 0 and 1, our findings have important implications beyond the findings of the current study. By using the normal-mixture approach on PT-ICC, we observed the quality of reproducible genes in fecal array data to be comparable to those in mucosal arrays.
format Text
id pubmed-2827371
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-28273712010-02-24 Evaluation of fecal mRNA reproducibility via a marginal transformed mixture modeling approach George, Nysia I Lupton, Joanne R Turner, Nancy D Chapkin, Robert S Davidson, Laurie A Wang, Naisyin BMC Bioinformatics Methodology article BACKGROUND: Developing and evaluating new technology that enables researchers to recover gene-expression levels of colonic cells from fecal samples could be key to a non-invasive screening tool for early detection of colon cancer. The current study, to the best of our knowledge, is the first to investigate and report the reproducibility of fecal microarray data. Using the intraclass correlation coefficient (ICC) as a measure of reproducibility and the preliminary analysis of fecal and mucosal data, we assessed the reliability of mixture density estimation and the reproducibility of fecal microarray data. Using Monte Carlo-based methods, we explored whether ICC values should be modeled as a beta-mixture or transformed first and fitted with a normal-mixture. We used outcomes from bootstrapped goodness-of-fit tests to determine which approach is less sensitive toward potential violation of distributional assumptions. RESULTS: The graphical examination of both the distributions of ICC and probit-transformed ICC (PT-ICC) clearly shows that there are two components in the distributions. For ICC measurements, which are between 0 and 1, the practice in literature has been to assume that the data points are from a beta-mixture distribution. Nevertheless, in our study we show that the use of a normal-mixture modeling approach on PT-ICC could provide superior performance. CONCLUSIONS: When modeling ICC values of gene expression levels, using mixture of normals in the probit-transformed (PT) scale is less sensitive toward model mis-specification than using mixture of betas. We show that a biased conclusion could be made if we follow the traditional approach and model the two sets of ICC values using the mixture of betas directly. The problematic estimation arises from the sensitivity of beta-mixtures toward model mis-specification, particularly when there are observations in the neighborhood of the the boundary points, 0 or 1. Since beta-mixture modeling is commonly used in approximating the distribution of measurements between 0 and 1, our findings have important implications beyond the findings of the current study. By using the normal-mixture approach on PT-ICC, we observed the quality of reproducible genes in fecal array data to be comparable to those in mucosal arrays. BioMed Central 2010-01-07 /pmc/articles/PMC2827371/ /pubmed/20055994 http://dx.doi.org/10.1186/1471-2105-11-13 Text en Copyright ©2010 George et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology article
George, Nysia I
Lupton, Joanne R
Turner, Nancy D
Chapkin, Robert S
Davidson, Laurie A
Wang, Naisyin
Evaluation of fecal mRNA reproducibility via a marginal transformed mixture modeling approach
title Evaluation of fecal mRNA reproducibility via a marginal transformed mixture modeling approach
title_full Evaluation of fecal mRNA reproducibility via a marginal transformed mixture modeling approach
title_fullStr Evaluation of fecal mRNA reproducibility via a marginal transformed mixture modeling approach
title_full_unstemmed Evaluation of fecal mRNA reproducibility via a marginal transformed mixture modeling approach
title_short Evaluation of fecal mRNA reproducibility via a marginal transformed mixture modeling approach
title_sort evaluation of fecal mrna reproducibility via a marginal transformed mixture modeling approach
topic Methodology article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2827371/
https://www.ncbi.nlm.nih.gov/pubmed/20055994
http://dx.doi.org/10.1186/1471-2105-11-13
work_keys_str_mv AT georgenysiai evaluationoffecalmrnareproducibilityviaamarginaltransformedmixturemodelingapproach
AT luptonjoanner evaluationoffecalmrnareproducibilityviaamarginaltransformedmixturemodelingapproach
AT turnernancyd evaluationoffecalmrnareproducibilityviaamarginaltransformedmixturemodelingapproach
AT chapkinroberts evaluationoffecalmrnareproducibilityviaamarginaltransformedmixturemodelingapproach
AT davidsonlauriea evaluationoffecalmrnareproducibilityviaamarginaltransformedmixturemodelingapproach
AT wangnaisyin evaluationoffecalmrnareproducibilityviaamarginaltransformedmixturemodelingapproach