Cargando…

Using Pre-existing Microarray Datasets to Increase Experimental Power: Application to Insulin Resistance

Although they have become a widely used experimental technique for identifying differentially expressed (DE) genes, DNA microarrays are notorious for generating noisy data. A common strategy for mitigating the effects of noise is to perform many experimental replicates. This approach is often costly...

Descripción completa

Detalles Bibliográficos
Autores principales: Daigle, Bernie J., Deng, Alicia, McLaughlin, Tracey, Cushman, Samuel W., Cam, Margaret C., Reaven, Gerald, Tsao, Philip S., Altman, Russ B.
Formato: Texto
Lenguaje:English
Publicado: Public Library of Science 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2845644/
https://www.ncbi.nlm.nih.gov/pubmed/20361040
http://dx.doi.org/10.1371/journal.pcbi.1000718
_version_ 1782179424717766656
author Daigle, Bernie J.
Deng, Alicia
McLaughlin, Tracey
Cushman, Samuel W.
Cam, Margaret C.
Reaven, Gerald
Tsao, Philip S.
Altman, Russ B.
author_facet Daigle, Bernie J.
Deng, Alicia
McLaughlin, Tracey
Cushman, Samuel W.
Cam, Margaret C.
Reaven, Gerald
Tsao, Philip S.
Altman, Russ B.
author_sort Daigle, Bernie J.
collection PubMed
description Although they have become a widely used experimental technique for identifying differentially expressed (DE) genes, DNA microarrays are notorious for generating noisy data. A common strategy for mitigating the effects of noise is to perform many experimental replicates. This approach is often costly and sometimes impossible given limited resources; thus, analytical methods are needed which increase accuracy at no additional cost. One inexpensive source of microarray replicates comes from prior work: to date, data from hundreds of thousands of microarray experiments are in the public domain. Although these data assay a wide range of conditions, they cannot be used directly to inform any particular experiment and are thus ignored by most DE gene methods. We present the SVD Augmented Gene expression Analysis Tool (SAGAT), a mathematically principled, data-driven approach for identifying DE genes. SAGAT increases the power of a microarray experiment by using observed coexpression relationships from publicly available microarray datasets to reduce uncertainty in individual genes' expression measurements. We tested the method on three well-replicated human microarray datasets and demonstrate that use of SAGAT increased effective sample sizes by as many as 2.72 arrays. We applied SAGAT to unpublished data from a microarray study investigating transcriptional responses to insulin resistance, resulting in a 50% increase in the number of significant genes detected. We evaluated 11 (58%) of these genes experimentally using qPCR, confirming the directions of expression change for all 11 and statistical significance for three. Use of SAGAT revealed coherent biological changes in three pathways: inflammation, differentiation, and fatty acid synthesis, furthering our molecular understanding of a type 2 diabetes risk factor. We envision SAGAT as a means to maximize the potential for biological discovery from subtle transcriptional responses, and we provide it as a freely available software package that is immediately applicable to any human microarray study.
format Text
id pubmed-2845644
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-28456442010-04-02 Using Pre-existing Microarray Datasets to Increase Experimental Power: Application to Insulin Resistance Daigle, Bernie J. Deng, Alicia McLaughlin, Tracey Cushman, Samuel W. Cam, Margaret C. Reaven, Gerald Tsao, Philip S. Altman, Russ B. PLoS Comput Biol Research Article Although they have become a widely used experimental technique for identifying differentially expressed (DE) genes, DNA microarrays are notorious for generating noisy data. A common strategy for mitigating the effects of noise is to perform many experimental replicates. This approach is often costly and sometimes impossible given limited resources; thus, analytical methods are needed which increase accuracy at no additional cost. One inexpensive source of microarray replicates comes from prior work: to date, data from hundreds of thousands of microarray experiments are in the public domain. Although these data assay a wide range of conditions, they cannot be used directly to inform any particular experiment and are thus ignored by most DE gene methods. We present the SVD Augmented Gene expression Analysis Tool (SAGAT), a mathematically principled, data-driven approach for identifying DE genes. SAGAT increases the power of a microarray experiment by using observed coexpression relationships from publicly available microarray datasets to reduce uncertainty in individual genes' expression measurements. We tested the method on three well-replicated human microarray datasets and demonstrate that use of SAGAT increased effective sample sizes by as many as 2.72 arrays. We applied SAGAT to unpublished data from a microarray study investigating transcriptional responses to insulin resistance, resulting in a 50% increase in the number of significant genes detected. We evaluated 11 (58%) of these genes experimentally using qPCR, confirming the directions of expression change for all 11 and statistical significance for three. Use of SAGAT revealed coherent biological changes in three pathways: inflammation, differentiation, and fatty acid synthesis, furthering our molecular understanding of a type 2 diabetes risk factor. We envision SAGAT as a means to maximize the potential for biological discovery from subtle transcriptional responses, and we provide it as a freely available software package that is immediately applicable to any human microarray study. Public Library of Science 2010-03-26 /pmc/articles/PMC2845644/ /pubmed/20361040 http://dx.doi.org/10.1371/journal.pcbi.1000718 Text en Daigle, Jr. et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Daigle, Bernie J.
Deng, Alicia
McLaughlin, Tracey
Cushman, Samuel W.
Cam, Margaret C.
Reaven, Gerald
Tsao, Philip S.
Altman, Russ B.
Using Pre-existing Microarray Datasets to Increase Experimental Power: Application to Insulin Resistance
title Using Pre-existing Microarray Datasets to Increase Experimental Power: Application to Insulin Resistance
title_full Using Pre-existing Microarray Datasets to Increase Experimental Power: Application to Insulin Resistance
title_fullStr Using Pre-existing Microarray Datasets to Increase Experimental Power: Application to Insulin Resistance
title_full_unstemmed Using Pre-existing Microarray Datasets to Increase Experimental Power: Application to Insulin Resistance
title_short Using Pre-existing Microarray Datasets to Increase Experimental Power: Application to Insulin Resistance
title_sort using pre-existing microarray datasets to increase experimental power: application to insulin resistance
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2845644/
https://www.ncbi.nlm.nih.gov/pubmed/20361040
http://dx.doi.org/10.1371/journal.pcbi.1000718
work_keys_str_mv AT daigleberniej usingpreexistingmicroarraydatasetstoincreaseexperimentalpowerapplicationtoinsulinresistance
AT dengalicia usingpreexistingmicroarraydatasetstoincreaseexperimentalpowerapplicationtoinsulinresistance
AT mclaughlintracey usingpreexistingmicroarraydatasetstoincreaseexperimentalpowerapplicationtoinsulinresistance
AT cushmansamuelw usingpreexistingmicroarraydatasetstoincreaseexperimentalpowerapplicationtoinsulinresistance
AT cammargaretc usingpreexistingmicroarraydatasetstoincreaseexperimentalpowerapplicationtoinsulinresistance
AT reavengerald usingpreexistingmicroarraydatasetstoincreaseexperimentalpowerapplicationtoinsulinresistance
AT tsaophilips usingpreexistingmicroarraydatasetstoincreaseexperimentalpowerapplicationtoinsulinresistance
AT altmanrussb usingpreexistingmicroarraydatasetstoincreaseexperimentalpowerapplicationtoinsulinresistance