Cargando…

Approaches to reduce false positives and false negatives in the analysis of microarray data: applications in type 1 diabetes research

BACKGROUND: As studies of molecular biology system attempt to achieve a comprehensive understanding of a particular system, Type 1 errors may be a significant problem. However, few investigators are inclined to accept the increase in Type 2 errors (false positives) that may result when less stringen...

Descripción completa

Detalles Bibliográficos
Autores principales:	Wu, Jian, Lenchik, Nataliya I, Gerling, Ivan C
Formato:	Texto
Lenguaje:	English
Publicado:	BioMed Central 2008
Materias:	Research
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2559876/ https://www.ncbi.nlm.nih.gov/pubmed/18831777 http://dx.doi.org/10.1186/1471-2164-9-S2-S12

_version_	1782159684175659008
author	Wu, Jian Lenchik, Nataliya I Gerling, Ivan C
author_facet	Wu, Jian Lenchik, Nataliya I Gerling, Ivan C
author_sort	Wu, Jian
collection	PubMed
description	BACKGROUND: As studies of molecular biology system attempt to achieve a comprehensive understanding of a particular system, Type 1 errors may be a significant problem. However, few investigators are inclined to accept the increase in Type 2 errors (false positives) that may result when less stringent statistical cut-off values are used. To address this dilemma, we developed an analysis strategy that used a stringent statistical analysis to create a list of differentially expressed genes that served as "bait" to "fish out" other genes with similar patterns of expression. RESULTS: Comparing two strains of mice (NOD and C57Bl/6), we identified 93 genes with statistically significant differences in their patterns of expression. Hierarchical clustering identified an additional 39 genes with similar patterns of expression differences between the two strains. Pathway analysis was then employed: 1) identify the central genes and define biological processes that may be regulated by the genes identified, and 2) identify genes on the lists that could not be connected to each other in pathways (potential false positives). For networks created by both gene lists, the most connected (central) genes were interferon gamma (IFN-γ) and tumor necrosis factor alpha (TNF-α). These two cytokines are relevant to the biological differences between the two strains of mice. Furthermore, the network created by the list of 39 genes also suggested other biological differences between the strains. CONCLUSION: Taken together, these data demonstrate how stringent statistical analysis, combined with hierarchical clustering and pathway analysis may offer deeper insight into the biological processes reflected from a set of expression array data. This approach allows us to 'recapture" false negative genes that otherwise would have been missed by the statistical analysis.
format	Text
id	pubmed-2559876
institution	National Center for Biotechnology Information
language	English
publishDate	2008
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-25598762008-10-04 Approaches to reduce false positives and false negatives in the analysis of microarray data: applications in type 1 diabetes research Wu, Jian Lenchik, Nataliya I Gerling, Ivan C BMC Genomics Research BACKGROUND: As studies of molecular biology system attempt to achieve a comprehensive understanding of a particular system, Type 1 errors may be a significant problem. However, few investigators are inclined to accept the increase in Type 2 errors (false positives) that may result when less stringent statistical cut-off values are used. To address this dilemma, we developed an analysis strategy that used a stringent statistical analysis to create a list of differentially expressed genes that served as "bait" to "fish out" other genes with similar patterns of expression. RESULTS: Comparing two strains of mice (NOD and C57Bl/6), we identified 93 genes with statistically significant differences in their patterns of expression. Hierarchical clustering identified an additional 39 genes with similar patterns of expression differences between the two strains. Pathway analysis was then employed: 1) identify the central genes and define biological processes that may be regulated by the genes identified, and 2) identify genes on the lists that could not be connected to each other in pathways (potential false positives). For networks created by both gene lists, the most connected (central) genes were interferon gamma (IFN-γ) and tumor necrosis factor alpha (TNF-α). These two cytokines are relevant to the biological differences between the two strains of mice. Furthermore, the network created by the list of 39 genes also suggested other biological differences between the strains. CONCLUSION: Taken together, these data demonstrate how stringent statistical analysis, combined with hierarchical clustering and pathway analysis may offer deeper insight into the biological processes reflected from a set of expression array data. This approach allows us to 'recapture" false negative genes that otherwise would have been missed by the statistical analysis. BioMed Central 2008-09-16 /pmc/articles/PMC2559876/ /pubmed/18831777 http://dx.doi.org/10.1186/1471-2164-9-S2-S12 Text en Copyright © 2008 Wu et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Research Wu, Jian Lenchik, Nataliya I Gerling, Ivan C Approaches to reduce false positives and false negatives in the analysis of microarray data: applications in type 1 diabetes research
title	Approaches to reduce false positives and false negatives in the analysis of microarray data: applications in type 1 diabetes research
title_full	Approaches to reduce false positives and false negatives in the analysis of microarray data: applications in type 1 diabetes research
title_fullStr	Approaches to reduce false positives and false negatives in the analysis of microarray data: applications in type 1 diabetes research
title_full_unstemmed	Approaches to reduce false positives and false negatives in the analysis of microarray data: applications in type 1 diabetes research
title_short	Approaches to reduce false positives and false negatives in the analysis of microarray data: applications in type 1 diabetes research
title_sort	approaches to reduce false positives and false negatives in the analysis of microarray data: applications in type 1 diabetes research
topic	Research
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2559876/ https://www.ncbi.nlm.nih.gov/pubmed/18831777 http://dx.doi.org/10.1186/1471-2164-9-S2-S12
work_keys_str_mv	AT wujian approachestoreducefalsepositivesandfalsenegativesintheanalysisofmicroarraydataapplicationsintype1diabetesresearch AT lenchiknataliyai approachestoreducefalsepositivesandfalsenegativesintheanalysisofmicroarraydataapplicationsintype1diabetesresearch AT gerlingivanc approachestoreducefalsepositivesandfalsenegativesintheanalysisofmicroarraydataapplicationsintype1diabetesresearch

Approaches to reduce false positives and false negatives in the analysis of microarray data: applications in type 1 diabetes research

Ejemplares similares