Cargando…

Finding consistent disease subnetworks across microarray datasets

BACKGROUND: While contemporary methods of microarray analysis are excellent tools for studying individual microarray datasets, they have a tendency to produce different results from different datasets of the same disease. We aim to solve this reproducibility problem by introducing a technique (SNet)...

Descripción completa

Detalles Bibliográficos
Autores principales:	Soh, Donny, Dong, Difeng, Guo, Yike, Wong, Limsoon
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	BioMed Central 2011
Materias:	Proceedings
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3278831/ https://www.ncbi.nlm.nih.gov/pubmed/22372958 http://dx.doi.org/10.1186/1471-2105-12-S13-S15

_version_	1782223611741863936
author	Soh, Donny Dong, Difeng Guo, Yike Wong, Limsoon
author_facet	Soh, Donny Dong, Difeng Guo, Yike Wong, Limsoon
author_sort	Soh, Donny
collection	PubMed
description	BACKGROUND: While contemporary methods of microarray analysis are excellent tools for studying individual microarray datasets, they have a tendency to produce different results from different datasets of the same disease. We aim to solve this reproducibility problem by introducing a technique (SNet). SNet provides both quantitative and descriptive analysis of microarray datasets by identifying specific connected portions of pathways that are significant. We term such portions within pathways as “subnetworks”. RESULTS: We tested SNet on independent datasets of several diseases, including childhood ALL, DMD and lung cancer. For each of these diseases, we obtained two independent microarray datasets produced by distinct labs on distinct platforms. In each case, our technique consistently produced almost the same list of significant nontrivial subnetworks from two independent sets of microarray data. The gene-level agreement of these significant subnetworks was between 51.18% to 93.01%. In contrast, when the same pairs of microarray datasets were analysed using GSEA, t-test and SAM, this percentage fell between 2.38% to 28.90% for GSEA, 49.60% tp 73.01% for t-test, and 49.96% to 81.25% for SAM. Furthermore, the genes selected using these existing methods did not form subnetworks of substantial size. Thus it is more probable that the subnetworks selected by our technique can provide the researcher with more descriptive information on the portions of the pathway actually affected by the disease. CONCLUSIONS: These results clearly demonstrate that our technique generates significant subnetworks and genes that are more consistent and reproducible across datasets compared to the other popular methods available (GSEA, t-test and SAM). The large size of subnetworks which we generate indicates that they are generally more biologically significant (less likely to be spurious). In addition, we have chosen two sample subnetworks and validated them with references from biological literature. This shows that our algorithm is capable of generating descriptive biologically conclusions.
format	Online Article Text
id	pubmed-3278831
institution	National Center for Biotechnology Information
language	English
publishDate	2011
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-32788312012-02-14 Finding consistent disease subnetworks across microarray datasets Soh, Donny Dong, Difeng Guo, Yike Wong, Limsoon BMC Bioinformatics Proceedings BACKGROUND: While contemporary methods of microarray analysis are excellent tools for studying individual microarray datasets, they have a tendency to produce different results from different datasets of the same disease. We aim to solve this reproducibility problem by introducing a technique (SNet). SNet provides both quantitative and descriptive analysis of microarray datasets by identifying specific connected portions of pathways that are significant. We term such portions within pathways as “subnetworks”. RESULTS: We tested SNet on independent datasets of several diseases, including childhood ALL, DMD and lung cancer. For each of these diseases, we obtained two independent microarray datasets produced by distinct labs on distinct platforms. In each case, our technique consistently produced almost the same list of significant nontrivial subnetworks from two independent sets of microarray data. The gene-level agreement of these significant subnetworks was between 51.18% to 93.01%. In contrast, when the same pairs of microarray datasets were analysed using GSEA, t-test and SAM, this percentage fell between 2.38% to 28.90% for GSEA, 49.60% tp 73.01% for t-test, and 49.96% to 81.25% for SAM. Furthermore, the genes selected using these existing methods did not form subnetworks of substantial size. Thus it is more probable that the subnetworks selected by our technique can provide the researcher with more descriptive information on the portions of the pathway actually affected by the disease. CONCLUSIONS: These results clearly demonstrate that our technique generates significant subnetworks and genes that are more consistent and reproducible across datasets compared to the other popular methods available (GSEA, t-test and SAM). The large size of subnetworks which we generate indicates that they are generally more biologically significant (less likely to be spurious). In addition, we have chosen two sample subnetworks and validated them with references from biological literature. This shows that our algorithm is capable of generating descriptive biologically conclusions. BioMed Central 2011-11-30 /pmc/articles/PMC3278831/ /pubmed/22372958 http://dx.doi.org/10.1186/1471-2105-12-S13-S15 Text en Copyright ©2011 Soh et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Proceedings Soh, Donny Dong, Difeng Guo, Yike Wong, Limsoon Finding consistent disease subnetworks across microarray datasets
title	Finding consistent disease subnetworks across microarray datasets
title_full	Finding consistent disease subnetworks across microarray datasets
title_fullStr	Finding consistent disease subnetworks across microarray datasets
title_full_unstemmed	Finding consistent disease subnetworks across microarray datasets
title_short	Finding consistent disease subnetworks across microarray datasets
title_sort	finding consistent disease subnetworks across microarray datasets
topic	Proceedings
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3278831/ https://www.ncbi.nlm.nih.gov/pubmed/22372958 http://dx.doi.org/10.1186/1471-2105-12-S13-S15
work_keys_str_mv	AT sohdonny findingconsistentdiseasesubnetworksacrossmicroarraydatasets AT dongdifeng findingconsistentdiseasesubnetworksacrossmicroarraydatasets AT guoyike findingconsistentdiseasesubnetworksacrossmicroarraydatasets AT wonglimsoon findingconsistentdiseasesubnetworksacrossmicroarraydatasets

Finding consistent disease subnetworks across microarray datasets

Ejemplares similares