Cargando…

Asymmetric microarray data produces gene lists highly predictive of research literature on multiple cancer types

BACKGROUND: Much of the public access cancer microarray data is asymmetric, belonging to datasets containing no samples from normal tissue. Asymmetric data cannot be used in standard meta-analysis approaches (such as the inverse variance method) to obtain large sample sizes for statistical power enr...

Descripción completa

Detalles Bibliográficos
Autores principales: Dawany, Noor B, Tozeren, Aydin
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2949900/
https://www.ncbi.nlm.nih.gov/pubmed/20875095
http://dx.doi.org/10.1186/1471-2105-11-483
_version_ 1782187607339302912
author Dawany, Noor B
Tozeren, Aydin
author_facet Dawany, Noor B
Tozeren, Aydin
author_sort Dawany, Noor B
collection PubMed
description BACKGROUND: Much of the public access cancer microarray data is asymmetric, belonging to datasets containing no samples from normal tissue. Asymmetric data cannot be used in standard meta-analysis approaches (such as the inverse variance method) to obtain large sample sizes for statistical power enrichment. Noting that plenty of normal tissue microarray samples exist in studies not involving cancer, we investigated the viability and accuracy of an integrated microarray analysis approach based on significance analysis of microarrays (merged SAM) using a collection of data from separate diseased and normal samples. RESULTS: We focused on five solid cancer types (colon, kidney, liver, lung, and pancreas), where available microarray data allowed us to compare meta-analysis and integrated approaches. Our results from the merged SAM significantly overlapped gene lists from the validated inverse-variance method. Both meta-analysis and merged SAM approaches successfully captured the aberrances in the cell cycle that commonly occur in the different cancer types. However, the integrated SAM analysis replicated the known cancer literature (excluding microarray studies) with much more accuracy than the meta-analysis. CONCLUSION: The merged SAM test is a powerful, robust approach for combining data from similar platforms and for analyzing asymmetric datasets, including those with only normal or only cancer samples that cannot be utilized by meta-analysis methods. The integrated SAM approach can also be used in comparing global gene expression between various subtypes of cancer arising from the same tissue.
format Text
id pubmed-2949900
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-29499002010-11-03 Asymmetric microarray data produces gene lists highly predictive of research literature on multiple cancer types Dawany, Noor B Tozeren, Aydin BMC Bioinformatics Methodology Article BACKGROUND: Much of the public access cancer microarray data is asymmetric, belonging to datasets containing no samples from normal tissue. Asymmetric data cannot be used in standard meta-analysis approaches (such as the inverse variance method) to obtain large sample sizes for statistical power enrichment. Noting that plenty of normal tissue microarray samples exist in studies not involving cancer, we investigated the viability and accuracy of an integrated microarray analysis approach based on significance analysis of microarrays (merged SAM) using a collection of data from separate diseased and normal samples. RESULTS: We focused on five solid cancer types (colon, kidney, liver, lung, and pancreas), where available microarray data allowed us to compare meta-analysis and integrated approaches. Our results from the merged SAM significantly overlapped gene lists from the validated inverse-variance method. Both meta-analysis and merged SAM approaches successfully captured the aberrances in the cell cycle that commonly occur in the different cancer types. However, the integrated SAM analysis replicated the known cancer literature (excluding microarray studies) with much more accuracy than the meta-analysis. CONCLUSION: The merged SAM test is a powerful, robust approach for combining data from similar platforms and for analyzing asymmetric datasets, including those with only normal or only cancer samples that cannot be utilized by meta-analysis methods. The integrated SAM approach can also be used in comparing global gene expression between various subtypes of cancer arising from the same tissue. BioMed Central 2010-09-27 /pmc/articles/PMC2949900/ /pubmed/20875095 http://dx.doi.org/10.1186/1471-2105-11-483 Text en Copyright ©2010 Dawany and Tozeren; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Dawany, Noor B
Tozeren, Aydin
Asymmetric microarray data produces gene lists highly predictive of research literature on multiple cancer types
title Asymmetric microarray data produces gene lists highly predictive of research literature on multiple cancer types
title_full Asymmetric microarray data produces gene lists highly predictive of research literature on multiple cancer types
title_fullStr Asymmetric microarray data produces gene lists highly predictive of research literature on multiple cancer types
title_full_unstemmed Asymmetric microarray data produces gene lists highly predictive of research literature on multiple cancer types
title_short Asymmetric microarray data produces gene lists highly predictive of research literature on multiple cancer types
title_sort asymmetric microarray data produces gene lists highly predictive of research literature on multiple cancer types
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2949900/
https://www.ncbi.nlm.nih.gov/pubmed/20875095
http://dx.doi.org/10.1186/1471-2105-11-483
work_keys_str_mv AT dawanynoorb asymmetricmicroarraydataproducesgenelistshighlypredictiveofresearchliteratureonmultiplecancertypes
AT tozerenaydin asymmetricmicroarraydataproducesgenelistshighlypredictiveofresearchliteratureonmultiplecancertypes