
Large-scale integration of cancer microarray data identifies a robust common cancer signature



Bibliographic Details
Main authors: Xu, Lei; Geman, Donald; Winslow, Raimond L
Format: Text
Language: English
Published: BioMed Central 2007
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1950528/
https://www.ncbi.nlm.nih.gov/pubmed/17663766
http://dx.doi.org/10.1186/1471-2105-8-275
author Xu, Lei
Geman, Donald
Winslow, Raimond L
collection PubMed
description BACKGROUND: There is a continuing need to develop molecular diagnostic tools which complement histopathologic examination to increase the accuracy of cancer diagnosis. DNA microarrays provide a means for measuring gene expression signatures which can then be used as components of genomic-based diagnostic tests to determine the presence of cancer. RESULTS: In this study, we collect and integrate ~ 1500 microarray gene expression profiles from 26 published cancer data sets across 21 major human cancer types. We then apply a statistical method, referred to as the Top-Scoring Pair of Groups (TSPG) classifier, and a repeated random sampling strategy to the integrated training data sets and identify a common cancer signature consisting of 46 genes. These 46 genes are naturally divided into two distinct groups; those in one group are typically expressed less than those in the other group for cancer tissues. Given a new expression profile, the classifier discriminates cancer from normal tissues by ranking the expression values of the 46 genes in the cancer signature and comparing the average ranks of the two groups. This signature is then validated by applying this decision rule to independent test data. CONCLUSION: By combining the TSPG method and repeated random sampling, a robust common cancer signature has been identified from large-scale microarray data integration. Upon further validation, this signature may be useful as a robust and objective diagnostic test for cancer.
format Text
id pubmed-1950528
institution National Center for Biotechnology Information
language English
publishDate 2007
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-19505282007-08-22 Large-scale integration of cancer microarray data identifies a robust common cancer signature Xu, Lei Geman, Donald Winslow, Raimond L BMC Bioinformatics Research Article BACKGROUND: There is a continuing need to develop molecular diagnostic tools which complement histopathologic examination to increase the accuracy of cancer diagnosis. DNA microarrays provide a means for measuring gene expression signatures which can then be used as components of genomic-based diagnostic tests to determine the presence of cancer. RESULTS: In this study, we collect and integrate ~ 1500 microarray gene expression profiles from 26 published cancer data sets across 21 major human cancer types. We then apply a statistical method, referred to as the Top-Scoring Pair of Groups (TSPG) classifier, and a repeated random sampling strategy to the integrated training data sets and identify a common cancer signature consisting of 46 genes. These 46 genes are naturally divided into two distinct groups; those in one group are typically expressed less than those in the other group for cancer tissues. Given a new expression profile, the classifier discriminates cancer from normal tissues by ranking the expression values of the 46 genes in the cancer signature and comparing the average ranks of the two groups. This signature is then validated by applying this decision rule to independent test data. CONCLUSION: By combining the TSPG method and repeated random sampling, a robust common cancer signature has been identified from large-scale microarray data integration. Upon further validation, this signature may be useful as a robust and objective diagnostic test for cancer. BioMed Central 2007-07-30 /pmc/articles/PMC1950528/ /pubmed/17663766 http://dx.doi.org/10.1186/1471-2105-8-275 Text en Copyright © 2007 Xu et al; licensee BioMed Central Ltd. 
http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Xu, Lei
Geman, Donald
Winslow, Raimond L
Large-scale integration of cancer microarray data identifies a robust common cancer signature
title Large-scale integration of cancer microarray data identifies a robust common cancer signature
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1950528/
https://www.ncbi.nlm.nih.gov/pubmed/17663766
http://dx.doi.org/10.1186/1471-2105-8-275
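
Illustrative sketch of the decision rule summarized in the description field: rank the expression values of the signature genes within one profile, then compare the average ranks of the two gene groups. This is a minimal sketch under stated assumptions, not the authors' implementation. The gene names and expression values are made up (the record does not list the 46 genes), and three choices are assumptions of this sketch: the direction of the comparison, giving tied values their mean rank, and labeling a profile "normal" when the two group averages are equal.

```python
from statistics import mean


def average_ranks(values):
    """Return 1-based ranks of `values`, giving tied values their mean rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of tied values starting at position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        tied_rank = (i + j) / 2 + 1  # mean of 1-based positions i+1 .. j+1
        for k in range(i, j + 1):
            ranks[order[k]] = tied_rank
        i = j + 1
    return ranks


def classify(profile, low_group, high_group):
    """Label a profile 'cancer' or 'normal' by comparing group average ranks.

    profile: dict mapping gene name -> expression value.
    low_group / high_group: signature genes assumed to be expressed
    lower / higher in cancer tissue (hypothetical names in this sketch).
    """
    genes = list(low_group) + list(high_group)
    ranks = dict(zip(genes, average_ranks([profile[g] for g in genes])))
    low_mean = mean(ranks[g] for g in low_group)
    high_mean = mean(ranks[g] for g in high_group)
    # Assumed direction: cancer when the "low" group ranks below the "high"
    # group; equal averages fall through to "normal" (also an assumption).
    return "cancer" if low_mean < high_mean else "normal"


# Toy example with made-up gene names and expression values.
LOW = ["geneA", "geneB"]
HIGH = ["geneC", "geneD"]
tumor_like = {"geneA": 1.2, "geneB": 0.8, "geneC": 5.1, "geneD": 4.7}
normal_like = {"geneA": 6.0, "geneB": 5.5, "geneC": 2.0, "geneD": 1.5}
print(classify(tumor_like, LOW, HIGH))
print(classify(normal_like, LOW, HIGH))
```

Because the rule depends only on the relative ordering of values within a single profile, it does not change under any monotone rescaling of that profile, which is one plausible reason a rank-based rule suits data pooled from many separate microarray studies.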