Cargando…
Large-scale integration of cancer microarray data identifies a robust common cancer signature
BACKGROUND: There is a continuing need to develop molecular diagnostic tools which complement histopathologic examination to increase the accuracy of cancer diagnosis. DNA microarrays provide a means for measuring gene expression signatures which can then be used as components of genomic-based diagn...
Autores principales: | , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2007
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1950528/ https://www.ncbi.nlm.nih.gov/pubmed/17663766 http://dx.doi.org/10.1186/1471-2105-8-275 |
_version_ | 1782134556173795328 |
---|---|
author | Xu, Lei Geman, Donald Winslow, Raimond L |
author_facet | Xu, Lei Geman, Donald Winslow, Raimond L |
author_sort | Xu, Lei |
collection | PubMed |
description | BACKGROUND: There is a continuing need to develop molecular diagnostic tools which complement histopathologic examination to increase the accuracy of cancer diagnosis. DNA microarrays provide a means for measuring gene expression signatures which can then be used as components of genomic-based diagnostic tests to determine the presence of cancer. RESULTS: In this study, we collect and integrate ~ 1500 microarray gene expression profiles from 26 published cancer data sets across 21 major human cancer types. We then apply a statistical method, referred to as the Top-Scoring Pair of Groups (TSPG) classifier, and a repeated random sampling strategy to the integrated training data sets and identify a common cancer signature consisting of 46 genes. These 46 genes are naturally divided into two distinct groups; those in one group are typically expressed less than those in the other group for cancer tissues. Given a new expression profile, the classifier discriminates cancer from normal tissues by ranking the expression values of the 46 genes in the cancer signature and comparing the average ranks of the two groups. This signature is then validated by applying this decision rule to independent test data. CONCLUSION: By combining the TSPG method and repeated random sampling, a robust common cancer signature has been identified from large-scale microarray data integration. Upon further validation, this signature may be useful as a robust and objective diagnostic test for cancer. |
format | Text |
id | pubmed-1950528 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2007 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-19505282007-08-22 Large-scale integration of cancer microarray data identifies a robust common cancer signature Xu, Lei Geman, Donald Winslow, Raimond L BMC Bioinformatics Research Article BACKGROUND: There is a continuing need to develop molecular diagnostic tools which complement histopathologic examination to increase the accuracy of cancer diagnosis. DNA microarrays provide a means for measuring gene expression signatures which can then be used as components of genomic-based diagnostic tests to determine the presence of cancer. RESULTS: In this study, we collect and integrate ~ 1500 microarray gene expression profiles from 26 published cancer data sets across 21 major human cancer types. We then apply a statistical method, referred to as the Top-Scoring Pair of Groups (TSPG) classifier, and a repeated random sampling strategy to the integrated training data sets and identify a common cancer signature consisting of 46 genes. These 46 genes are naturally divided into two distinct groups; those in one group are typically expressed less than those in the other group for cancer tissues. Given a new expression profile, the classifier discriminates cancer from normal tissues by ranking the expression values of the 46 genes in the cancer signature and comparing the average ranks of the two groups. This signature is then validated by applying this decision rule to independent test data. CONCLUSION: By combining the TSPG method and repeated random sampling, a robust common cancer signature has been identified from large-scale microarray data integration. Upon further validation, this signature may be useful as a robust and objective diagnostic test for cancer. BioMed Central 2007-07-30 /pmc/articles/PMC1950528/ /pubmed/17663766 http://dx.doi.org/10.1186/1471-2105-8-275 Text en Copyright © 2007 Xu et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Xu, Lei Geman, Donald Winslow, Raimond L Large-scale integration of cancer microarray data identifies a robust common cancer signature |
title | Large-scale integration of cancer microarray data identifies a robust common cancer signature |
title_full | Large-scale integration of cancer microarray data identifies a robust common cancer signature |
title_fullStr | Large-scale integration of cancer microarray data identifies a robust common cancer signature |
title_full_unstemmed | Large-scale integration of cancer microarray data identifies a robust common cancer signature |
title_short | Large-scale integration of cancer microarray data identifies a robust common cancer signature |
title_sort | large-scale integration of cancer microarray data identifies a robust common cancer signature |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1950528/ https://www.ncbi.nlm.nih.gov/pubmed/17663766 http://dx.doi.org/10.1186/1471-2105-8-275 |
work_keys_str_mv | AT xulei largescaleintegrationofcancermicroarraydataidentifiesarobustcommoncancersignature AT gemandonald largescaleintegrationofcancermicroarraydataidentifiesarobustcommoncancersignature AT winslowraimondl largescaleintegrationofcancermicroarraydataidentifiesarobustcommoncancersignature |