Cargando…

Maximizing biomarker discovery by minimizing gene signatures

BACKGROUND: The use of gene signatures can potentially be of considerable value in the field of clinical diagnosis. However, gene signatures defined with different methods can be quite various even when applied the same disease and the same endpoint. Previous studies have shown that the correct sele...

Descripción completa

Detalles Bibliográficos
Autores principales: Chang, Chang, Wang, Junwei, Zhao, Chen, Fostel, Jennifer, Tong, Weida, Bushel, Pierre R, Deng, Youping, Pusztai, Lajos, Symmans, W Fraser, Shi, Tieliu
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3287502/
https://www.ncbi.nlm.nih.gov/pubmed/22369133
http://dx.doi.org/10.1186/1471-2164-12-S5-S6
_version_ 1782224677834326016
author Chang, Chang
Wang, Junwei
Zhao, Chen
Fostel, Jennifer
Tong, Weida
Bushel, Pierre R
Deng, Youping
Pusztai, Lajos
Symmans, W Fraser
Shi, Tieliu
author_facet Chang, Chang
Wang, Junwei
Zhao, Chen
Fostel, Jennifer
Tong, Weida
Bushel, Pierre R
Deng, Youping
Pusztai, Lajos
Symmans, W Fraser
Shi, Tieliu
author_sort Chang, Chang
collection PubMed
description BACKGROUND: The use of gene signatures can potentially be of considerable value in the field of clinical diagnosis. However, gene signatures defined with different methods can be quite various even when applied the same disease and the same endpoint. Previous studies have shown that the correct selection of subsets of genes from microarray data is key for the accurate classification of disease phenotypes, and a number of methods have been proposed for the purpose. However, these methods refine the subsets by only considering each single feature, and they do not confirm the association between the genes identified in each gene signature and the phenotype of the disease. We proposed an innovative new method termed Minimize Feature's Size (MFS) based on multiple level similarity analyses and association between the genes and disease for breast cancer endpoints by comparing classifier models generated from the second phase of MicroArray Quality Control (MAQC-II), trying to develop effective meta-analysis strategies to transform the MAQC-II signatures into a robust and reliable set of biomarker for clinical applications. RESULTS: We analyzed the similarity of the multiple gene signatures in an endpoint and between the two endpoints of breast cancer at probe and gene levels, the results indicate that disease-related genes can be preferably selected as the components of gene signature, and that the gene signatures for the two endpoints could be interchangeable. The minimized signatures were built at probe level by using MFS for each endpoint. By applying the approach, we generated a much smaller set of gene signature with the similar predictive power compared with those gene signatures from MAQC-II. CONCLUSIONS: Our results indicate that gene signatures of both large and small sizes could perform equally well in clinical applications. Besides, consistency and biological significances can be detected among different gene signatures, reflecting the studying endpoints. New classifiers built with MFS exhibit improved performance with both internal and external validation, suggesting that MFS method generally reduces redundancies for features within gene signatures and improves the performance of the model. Consequently, our strategy will be beneficial for the microarray-based clinical applications.
format Online
Article
Text
id pubmed-3287502
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-32875022012-03-01 Maximizing biomarker discovery by minimizing gene signatures Chang, Chang Wang, Junwei Zhao, Chen Fostel, Jennifer Tong, Weida Bushel, Pierre R Deng, Youping Pusztai, Lajos Symmans, W Fraser Shi, Tieliu BMC Genomics Research Article BACKGROUND: The use of gene signatures can potentially be of considerable value in the field of clinical diagnosis. However, gene signatures defined with different methods can be quite various even when applied the same disease and the same endpoint. Previous studies have shown that the correct selection of subsets of genes from microarray data is key for the accurate classification of disease phenotypes, and a number of methods have been proposed for the purpose. However, these methods refine the subsets by only considering each single feature, and they do not confirm the association between the genes identified in each gene signature and the phenotype of the disease. We proposed an innovative new method termed Minimize Feature's Size (MFS) based on multiple level similarity analyses and association between the genes and disease for breast cancer endpoints by comparing classifier models generated from the second phase of MicroArray Quality Control (MAQC-II), trying to develop effective meta-analysis strategies to transform the MAQC-II signatures into a robust and reliable set of biomarker for clinical applications. RESULTS: We analyzed the similarity of the multiple gene signatures in an endpoint and between the two endpoints of breast cancer at probe and gene levels, the results indicate that disease-related genes can be preferably selected as the components of gene signature, and that the gene signatures for the two endpoints could be interchangeable. The minimized signatures were built at probe level by using MFS for each endpoint. By applying the approach, we generated a much smaller set of gene signature with the similar predictive power compared with those gene signatures from MAQC-II. CONCLUSIONS: Our results indicate that gene signatures of both large and small sizes could perform equally well in clinical applications. Besides, consistency and biological significances can be detected among different gene signatures, reflecting the studying endpoints. New classifiers built with MFS exhibit improved performance with both internal and external validation, suggesting that MFS method generally reduces redundancies for features within gene signatures and improves the performance of the model. Consequently, our strategy will be beneficial for the microarray-based clinical applications. BioMed Central 2011-12-23 /pmc/articles/PMC3287502/ /pubmed/22369133 http://dx.doi.org/10.1186/1471-2164-12-S5-S6 Text en Copyright ©2011 Chang et al. licensee BioMed Central Ltd http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Chang, Chang
Wang, Junwei
Zhao, Chen
Fostel, Jennifer
Tong, Weida
Bushel, Pierre R
Deng, Youping
Pusztai, Lajos
Symmans, W Fraser
Shi, Tieliu
Maximizing biomarker discovery by minimizing gene signatures
title Maximizing biomarker discovery by minimizing gene signatures
title_full Maximizing biomarker discovery by minimizing gene signatures
title_fullStr Maximizing biomarker discovery by minimizing gene signatures
title_full_unstemmed Maximizing biomarker discovery by minimizing gene signatures
title_short Maximizing biomarker discovery by minimizing gene signatures
title_sort maximizing biomarker discovery by minimizing gene signatures
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3287502/
https://www.ncbi.nlm.nih.gov/pubmed/22369133
http://dx.doi.org/10.1186/1471-2164-12-S5-S6
work_keys_str_mv AT changchang maximizingbiomarkerdiscoverybyminimizinggenesignatures
AT wangjunwei maximizingbiomarkerdiscoverybyminimizinggenesignatures
AT zhaochen maximizingbiomarkerdiscoverybyminimizinggenesignatures
AT fosteljennifer maximizingbiomarkerdiscoverybyminimizinggenesignatures
AT tongweida maximizingbiomarkerdiscoverybyminimizinggenesignatures
AT bushelpierrer maximizingbiomarkerdiscoverybyminimizinggenesignatures
AT dengyouping maximizingbiomarkerdiscoverybyminimizinggenesignatures
AT pusztailajos maximizingbiomarkerdiscoverybyminimizinggenesignatures
AT symmanswfraser maximizingbiomarkerdiscoverybyminimizinggenesignatures
AT shitieliu maximizingbiomarkerdiscoverybyminimizinggenesignatures