Cargando…

Semi-Supervised Methods to Predict Patient Survival from Gene Expression Data

An important goal of DNA microarray research is to develop tools to diagnose cancer more accurately based on the genetic profile of a tumor. There are several existing techniques in the literature for performing this type of diagnosis. Unfortunately, most of these techniques assume that different su...

Descripción completa

Detalles Bibliográficos
Autores principales: Bair, Eric, Tibshirani, Robert
Formato: Texto
Lenguaje:English
Publicado: Public Library of Science 2004
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC387275/
https://www.ncbi.nlm.nih.gov/pubmed/15094809
http://dx.doi.org/10.1371/journal.pbio.0020108
_version_ 1782121300941078528
author Bair, Eric
Tibshirani, Robert
author_facet Bair, Eric
Tibshirani, Robert
author_sort Bair, Eric
collection PubMed
description An important goal of DNA microarray research is to develop tools to diagnose cancer more accurately based on the genetic profile of a tumor. There are several existing techniques in the literature for performing this type of diagnosis. Unfortunately, most of these techniques assume that different subtypes of cancer are already known to exist. Their utility is limited when such subtypes have not been previously identified. Although methods for identifying such subtypes exist, these methods do not work well for all datasets. It would be desirable to develop a procedure to find such subtypes that is applicable in a wide variety of circumstances. Even if no information is known about possible subtypes of a certain form of cancer, clinical information about the patients, such as their survival time, is often available. In this study, we develop some procedures that utilize both the gene expression data and the clinical data to identify subtypes of cancer and use this knowledge to diagnose future patients. These procedures were successfully applied to several publicly available datasets. We present diagnostic procedures that accurately predict the survival of future patients based on the gene expression profile and survival times of previous patients. This has the potential to be a powerful tool for diagnosing and treating cancer.
format Text
id pubmed-387275
institution National Center for Biotechnology Information
language English
publishDate 2004
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-3872752004-04-15 Semi-Supervised Methods to Predict Patient Survival from Gene Expression Data Bair, Eric Tibshirani, Robert PLoS Biol Research Article An important goal of DNA microarray research is to develop tools to diagnose cancer more accurately based on the genetic profile of a tumor. There are several existing techniques in the literature for performing this type of diagnosis. Unfortunately, most of these techniques assume that different subtypes of cancer are already known to exist. Their utility is limited when such subtypes have not been previously identified. Although methods for identifying such subtypes exist, these methods do not work well for all datasets. It would be desirable to develop a procedure to find such subtypes that is applicable in a wide variety of circumstances. Even if no information is known about possible subtypes of a certain form of cancer, clinical information about the patients, such as their survival time, is often available. In this study, we develop some procedures that utilize both the gene expression data and the clinical data to identify subtypes of cancer and use this knowledge to diagnose future patients. These procedures were successfully applied to several publicly available datasets. We present diagnostic procedures that accurately predict the survival of future patients based on the gene expression profile and survival times of previous patients. This has the potential to be a powerful tool for diagnosing and treating cancer. Public Library of Science 2004-04 2004-04-13 /pmc/articles/PMC387275/ /pubmed/15094809 http://dx.doi.org/10.1371/journal.pbio.0020108 Text en Copyright: © 2004 Bair and Tibshirani. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Bair, Eric
Tibshirani, Robert
Semi-Supervised Methods to Predict Patient Survival from Gene Expression Data
title Semi-Supervised Methods to Predict Patient Survival from Gene Expression Data
title_full Semi-Supervised Methods to Predict Patient Survival from Gene Expression Data
title_fullStr Semi-Supervised Methods to Predict Patient Survival from Gene Expression Data
title_full_unstemmed Semi-Supervised Methods to Predict Patient Survival from Gene Expression Data
title_short Semi-Supervised Methods to Predict Patient Survival from Gene Expression Data
title_sort semi-supervised methods to predict patient survival from gene expression data
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC387275/
https://www.ncbi.nlm.nih.gov/pubmed/15094809
http://dx.doi.org/10.1371/journal.pbio.0020108
work_keys_str_mv AT baireric semisupervisedmethodstopredictpatientsurvivalfromgeneexpressiondata
AT tibshiranirobert semisupervisedmethodstopredictpatientsurvivalfromgeneexpressiondata