Cargando…

Data Mining for Gene Networks Relevant to Poor Prognosis in Lung Cancer Via Backward-Chaining Rule Induction

We use Backward Chaining Rule Induction (BCRI), a novel data mining method for hypothesizing causative mechanisms, to mine lung cancer gene expression array data for mechanisms that could impact survival. Initially, a supervised learning system is used to generate a prediction model in the form of “...

Descripción completa

Detalles Bibliográficos
Autores principales: Edgerton, Mary E., Fisher, Douglas H., Tang, Lianhong, Frey, Lewis J., Chen, Zhihua
Formato: Texto
Lenguaje:English
Publicado: Libertas Academica 2007
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2312096/
https://www.ncbi.nlm.nih.gov/pubmed/19455237
_version_ 1782152571222228992
author Edgerton, Mary E.
Fisher, Douglas H.
Tang, Lianhong
Frey, Lewis J.
Chen, Zhihua
author_facet Edgerton, Mary E.
Fisher, Douglas H.
Tang, Lianhong
Frey, Lewis J.
Chen, Zhihua
author_sort Edgerton, Mary E.
collection PubMed
description We use Backward Chaining Rule Induction (BCRI), a novel data mining method for hypothesizing causative mechanisms, to mine lung cancer gene expression array data for mechanisms that could impact survival. Initially, a supervised learning system is used to generate a prediction model in the form of “IF <conditions> THEN <outcome>” style rules. Next, each antecedent (i.e. an IF condition) of a previously discovered rule becomes the outcome class for subsequent application of supervised rule induction. This step is repeated until a termination condition is satisfied. “Chains” of rules are created by working backward from an initial condition (e.g. survival status). Through this iterative process of “backward chaining,” BCRI searches for rules that describe plausible gene interactions for subsequent validation. Thus, BCRI is a semi-supervised approach that constrains the search through the vast space of plausible causal mechanisms by using a top-level outcome to kick-start the process. We demonstrate the general BCRI task sequence, how to implement it, the validation process, and how BCRI-rules discovered from lung cancer microarray data can be combined with prior knowledge to generate hypotheses about functional genomics.
format Text
id pubmed-2312096
institution National Center for Biotechnology Information
language English
publishDate 2007
publisher Libertas Academica
record_format MEDLINE/PubMed
spelling pubmed-23120962008-04-16 Data Mining for Gene Networks Relevant to Poor Prognosis in Lung Cancer Via Backward-Chaining Rule Induction Edgerton, Mary E. Fisher, Douglas H. Tang, Lianhong Frey, Lewis J. Chen, Zhihua Cancer Inform Original Research We use Backward Chaining Rule Induction (BCRI), a novel data mining method for hypothesizing causative mechanisms, to mine lung cancer gene expression array data for mechanisms that could impact survival. Initially, a supervised learning system is used to generate a prediction model in the form of “IF <conditions> THEN <outcome>” style rules. Next, each antecedent (i.e. an IF condition) of a previously discovered rule becomes the outcome class for subsequent application of supervised rule induction. This step is repeated until a termination condition is satisfied. “Chains” of rules are created by working backward from an initial condition (e.g. survival status). Through this iterative process of “backward chaining,” BCRI searches for rules that describe plausible gene interactions for subsequent validation. Thus, BCRI is a semi-supervised approach that constrains the search through the vast space of plausible causal mechanisms by using a top-level outcome to kick-start the process. We demonstrate the general BCRI task sequence, how to implement it, the validation process, and how BCRI-rules discovered from lung cancer microarray data can be combined with prior knowledge to generate hypotheses about functional genomics. Libertas Academica 2007-02-10 /pmc/articles/PMC2312096/ /pubmed/19455237 Text en © 2007 The authors. http://creativecommons.org/licenses/by/3.0 This article is an open-access article distributed under the terms and conditions of the Creative Commons Attribution license (http://creativecommons.org/licenses/by/3.0/).
spellingShingle Original Research
Edgerton, Mary E.
Fisher, Douglas H.
Tang, Lianhong
Frey, Lewis J.
Chen, Zhihua
Data Mining for Gene Networks Relevant to Poor Prognosis in Lung Cancer Via Backward-Chaining Rule Induction
title Data Mining for Gene Networks Relevant to Poor Prognosis in Lung Cancer Via Backward-Chaining Rule Induction
title_full Data Mining for Gene Networks Relevant to Poor Prognosis in Lung Cancer Via Backward-Chaining Rule Induction
title_fullStr Data Mining for Gene Networks Relevant to Poor Prognosis in Lung Cancer Via Backward-Chaining Rule Induction
title_full_unstemmed Data Mining for Gene Networks Relevant to Poor Prognosis in Lung Cancer Via Backward-Chaining Rule Induction
title_short Data Mining for Gene Networks Relevant to Poor Prognosis in Lung Cancer Via Backward-Chaining Rule Induction
title_sort data mining for gene networks relevant to poor prognosis in lung cancer via backward-chaining rule induction
topic Original Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2312096/
https://www.ncbi.nlm.nih.gov/pubmed/19455237
work_keys_str_mv AT edgertonmarye dataminingforgenenetworksrelevanttopoorprognosisinlungcancerviabackwardchainingruleinduction
AT fisherdouglash dataminingforgenenetworksrelevanttopoorprognosisinlungcancerviabackwardchainingruleinduction
AT tanglianhong dataminingforgenenetworksrelevanttopoorprognosisinlungcancerviabackwardchainingruleinduction
AT freylewisj dataminingforgenenetworksrelevanttopoorprognosisinlungcancerviabackwardchainingruleinduction
AT chenzhihua dataminingforgenenetworksrelevanttopoorprognosisinlungcancerviabackwardchainingruleinduction