Cargando…

An empirical workflow for genome-wide single nucleotide polymorphism-based predictive modeling

Technology is constantly evolving, necessitating the development of workflows for efficient use of high-dimensional data. We develop and test an empirical workflow for predictive modeling based on single nucleotide polymorphisms (SNP) from genome-wide association study (GWAS) datasets. To this aim,...

Descripción completa

Detalles Bibliográficos
Autores principales: Floudas, Charalampos S., Balasubramanian, Jeya Balaji, Romkes, Marjorie, Gopalakrishnan, Vanathi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Medical Informatics Association 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3814469/
https://www.ncbi.nlm.nih.gov/pubmed/24303297
_version_ 1782289261039452160
author Floudas, Charalampos S.
Balasubramanian, Jeya Balaji
Romkes, Marjorie
Gopalakrishnan, Vanathi
author_facet Floudas, Charalampos S.
Balasubramanian, Jeya Balaji
Romkes, Marjorie
Gopalakrishnan, Vanathi
author_sort Floudas, Charalampos S.
collection PubMed
description Technology is constantly evolving, necessitating the development of workflows for efficient use of high-dimensional data. We develop and test an empirical workflow for predictive modeling based on single nucleotide polymorphisms (SNP) from genome-wide association study (GWAS) datasets. To this aim, we use as a case study SNP-based prediction of survival for non-small cell lung cancer (NSCLC) with a Bayesian rule learner system (BRL+). Lung cancer is a leading cause of mortality. Standard treatment for early stages of NSCLC is surgery. Adjuvant chemotherapy would be beneficial for patients with early recurrence; consequently, we need models capable of such prediction. This workflow outlines the challenges involved in processing GWAS datasets from one popular platform (Affymetrix®), from the results files of the hybridization experiment to the model construction. Our results show that our workflow is feasible and efficient for processing such data while also yielding SNP based models with high predictive accuracy over cross validation.
format Online
Article
Text
id pubmed-3814469
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher American Medical Informatics Association
record_format MEDLINE/PubMed
spelling pubmed-38144692013-12-03 An empirical workflow for genome-wide single nucleotide polymorphism-based predictive modeling Floudas, Charalampos S. Balasubramanian, Jeya Balaji Romkes, Marjorie Gopalakrishnan, Vanathi AMIA Jt Summits Transl Sci Proc Articles Technology is constantly evolving, necessitating the development of workflows for efficient use of high-dimensional data. We develop and test an empirical workflow for predictive modeling based on single nucleotide polymorphisms (SNP) from genome-wide association study (GWAS) datasets. To this aim, we use as a case study SNP-based prediction of survival for non-small cell lung cancer (NSCLC) with a Bayesian rule learner system (BRL+). Lung cancer is a leading cause of mortality. Standard treatment for early stages of NSCLC is surgery. Adjuvant chemotherapy would be beneficial for patients with early recurrence; consequently, we need models capable of such prediction. This workflow outlines the challenges involved in processing GWAS datasets from one popular platform (Affymetrix®), from the results files of the hybridization experiment to the model construction. Our results show that our workflow is feasible and efficient for processing such data while also yielding SNP based models with high predictive accuracy over cross validation. American Medical Informatics Association 2013-03-18 /pmc/articles/PMC3814469/ /pubmed/24303297 Text en ©2013 AMIA - All rights reserved.
spellingShingle Articles
Floudas, Charalampos S.
Balasubramanian, Jeya Balaji
Romkes, Marjorie
Gopalakrishnan, Vanathi
An empirical workflow for genome-wide single nucleotide polymorphism-based predictive modeling
title An empirical workflow for genome-wide single nucleotide polymorphism-based predictive modeling
title_full An empirical workflow for genome-wide single nucleotide polymorphism-based predictive modeling
title_fullStr An empirical workflow for genome-wide single nucleotide polymorphism-based predictive modeling
title_full_unstemmed An empirical workflow for genome-wide single nucleotide polymorphism-based predictive modeling
title_short An empirical workflow for genome-wide single nucleotide polymorphism-based predictive modeling
title_sort empirical workflow for genome-wide single nucleotide polymorphism-based predictive modeling
topic Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3814469/
https://www.ncbi.nlm.nih.gov/pubmed/24303297
work_keys_str_mv AT floudascharalamposs anempiricalworkflowforgenomewidesinglenucleotidepolymorphismbasedpredictivemodeling
AT balasubramanianjeyabalaji anempiricalworkflowforgenomewidesinglenucleotidepolymorphismbasedpredictivemodeling
AT romkesmarjorie anempiricalworkflowforgenomewidesinglenucleotidepolymorphismbasedpredictivemodeling
AT gopalakrishnanvanathi anempiricalworkflowforgenomewidesinglenucleotidepolymorphismbasedpredictivemodeling
AT floudascharalamposs empiricalworkflowforgenomewidesinglenucleotidepolymorphismbasedpredictivemodeling
AT balasubramanianjeyabalaji empiricalworkflowforgenomewidesinglenucleotidepolymorphismbasedpredictivemodeling
AT romkesmarjorie empiricalworkflowforgenomewidesinglenucleotidepolymorphismbasedpredictivemodeling
AT gopalakrishnanvanathi empiricalworkflowforgenomewidesinglenucleotidepolymorphismbasedpredictivemodeling