Cargando…
An empirical workflow for genome-wide single nucleotide polymorphism-based predictive modeling
Technology is constantly evolving, necessitating the development of workflows for efficient use of high-dimensional data. We develop and test an empirical workflow for predictive modeling based on single nucleotide polymorphisms (SNP) from genome-wide association study (GWAS) datasets. To this aim,...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
American Medical Informatics Association
2013
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3814469/ https://www.ncbi.nlm.nih.gov/pubmed/24303297 |
_version_ | 1782289261039452160 |
---|---|
author | Floudas, Charalampos S. Balasubramanian, Jeya Balaji Romkes, Marjorie Gopalakrishnan, Vanathi |
author_facet | Floudas, Charalampos S. Balasubramanian, Jeya Balaji Romkes, Marjorie Gopalakrishnan, Vanathi |
author_sort | Floudas, Charalampos S. |
collection | PubMed |
description | Technology is constantly evolving, necessitating the development of workflows for efficient use of high-dimensional data. We develop and test an empirical workflow for predictive modeling based on single nucleotide polymorphisms (SNP) from genome-wide association study (GWAS) datasets. To this aim, we use as a case study SNP-based prediction of survival for non-small cell lung cancer (NSCLC) with a Bayesian rule learner system (BRL+). Lung cancer is a leading cause of mortality. Standard treatment for early stages of NSCLC is surgery. Adjuvant chemotherapy would be beneficial for patients with early recurrence; consequently, we need models capable of such prediction. This workflow outlines the challenges involved in processing GWAS datasets from one popular platform (Affymetrix®), from the results files of the hybridization experiment to the model construction. Our results show that our workflow is feasible and efficient for processing such data while also yielding SNP based models with high predictive accuracy over cross validation. |
format | Online Article Text |
id | pubmed-3814469 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2013 |
publisher | American Medical Informatics Association |
record_format | MEDLINE/PubMed |
spelling | pubmed-38144692013-12-03 An empirical workflow for genome-wide single nucleotide polymorphism-based predictive modeling Floudas, Charalampos S. Balasubramanian, Jeya Balaji Romkes, Marjorie Gopalakrishnan, Vanathi AMIA Jt Summits Transl Sci Proc Articles Technology is constantly evolving, necessitating the development of workflows for efficient use of high-dimensional data. We develop and test an empirical workflow for predictive modeling based on single nucleotide polymorphisms (SNP) from genome-wide association study (GWAS) datasets. To this aim, we use as a case study SNP-based prediction of survival for non-small cell lung cancer (NSCLC) with a Bayesian rule learner system (BRL+). Lung cancer is a leading cause of mortality. Standard treatment for early stages of NSCLC is surgery. Adjuvant chemotherapy would be beneficial for patients with early recurrence; consequently, we need models capable of such prediction. This workflow outlines the challenges involved in processing GWAS datasets from one popular platform (Affymetrix®), from the results files of the hybridization experiment to the model construction. Our results show that our workflow is feasible and efficient for processing such data while also yielding SNP based models with high predictive accuracy over cross validation. American Medical Informatics Association 2013-03-18 /pmc/articles/PMC3814469/ /pubmed/24303297 Text en ©2013 AMIA - All rights reserved. |
spellingShingle | Articles Floudas, Charalampos S. Balasubramanian, Jeya Balaji Romkes, Marjorie Gopalakrishnan, Vanathi An empirical workflow for genome-wide single nucleotide polymorphism-based predictive modeling |
title | An empirical workflow for genome-wide single nucleotide polymorphism-based predictive modeling |
title_full | An empirical workflow for genome-wide single nucleotide polymorphism-based predictive modeling |
title_fullStr | An empirical workflow for genome-wide single nucleotide polymorphism-based predictive modeling |
title_full_unstemmed | An empirical workflow for genome-wide single nucleotide polymorphism-based predictive modeling |
title_short | An empirical workflow for genome-wide single nucleotide polymorphism-based predictive modeling |
title_sort | empirical workflow for genome-wide single nucleotide polymorphism-based predictive modeling |
topic | Articles |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3814469/ https://www.ncbi.nlm.nih.gov/pubmed/24303297 |
work_keys_str_mv | AT floudascharalamposs anempiricalworkflowforgenomewidesinglenucleotidepolymorphismbasedpredictivemodeling AT balasubramanianjeyabalaji anempiricalworkflowforgenomewidesinglenucleotidepolymorphismbasedpredictivemodeling AT romkesmarjorie anempiricalworkflowforgenomewidesinglenucleotidepolymorphismbasedpredictivemodeling AT gopalakrishnanvanathi anempiricalworkflowforgenomewidesinglenucleotidepolymorphismbasedpredictivemodeling AT floudascharalamposs empiricalworkflowforgenomewidesinglenucleotidepolymorphismbasedpredictivemodeling AT balasubramanianjeyabalaji empiricalworkflowforgenomewidesinglenucleotidepolymorphismbasedpredictivemodeling AT romkesmarjorie empiricalworkflowforgenomewidesinglenucleotidepolymorphismbasedpredictivemodeling AT gopalakrishnanvanathi empiricalworkflowforgenomewidesinglenucleotidepolymorphismbasedpredictivemodeling |