Cargando…
Computational methods using genome-wide association studies to predict radiotherapy complications and to identify correlative molecular processes
The biological cause of clinically observed variability of normal tissue damage following radiotherapy is poorly understood. We hypothesized that machine/statistical learning methods using single nucleotide polymorphism (SNP)-based genome-wide association studies (GWAS) would identify groups of pati...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5324069/ https://www.ncbi.nlm.nih.gov/pubmed/28233873 http://dx.doi.org/10.1038/srep43381 |
_version_ | 1782510147737747456 |
---|---|
author | Oh, Jung Hun Kerns, Sarah Ostrer, Harry Powell, Simon N. Rosenstein, Barry Deasy, Joseph O. |
author_facet | Oh, Jung Hun Kerns, Sarah Ostrer, Harry Powell, Simon N. Rosenstein, Barry Deasy, Joseph O. |
author_sort | Oh, Jung Hun |
collection | PubMed |
description | The biological cause of clinically observed variability of normal tissue damage following radiotherapy is poorly understood. We hypothesized that machine/statistical learning methods using single nucleotide polymorphism (SNP)-based genome-wide association studies (GWAS) would identify groups of patients of differing complication risk, and furthermore could be used to identify key biological sources of variability. We developed a novel learning algorithm, called pre-conditioned random forest regression (PRFR), to construct polygenic risk models using hundreds of SNPs, thereby capturing genomic features that confer small differential risk. Predictive models were trained and validated on a cohort of 368 prostate cancer patients for two post-radiotherapy clinical endpoints: late rectal bleeding and erectile dysfunction. The proposed method results in better predictive performance compared with existing computational methods. Gene ontology enrichment analysis and protein-protein interaction network analysis are used to identify key biological processes and proteins that were plausible based on other published studies. In conclusion, we confirm that novel machine learning methods can produce large predictive models (hundreds of SNPs), yielding clinically useful risk stratification models, as well as identifying important underlying biological processes in the radiation damage and tissue repair process. The methods are generally applicable to GWAS data and are not specific to radiotherapy endpoints. |
format | Online Article Text |
id | pubmed-5324069 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | Nature Publishing Group |
record_format | MEDLINE/PubMed |
spelling | pubmed-53240692017-03-01 Computational methods using genome-wide association studies to predict radiotherapy complications and to identify correlative molecular processes Oh, Jung Hun Kerns, Sarah Ostrer, Harry Powell, Simon N. Rosenstein, Barry Deasy, Joseph O. Sci Rep Article The biological cause of clinically observed variability of normal tissue damage following radiotherapy is poorly understood. We hypothesized that machine/statistical learning methods using single nucleotide polymorphism (SNP)-based genome-wide association studies (GWAS) would identify groups of patients of differing complication risk, and furthermore could be used to identify key biological sources of variability. We developed a novel learning algorithm, called pre-conditioned random forest regression (PRFR), to construct polygenic risk models using hundreds of SNPs, thereby capturing genomic features that confer small differential risk. Predictive models were trained and validated on a cohort of 368 prostate cancer patients for two post-radiotherapy clinical endpoints: late rectal bleeding and erectile dysfunction. The proposed method results in better predictive performance compared with existing computational methods. Gene ontology enrichment analysis and protein-protein interaction network analysis are used to identify key biological processes and proteins that were plausible based on other published studies. In conclusion, we confirm that novel machine learning methods can produce large predictive models (hundreds of SNPs), yielding clinically useful risk stratification models, as well as identifying important underlying biological processes in the radiation damage and tissue repair process. The methods are generally applicable to GWAS data and are not specific to radiotherapy endpoints. Nature Publishing Group 2017-02-24 /pmc/articles/PMC5324069/ /pubmed/28233873 http://dx.doi.org/10.1038/srep43381 Text en Copyright © 2017, The Author(s) http://creativecommons.org/licenses/by/4.0/ This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ |
spellingShingle | Article Oh, Jung Hun Kerns, Sarah Ostrer, Harry Powell, Simon N. Rosenstein, Barry Deasy, Joseph O. Computational methods using genome-wide association studies to predict radiotherapy complications and to identify correlative molecular processes |
title | Computational methods using genome-wide association studies to predict radiotherapy complications and to identify correlative molecular processes |
title_full | Computational methods using genome-wide association studies to predict radiotherapy complications and to identify correlative molecular processes |
title_fullStr | Computational methods using genome-wide association studies to predict radiotherapy complications and to identify correlative molecular processes |
title_full_unstemmed | Computational methods using genome-wide association studies to predict radiotherapy complications and to identify correlative molecular processes |
title_short | Computational methods using genome-wide association studies to predict radiotherapy complications and to identify correlative molecular processes |
title_sort | computational methods using genome-wide association studies to predict radiotherapy complications and to identify correlative molecular processes |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5324069/ https://www.ncbi.nlm.nih.gov/pubmed/28233873 http://dx.doi.org/10.1038/srep43381 |
work_keys_str_mv | AT ohjunghun computationalmethodsusinggenomewideassociationstudiestopredictradiotherapycomplicationsandtoidentifycorrelativemolecularprocesses AT kernssarah computationalmethodsusinggenomewideassociationstudiestopredictradiotherapycomplicationsandtoidentifycorrelativemolecularprocesses AT ostrerharry computationalmethodsusinggenomewideassociationstudiestopredictradiotherapycomplicationsandtoidentifycorrelativemolecularprocesses AT powellsimonn computationalmethodsusinggenomewideassociationstudiestopredictradiotherapycomplicationsandtoidentifycorrelativemolecularprocesses AT rosensteinbarry computationalmethodsusinggenomewideassociationstudiestopredictradiotherapycomplicationsandtoidentifycorrelativemolecularprocesses AT deasyjosepho computationalmethodsusinggenomewideassociationstudiestopredictradiotherapycomplicationsandtoidentifycorrelativemolecularprocesses |