Cargando…

Computational methods using genome-wide association studies to predict radiotherapy complications and to identify correlative molecular processes

The biological cause of clinically observed variability of normal tissue damage following radiotherapy is poorly understood. We hypothesized that machine/statistical learning methods using single nucleotide polymorphism (SNP)-based genome-wide association studies (GWAS) would identify groups of pati...

Descripción completa

Detalles Bibliográficos
Autores principales: Oh, Jung Hun, Kerns, Sarah, Ostrer, Harry, Powell, Simon N., Rosenstein, Barry, Deasy, Joseph O.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5324069/
https://www.ncbi.nlm.nih.gov/pubmed/28233873
http://dx.doi.org/10.1038/srep43381
_version_ 1782510147737747456
author Oh, Jung Hun
Kerns, Sarah
Ostrer, Harry
Powell, Simon N.
Rosenstein, Barry
Deasy, Joseph O.
author_facet Oh, Jung Hun
Kerns, Sarah
Ostrer, Harry
Powell, Simon N.
Rosenstein, Barry
Deasy, Joseph O.
author_sort Oh, Jung Hun
collection PubMed
description The biological cause of clinically observed variability of normal tissue damage following radiotherapy is poorly understood. We hypothesized that machine/statistical learning methods using single nucleotide polymorphism (SNP)-based genome-wide association studies (GWAS) would identify groups of patients of differing complication risk, and furthermore could be used to identify key biological sources of variability. We developed a novel learning algorithm, called pre-conditioned random forest regression (PRFR), to construct polygenic risk models using hundreds of SNPs, thereby capturing genomic features that confer small differential risk. Predictive models were trained and validated on a cohort of 368 prostate cancer patients for two post-radiotherapy clinical endpoints: late rectal bleeding and erectile dysfunction. The proposed method results in better predictive performance compared with existing computational methods. Gene ontology enrichment analysis and protein-protein interaction network analysis are used to identify key biological processes and proteins that were plausible based on other published studies. In conclusion, we confirm that novel machine learning methods can produce large predictive models (hundreds of SNPs), yielding clinically useful risk stratification models, as well as identifying important underlying biological processes in the radiation damage and tissue repair process. The methods are generally applicable to GWAS data and are not specific to radiotherapy endpoints.
format Online
Article
Text
id pubmed-5324069
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Nature Publishing Group
record_format MEDLINE/PubMed
spelling pubmed-53240692017-03-01 Computational methods using genome-wide association studies to predict radiotherapy complications and to identify correlative molecular processes Oh, Jung Hun Kerns, Sarah Ostrer, Harry Powell, Simon N. Rosenstein, Barry Deasy, Joseph O. Sci Rep Article The biological cause of clinically observed variability of normal tissue damage following radiotherapy is poorly understood. We hypothesized that machine/statistical learning methods using single nucleotide polymorphism (SNP)-based genome-wide association studies (GWAS) would identify groups of patients of differing complication risk, and furthermore could be used to identify key biological sources of variability. We developed a novel learning algorithm, called pre-conditioned random forest regression (PRFR), to construct polygenic risk models using hundreds of SNPs, thereby capturing genomic features that confer small differential risk. Predictive models were trained and validated on a cohort of 368 prostate cancer patients for two post-radiotherapy clinical endpoints: late rectal bleeding and erectile dysfunction. The proposed method results in better predictive performance compared with existing computational methods. Gene ontology enrichment analysis and protein-protein interaction network analysis are used to identify key biological processes and proteins that were plausible based on other published studies. In conclusion, we confirm that novel machine learning methods can produce large predictive models (hundreds of SNPs), yielding clinically useful risk stratification models, as well as identifying important underlying biological processes in the radiation damage and tissue repair process. The methods are generally applicable to GWAS data and are not specific to radiotherapy endpoints. Nature Publishing Group 2017-02-24 /pmc/articles/PMC5324069/ /pubmed/28233873 http://dx.doi.org/10.1038/srep43381 Text en Copyright © 2017, The Author(s) http://creativecommons.org/licenses/by/4.0/ This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/
spellingShingle Article
Oh, Jung Hun
Kerns, Sarah
Ostrer, Harry
Powell, Simon N.
Rosenstein, Barry
Deasy, Joseph O.
Computational methods using genome-wide association studies to predict radiotherapy complications and to identify correlative molecular processes
title Computational methods using genome-wide association studies to predict radiotherapy complications and to identify correlative molecular processes
title_full Computational methods using genome-wide association studies to predict radiotherapy complications and to identify correlative molecular processes
title_fullStr Computational methods using genome-wide association studies to predict radiotherapy complications and to identify correlative molecular processes
title_full_unstemmed Computational methods using genome-wide association studies to predict radiotherapy complications and to identify correlative molecular processes
title_short Computational methods using genome-wide association studies to predict radiotherapy complications and to identify correlative molecular processes
title_sort computational methods using genome-wide association studies to predict radiotherapy complications and to identify correlative molecular processes
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5324069/
https://www.ncbi.nlm.nih.gov/pubmed/28233873
http://dx.doi.org/10.1038/srep43381
work_keys_str_mv AT ohjunghun computationalmethodsusinggenomewideassociationstudiestopredictradiotherapycomplicationsandtoidentifycorrelativemolecularprocesses
AT kernssarah computationalmethodsusinggenomewideassociationstudiestopredictradiotherapycomplicationsandtoidentifycorrelativemolecularprocesses
AT ostrerharry computationalmethodsusinggenomewideassociationstudiestopredictradiotherapycomplicationsandtoidentifycorrelativemolecularprocesses
AT powellsimonn computationalmethodsusinggenomewideassociationstudiestopredictradiotherapycomplicationsandtoidentifycorrelativemolecularprocesses
AT rosensteinbarry computationalmethodsusinggenomewideassociationstudiestopredictradiotherapycomplicationsandtoidentifycorrelativemolecularprocesses
AT deasyjosepho computationalmethodsusinggenomewideassociationstudiestopredictradiotherapycomplicationsandtoidentifycorrelativemolecularprocesses