Cargando…

Machine learning on genome-wide association studies to predict the risk of radiation-associated contralateral breast cancer in the WECARE Study

The purpose of this study was to identify germline single nucleotide polymorphisms (SNPs) that optimally predict radiation-associated contralateral breast cancer (RCBC) and to provide new biological insights into the carcinogenic process. Fifty-two women with contralateral breast cancer and 153 wome...

Descripción completa

Detalles Bibliográficos
Autores principales: Lee, Sangkyu, Liang, Xiaolin, Woods, Meghan, Reiner, Anne S., Concannon, Patrick, Bernstein, Leslie, Lynch, Charles F., Boice, John D., Deasy, Joseph O., Bernstein, Jonine L., Oh, Jung Hun
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7046218/
https://www.ncbi.nlm.nih.gov/pubmed/32106268
http://dx.doi.org/10.1371/journal.pone.0226157
_version_ 1783501922946252800
author Lee, Sangkyu
Liang, Xiaolin
Woods, Meghan
Reiner, Anne S.
Concannon, Patrick
Bernstein, Leslie
Lynch, Charles F.
Boice, John D.
Deasy, Joseph O.
Bernstein, Jonine L.
Oh, Jung Hun
author_facet Lee, Sangkyu
Liang, Xiaolin
Woods, Meghan
Reiner, Anne S.
Concannon, Patrick
Bernstein, Leslie
Lynch, Charles F.
Boice, John D.
Deasy, Joseph O.
Bernstein, Jonine L.
Oh, Jung Hun
author_sort Lee, Sangkyu
collection PubMed
description The purpose of this study was to identify germline single nucleotide polymorphisms (SNPs) that optimally predict radiation-associated contralateral breast cancer (RCBC) and to provide new biological insights into the carcinogenic process. Fifty-two women with contralateral breast cancer and 153 women with unilateral breast cancer were identified within the Women’s Environmental Cancer and Radiation Epidemiology (WECARE) Study who were at increased risk of RCBC because they were ≤ 40 years of age at first diagnosis of breast cancer and received a scatter radiation dose > 1 Gy to the contralateral breast. A previously reported algorithm, preconditioned random forest regression, was applied to predict the risk of developing RCBC. The resulting model produced an area under the curve (AUC) of 0.62 (p = 0.04) on hold-out validation data. The biological analysis identified the cyclic AMP-mediated signaling and Ephrin-A as significant biological correlates, which were previously shown to influence cell survival after radiation in an ATM-dependent manner. The key connected genes and proteins that are identified in this analysis were previously identified as relevant to breast cancer, radiation response, or both. In summary, machine learning/bioinformatics methods applied to genome-wide genotyping data have great potential to reveal plausible biological correlates associated with the risk of RCBC.
format Online
Article
Text
id pubmed-7046218
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-70462182020-03-09 Machine learning on genome-wide association studies to predict the risk of radiation-associated contralateral breast cancer in the WECARE Study Lee, Sangkyu Liang, Xiaolin Woods, Meghan Reiner, Anne S. Concannon, Patrick Bernstein, Leslie Lynch, Charles F. Boice, John D. Deasy, Joseph O. Bernstein, Jonine L. Oh, Jung Hun PLoS One Research Article The purpose of this study was to identify germline single nucleotide polymorphisms (SNPs) that optimally predict radiation-associated contralateral breast cancer (RCBC) and to provide new biological insights into the carcinogenic process. Fifty-two women with contralateral breast cancer and 153 women with unilateral breast cancer were identified within the Women’s Environmental Cancer and Radiation Epidemiology (WECARE) Study who were at increased risk of RCBC because they were ≤ 40 years of age at first diagnosis of breast cancer and received a scatter radiation dose > 1 Gy to the contralateral breast. A previously reported algorithm, preconditioned random forest regression, was applied to predict the risk of developing RCBC. The resulting model produced an area under the curve (AUC) of 0.62 (p = 0.04) on hold-out validation data. The biological analysis identified the cyclic AMP-mediated signaling and Ephrin-A as significant biological correlates, which were previously shown to influence cell survival after radiation in an ATM-dependent manner. The key connected genes and proteins that are identified in this analysis were previously identified as relevant to breast cancer, radiation response, or both. In summary, machine learning/bioinformatics methods applied to genome-wide genotyping data have great potential to reveal plausible biological correlates associated with the risk of RCBC. Public Library of Science 2020-02-27 /pmc/articles/PMC7046218/ /pubmed/32106268 http://dx.doi.org/10.1371/journal.pone.0226157 Text en © 2020 Lee et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Lee, Sangkyu
Liang, Xiaolin
Woods, Meghan
Reiner, Anne S.
Concannon, Patrick
Bernstein, Leslie
Lynch, Charles F.
Boice, John D.
Deasy, Joseph O.
Bernstein, Jonine L.
Oh, Jung Hun
Machine learning on genome-wide association studies to predict the risk of radiation-associated contralateral breast cancer in the WECARE Study
title Machine learning on genome-wide association studies to predict the risk of radiation-associated contralateral breast cancer in the WECARE Study
title_full Machine learning on genome-wide association studies to predict the risk of radiation-associated contralateral breast cancer in the WECARE Study
title_fullStr Machine learning on genome-wide association studies to predict the risk of radiation-associated contralateral breast cancer in the WECARE Study
title_full_unstemmed Machine learning on genome-wide association studies to predict the risk of radiation-associated contralateral breast cancer in the WECARE Study
title_short Machine learning on genome-wide association studies to predict the risk of radiation-associated contralateral breast cancer in the WECARE Study
title_sort machine learning on genome-wide association studies to predict the risk of radiation-associated contralateral breast cancer in the wecare study
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7046218/
https://www.ncbi.nlm.nih.gov/pubmed/32106268
http://dx.doi.org/10.1371/journal.pone.0226157
work_keys_str_mv AT leesangkyu machinelearningongenomewideassociationstudiestopredicttheriskofradiationassociatedcontralateralbreastcancerinthewecarestudy
AT liangxiaolin machinelearningongenomewideassociationstudiestopredicttheriskofradiationassociatedcontralateralbreastcancerinthewecarestudy
AT woodsmeghan machinelearningongenomewideassociationstudiestopredicttheriskofradiationassociatedcontralateralbreastcancerinthewecarestudy
AT reinerannes machinelearningongenomewideassociationstudiestopredicttheriskofradiationassociatedcontralateralbreastcancerinthewecarestudy
AT concannonpatrick machinelearningongenomewideassociationstudiestopredicttheriskofradiationassociatedcontralateralbreastcancerinthewecarestudy
AT bernsteinleslie machinelearningongenomewideassociationstudiestopredicttheriskofradiationassociatedcontralateralbreastcancerinthewecarestudy
AT lynchcharlesf machinelearningongenomewideassociationstudiestopredicttheriskofradiationassociatedcontralateralbreastcancerinthewecarestudy
AT boicejohnd machinelearningongenomewideassociationstudiestopredicttheriskofradiationassociatedcontralateralbreastcancerinthewecarestudy
AT deasyjosepho machinelearningongenomewideassociationstudiestopredicttheriskofradiationassociatedcontralateralbreastcancerinthewecarestudy
AT bernsteinjoninel machinelearningongenomewideassociationstudiestopredicttheriskofradiationassociatedcontralateralbreastcancerinthewecarestudy
AT ohjunghun machinelearningongenomewideassociationstudiestopredicttheriskofradiationassociatedcontralateralbreastcancerinthewecarestudy