Cargando…

Evaluation of a genetic risk score computed using human chromosomal-scale length variation to predict breast cancer

INTRODUCTION: The ability to accurately predict whether a woman will develop breast cancer later in her life, should reduce the number of breast cancer deaths. Different predictive models exist for breast cancer based on family history, BRCA status, and SNP analysis. The best of these models has an...

Descripción completa

Detalles Bibliográficos
Autores principales: Ko, Charmeine, Brody, James P.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10273758/
https://www.ncbi.nlm.nih.gov/pubmed/37328908
http://dx.doi.org/10.1186/s40246-023-00482-8
_version_ 1785059711570149376
author Ko, Charmeine
Brody, James P.
author_facet Ko, Charmeine
Brody, James P.
author_sort Ko, Charmeine
collection PubMed
description INTRODUCTION: The ability to accurately predict whether a woman will develop breast cancer later in her life, should reduce the number of breast cancer deaths. Different predictive models exist for breast cancer based on family history, BRCA status, and SNP analysis. The best of these models has an accuracy (area under the receiver operating characteristic curve, AUC) of about 0.65. We have developed computational methods to characterize a genome by a small set of numbers that represent the length of segments of the chromosomes, called chromosomal-scale length variation (CSLV). METHODS: We built machine learning models to differentiate between women who had breast cancer and women who did not based on their CSLV characterization. We applied this procedure to two different datasets: the UK Biobank (1534 women with breast cancer and 4391 women who did not) and the Cancer Genome Atlas (TCGA) 874 with breast cancer and 3381 without. RESULTS: We found a machine learning model that could predict breast cancer with an AUC of 0.836 95% CI (0.830.0.843) in the UK Biobank data. Using a similar approach with the TCGA data, we obtained a model with an AUC of 0.704 95% CI (0.702, 0.706). Variable importance analysis indicated that no single chromosomal region was responsible for significant fraction of the model results. CONCLUSION: In this retrospective study, chromosomal-scale length variation could effectively predict whether or not a woman enrolled in the UK Biobank study developed breast cancer. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s40246-023-00482-8.
format Online
Article
Text
id pubmed-10273758
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-102737582023-06-17 Evaluation of a genetic risk score computed using human chromosomal-scale length variation to predict breast cancer Ko, Charmeine Brody, James P. Hum Genomics Research INTRODUCTION: The ability to accurately predict whether a woman will develop breast cancer later in her life, should reduce the number of breast cancer deaths. Different predictive models exist for breast cancer based on family history, BRCA status, and SNP analysis. The best of these models has an accuracy (area under the receiver operating characteristic curve, AUC) of about 0.65. We have developed computational methods to characterize a genome by a small set of numbers that represent the length of segments of the chromosomes, called chromosomal-scale length variation (CSLV). METHODS: We built machine learning models to differentiate between women who had breast cancer and women who did not based on their CSLV characterization. We applied this procedure to two different datasets: the UK Biobank (1534 women with breast cancer and 4391 women who did not) and the Cancer Genome Atlas (TCGA) 874 with breast cancer and 3381 without. RESULTS: We found a machine learning model that could predict breast cancer with an AUC of 0.836 95% CI (0.830.0.843) in the UK Biobank data. Using a similar approach with the TCGA data, we obtained a model with an AUC of 0.704 95% CI (0.702, 0.706). Variable importance analysis indicated that no single chromosomal region was responsible for significant fraction of the model results. CONCLUSION: In this retrospective study, chromosomal-scale length variation could effectively predict whether or not a woman enrolled in the UK Biobank study developed breast cancer. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s40246-023-00482-8. BioMed Central 2023-06-16 /pmc/articles/PMC10273758/ /pubmed/37328908 http://dx.doi.org/10.1186/s40246-023-00482-8 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Ko, Charmeine
Brody, James P.
Evaluation of a genetic risk score computed using human chromosomal-scale length variation to predict breast cancer
title Evaluation of a genetic risk score computed using human chromosomal-scale length variation to predict breast cancer
title_full Evaluation of a genetic risk score computed using human chromosomal-scale length variation to predict breast cancer
title_fullStr Evaluation of a genetic risk score computed using human chromosomal-scale length variation to predict breast cancer
title_full_unstemmed Evaluation of a genetic risk score computed using human chromosomal-scale length variation to predict breast cancer
title_short Evaluation of a genetic risk score computed using human chromosomal-scale length variation to predict breast cancer
title_sort evaluation of a genetic risk score computed using human chromosomal-scale length variation to predict breast cancer
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10273758/
https://www.ncbi.nlm.nih.gov/pubmed/37328908
http://dx.doi.org/10.1186/s40246-023-00482-8
work_keys_str_mv AT kocharmeine evaluationofageneticriskscorecomputedusinghumanchromosomalscalelengthvariationtopredictbreastcancer
AT brodyjamesp evaluationofageneticriskscorecomputedusinghumanchromosomalscalelengthvariationtopredictbreastcancer