Cargando…
Evaluation of a genetic risk score computed using human chromosomal-scale length variation to predict breast cancer
INTRODUCTION: The ability to accurately predict whether a woman will develop breast cancer later in her life, should reduce the number of breast cancer deaths. Different predictive models exist for breast cancer based on family history, BRCA status, and SNP analysis. The best of these models has an...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10273758/ https://www.ncbi.nlm.nih.gov/pubmed/37328908 http://dx.doi.org/10.1186/s40246-023-00482-8 |
_version_ | 1785059711570149376 |
---|---|
author | Ko, Charmeine Brody, James P. |
author_facet | Ko, Charmeine Brody, James P. |
author_sort | Ko, Charmeine |
collection | PubMed |
description | INTRODUCTION: The ability to accurately predict whether a woman will develop breast cancer later in her life, should reduce the number of breast cancer deaths. Different predictive models exist for breast cancer based on family history, BRCA status, and SNP analysis. The best of these models has an accuracy (area under the receiver operating characteristic curve, AUC) of about 0.65. We have developed computational methods to characterize a genome by a small set of numbers that represent the length of segments of the chromosomes, called chromosomal-scale length variation (CSLV). METHODS: We built machine learning models to differentiate between women who had breast cancer and women who did not based on their CSLV characterization. We applied this procedure to two different datasets: the UK Biobank (1534 women with breast cancer and 4391 women who did not) and the Cancer Genome Atlas (TCGA) 874 with breast cancer and 3381 without. RESULTS: We found a machine learning model that could predict breast cancer with an AUC of 0.836 95% CI (0.830.0.843) in the UK Biobank data. Using a similar approach with the TCGA data, we obtained a model with an AUC of 0.704 95% CI (0.702, 0.706). Variable importance analysis indicated that no single chromosomal region was responsible for significant fraction of the model results. CONCLUSION: In this retrospective study, chromosomal-scale length variation could effectively predict whether or not a woman enrolled in the UK Biobank study developed breast cancer. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s40246-023-00482-8. |
format | Online Article Text |
id | pubmed-10273758 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-102737582023-06-17 Evaluation of a genetic risk score computed using human chromosomal-scale length variation to predict breast cancer Ko, Charmeine Brody, James P. Hum Genomics Research INTRODUCTION: The ability to accurately predict whether a woman will develop breast cancer later in her life, should reduce the number of breast cancer deaths. Different predictive models exist for breast cancer based on family history, BRCA status, and SNP analysis. The best of these models has an accuracy (area under the receiver operating characteristic curve, AUC) of about 0.65. We have developed computational methods to characterize a genome by a small set of numbers that represent the length of segments of the chromosomes, called chromosomal-scale length variation (CSLV). METHODS: We built machine learning models to differentiate between women who had breast cancer and women who did not based on their CSLV characterization. We applied this procedure to two different datasets: the UK Biobank (1534 women with breast cancer and 4391 women who did not) and the Cancer Genome Atlas (TCGA) 874 with breast cancer and 3381 without. RESULTS: We found a machine learning model that could predict breast cancer with an AUC of 0.836 95% CI (0.830.0.843) in the UK Biobank data. Using a similar approach with the TCGA data, we obtained a model with an AUC of 0.704 95% CI (0.702, 0.706). Variable importance analysis indicated that no single chromosomal region was responsible for significant fraction of the model results. CONCLUSION: In this retrospective study, chromosomal-scale length variation could effectively predict whether or not a woman enrolled in the UK Biobank study developed breast cancer. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s40246-023-00482-8. BioMed Central 2023-06-16 /pmc/articles/PMC10273758/ /pubmed/37328908 http://dx.doi.org/10.1186/s40246-023-00482-8 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Ko, Charmeine Brody, James P. Evaluation of a genetic risk score computed using human chromosomal-scale length variation to predict breast cancer |
title | Evaluation of a genetic risk score computed using human chromosomal-scale length variation to predict breast cancer |
title_full | Evaluation of a genetic risk score computed using human chromosomal-scale length variation to predict breast cancer |
title_fullStr | Evaluation of a genetic risk score computed using human chromosomal-scale length variation to predict breast cancer |
title_full_unstemmed | Evaluation of a genetic risk score computed using human chromosomal-scale length variation to predict breast cancer |
title_short | Evaluation of a genetic risk score computed using human chromosomal-scale length variation to predict breast cancer |
title_sort | evaluation of a genetic risk score computed using human chromosomal-scale length variation to predict breast cancer |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10273758/ https://www.ncbi.nlm.nih.gov/pubmed/37328908 http://dx.doi.org/10.1186/s40246-023-00482-8 |
work_keys_str_mv | AT kocharmeine evaluationofageneticriskscorecomputedusinghumanchromosomalscalelengthvariationtopredictbreastcancer AT brodyjamesp evaluationofageneticriskscorecomputedusinghumanchromosomalscalelengthvariationtopredictbreastcancer |