Cargando…

Multi-Omics Marker Analysis Enables Early Prediction of Breast Tumor Progression

Ductal carcinoma in situ (DCIS) is a preinvasive form of breast cancer with a highly variable potential of becoming invasive and affecting mortality of the patients. Due to the lack of accurate markers of disease progression, many women with detected DCIS are currently overtreated. To distinguish th...

Descripción completa

Detalles Bibliográficos
Autores principales: Xu, Haifeng, Lien, Tonje, Bergholtz, Helga, Fleischer, Thomas, Djerroudi, Lounes, Vincent-Salomon, Anne, Sørlie, Therese, Aittokallio, Tero
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8209521/
https://www.ncbi.nlm.nih.gov/pubmed/34149812
http://dx.doi.org/10.3389/fgene.2021.670749
_version_ 1783709147214118912
author Xu, Haifeng
Lien, Tonje
Bergholtz, Helga
Fleischer, Thomas
Djerroudi, Lounes
Vincent-Salomon, Anne
Sørlie, Therese
Aittokallio, Tero
author_facet Xu, Haifeng
Lien, Tonje
Bergholtz, Helga
Fleischer, Thomas
Djerroudi, Lounes
Vincent-Salomon, Anne
Sørlie, Therese
Aittokallio, Tero
author_sort Xu, Haifeng
collection PubMed
description Ductal carcinoma in situ (DCIS) is a preinvasive form of breast cancer with a highly variable potential of becoming invasive and affecting mortality of the patients. Due to the lack of accurate markers of disease progression, many women with detected DCIS are currently overtreated. To distinguish those DCIS cases who are likely to require therapy from those who should be left untreated, there is a need for robust and predictive biomarkers extracted from molecular or genetic profiles. We developed a supervised machine learning approach that implements multi-omics feature selection and model regularization for the identification of biomarker combinations that could be used to distinguish low-risk DCIS lesions from those with a higher likelihood of progression. To investigate the genetic heterogeneity of disease progression, we applied this approach to 40 pure DCIS and 259 invasive breast cancer (IBC) samples profiled with genome-wide transcriptomics, DNA methylation, and DNA copy number variation. Feature selection using the multi-omics Lasso-regularized algorithm identified both known genes involved in breast cancer development, as well as novel markers for early detection. Even though the gene expression-based model features led to the highest classification accuracy alone, methylation data provided a complementary source of features and improved especially the sensitivity of correctly classifying DCIS cases. We also identified a number of repeatedly misclassified DCIS cases when using either the expression or methylation markers. A small panel of 10 gene markers was able to distinguish DCIS and IBC cases with high accuracy in nested cross-validation (AU-ROC = 0.99). The marker panel was not specific to any of the established breast cancer subtypes, suggesting that the 10-gene signature may provide a subtype-agnostic and cost-effective approach for breast cancer detection and patient stratification. We further confirmed high accuracy of the 10-gene signature in an external validation cohort (AU-ROC = 0.95), profiled using distinct transcriptomic assay, hence demonstrating robustness of the risk signature.
format Online
Article
Text
id pubmed-8209521
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-82095212021-06-18 Multi-Omics Marker Analysis Enables Early Prediction of Breast Tumor Progression Xu, Haifeng Lien, Tonje Bergholtz, Helga Fleischer, Thomas Djerroudi, Lounes Vincent-Salomon, Anne Sørlie, Therese Aittokallio, Tero Front Genet Genetics Ductal carcinoma in situ (DCIS) is a preinvasive form of breast cancer with a highly variable potential of becoming invasive and affecting mortality of the patients. Due to the lack of accurate markers of disease progression, many women with detected DCIS are currently overtreated. To distinguish those DCIS cases who are likely to require therapy from those who should be left untreated, there is a need for robust and predictive biomarkers extracted from molecular or genetic profiles. We developed a supervised machine learning approach that implements multi-omics feature selection and model regularization for the identification of biomarker combinations that could be used to distinguish low-risk DCIS lesions from those with a higher likelihood of progression. To investigate the genetic heterogeneity of disease progression, we applied this approach to 40 pure DCIS and 259 invasive breast cancer (IBC) samples profiled with genome-wide transcriptomics, DNA methylation, and DNA copy number variation. Feature selection using the multi-omics Lasso-regularized algorithm identified both known genes involved in breast cancer development, as well as novel markers for early detection. Even though the gene expression-based model features led to the highest classification accuracy alone, methylation data provided a complementary source of features and improved especially the sensitivity of correctly classifying DCIS cases. We also identified a number of repeatedly misclassified DCIS cases when using either the expression or methylation markers. A small panel of 10 gene markers was able to distinguish DCIS and IBC cases with high accuracy in nested cross-validation (AU-ROC = 0.99). The marker panel was not specific to any of the established breast cancer subtypes, suggesting that the 10-gene signature may provide a subtype-agnostic and cost-effective approach for breast cancer detection and patient stratification. We further confirmed high accuracy of the 10-gene signature in an external validation cohort (AU-ROC = 0.95), profiled using distinct transcriptomic assay, hence demonstrating robustness of the risk signature. Frontiers Media S.A. 2021-06-03 /pmc/articles/PMC8209521/ /pubmed/34149812 http://dx.doi.org/10.3389/fgene.2021.670749 Text en Copyright © 2021 Xu, Lien, Bergholtz, Fleischer, Djerroudi, Vincent-Salomon, Sørlie and Aittokallio. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Xu, Haifeng
Lien, Tonje
Bergholtz, Helga
Fleischer, Thomas
Djerroudi, Lounes
Vincent-Salomon, Anne
Sørlie, Therese
Aittokallio, Tero
Multi-Omics Marker Analysis Enables Early Prediction of Breast Tumor Progression
title Multi-Omics Marker Analysis Enables Early Prediction of Breast Tumor Progression
title_full Multi-Omics Marker Analysis Enables Early Prediction of Breast Tumor Progression
title_fullStr Multi-Omics Marker Analysis Enables Early Prediction of Breast Tumor Progression
title_full_unstemmed Multi-Omics Marker Analysis Enables Early Prediction of Breast Tumor Progression
title_short Multi-Omics Marker Analysis Enables Early Prediction of Breast Tumor Progression
title_sort multi-omics marker analysis enables early prediction of breast tumor progression
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8209521/
https://www.ncbi.nlm.nih.gov/pubmed/34149812
http://dx.doi.org/10.3389/fgene.2021.670749
work_keys_str_mv AT xuhaifeng multiomicsmarkeranalysisenablesearlypredictionofbreasttumorprogression
AT lientonje multiomicsmarkeranalysisenablesearlypredictionofbreasttumorprogression
AT bergholtzhelga multiomicsmarkeranalysisenablesearlypredictionofbreasttumorprogression
AT fleischerthomas multiomicsmarkeranalysisenablesearlypredictionofbreasttumorprogression
AT djerroudilounes multiomicsmarkeranalysisenablesearlypredictionofbreasttumorprogression
AT vincentsalomonanne multiomicsmarkeranalysisenablesearlypredictionofbreasttumorprogression
AT sørlietherese multiomicsmarkeranalysisenablesearlypredictionofbreasttumorprogression
AT aittokalliotero multiomicsmarkeranalysisenablesearlypredictionofbreasttumorprogression