Cargando…
Prediction of PIK3CA mutations from cancer gene expression data
Breast cancers with PIK3CA mutations can be treated with PIK3CA inhibitors in hormone receptor-positive HER2 negative subtypes. We applied a supervised elastic net penalized logistic regression model to predict PIK3CA mutations from gene expression data. This regression approach was applied to predi...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7652327/ https://www.ncbi.nlm.nih.gov/pubmed/33166334 http://dx.doi.org/10.1371/journal.pone.0241514 |
_version_ | 1783607689671081984 |
---|---|
author | Kang, Jun Lee, Ahwon Lee, Youn Soo |
author_facet | Kang, Jun Lee, Ahwon Lee, Youn Soo |
author_sort | Kang, Jun |
collection | PubMed |
description | Breast cancers with PIK3CA mutations can be treated with PIK3CA inhibitors in hormone receptor-positive HER2 negative subtypes. We applied a supervised elastic net penalized logistic regression model to predict PIK3CA mutations from gene expression data. This regression approach was applied to predict modeling using the TCGA pan-cancer dataset. Approximately 10,000 cases were available for PIK3CA mutation and mRNA expression data. In 10-fold cross-validation, the model with λ = 0.01 and α = 1.0 (ridge regression) showed the best performance, in terms of area under the receiver operating characteristic (AUROC). The final model was developed with selected hyper-parameters using the entire training set. The training set AUROC was 0.93, and the test set AUROC was 0.84. The area under the precision-recall (AUPR) of the training set was 0.66, and the test set AUPR was 0.39. Cancer types were the most important predictors. Both insulin like growth factor 1 receptor (IGF1R) and the phosphatase and tensin homolog (PTEN) were the most significant genes in gene expression predictors. Our study suggests that predicting genomic alterations using gene expression data is possible, with good outcomes. |
format | Online Article Text |
id | pubmed-7652327 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-76523272020-11-18 Prediction of PIK3CA mutations from cancer gene expression data Kang, Jun Lee, Ahwon Lee, Youn Soo PLoS One Research Article Breast cancers with PIK3CA mutations can be treated with PIK3CA inhibitors in hormone receptor-positive HER2 negative subtypes. We applied a supervised elastic net penalized logistic regression model to predict PIK3CA mutations from gene expression data. This regression approach was applied to predict modeling using the TCGA pan-cancer dataset. Approximately 10,000 cases were available for PIK3CA mutation and mRNA expression data. In 10-fold cross-validation, the model with λ = 0.01 and α = 1.0 (ridge regression) showed the best performance, in terms of area under the receiver operating characteristic (AUROC). The final model was developed with selected hyper-parameters using the entire training set. The training set AUROC was 0.93, and the test set AUROC was 0.84. The area under the precision-recall (AUPR) of the training set was 0.66, and the test set AUPR was 0.39. Cancer types were the most important predictors. Both insulin like growth factor 1 receptor (IGF1R) and the phosphatase and tensin homolog (PTEN) were the most significant genes in gene expression predictors. Our study suggests that predicting genomic alterations using gene expression data is possible, with good outcomes. Public Library of Science 2020-11-09 /pmc/articles/PMC7652327/ /pubmed/33166334 http://dx.doi.org/10.1371/journal.pone.0241514 Text en © 2020 Kang et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Kang, Jun Lee, Ahwon Lee, Youn Soo Prediction of PIK3CA mutations from cancer gene expression data |
title | Prediction of PIK3CA mutations from cancer gene expression data |
title_full | Prediction of PIK3CA mutations from cancer gene expression data |
title_fullStr | Prediction of PIK3CA mutations from cancer gene expression data |
title_full_unstemmed | Prediction of PIK3CA mutations from cancer gene expression data |
title_short | Prediction of PIK3CA mutations from cancer gene expression data |
title_sort | prediction of pik3ca mutations from cancer gene expression data |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7652327/ https://www.ncbi.nlm.nih.gov/pubmed/33166334 http://dx.doi.org/10.1371/journal.pone.0241514 |
work_keys_str_mv | AT kangjun predictionofpik3camutationsfromcancergeneexpressiondata AT leeahwon predictionofpik3camutationsfromcancergeneexpressiondata AT leeyounsoo predictionofpik3camutationsfromcancergeneexpressiondata |