Cargando…

Prediction of PIK3CA mutations from cancer gene expression data

Breast cancers with PIK3CA mutations can be treated with PIK3CA inhibitors in hormone receptor-positive HER2 negative subtypes. We applied a supervised elastic net penalized logistic regression model to predict PIK3CA mutations from gene expression data. This regression approach was applied to predi...

Descripción completa

Detalles Bibliográficos
Autores principales: Kang, Jun, Lee, Ahwon, Lee, Youn Soo
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7652327/
https://www.ncbi.nlm.nih.gov/pubmed/33166334
http://dx.doi.org/10.1371/journal.pone.0241514
_version_ 1783607689671081984
author Kang, Jun
Lee, Ahwon
Lee, Youn Soo
author_facet Kang, Jun
Lee, Ahwon
Lee, Youn Soo
author_sort Kang, Jun
collection PubMed
description Breast cancers with PIK3CA mutations can be treated with PIK3CA inhibitors in hormone receptor-positive HER2 negative subtypes. We applied a supervised elastic net penalized logistic regression model to predict PIK3CA mutations from gene expression data. This regression approach was applied to predict modeling using the TCGA pan-cancer dataset. Approximately 10,000 cases were available for PIK3CA mutation and mRNA expression data. In 10-fold cross-validation, the model with λ = 0.01 and α = 1.0 (ridge regression) showed the best performance, in terms of area under the receiver operating characteristic (AUROC). The final model was developed with selected hyper-parameters using the entire training set. The training set AUROC was 0.93, and the test set AUROC was 0.84. The area under the precision-recall (AUPR) of the training set was 0.66, and the test set AUPR was 0.39. Cancer types were the most important predictors. Both insulin like growth factor 1 receptor (IGF1R) and the phosphatase and tensin homolog (PTEN) were the most significant genes in gene expression predictors. Our study suggests that predicting genomic alterations using gene expression data is possible, with good outcomes.
format Online
Article
Text
id pubmed-7652327
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-76523272020-11-18 Prediction of PIK3CA mutations from cancer gene expression data Kang, Jun Lee, Ahwon Lee, Youn Soo PLoS One Research Article Breast cancers with PIK3CA mutations can be treated with PIK3CA inhibitors in hormone receptor-positive HER2 negative subtypes. We applied a supervised elastic net penalized logistic regression model to predict PIK3CA mutations from gene expression data. This regression approach was applied to predict modeling using the TCGA pan-cancer dataset. Approximately 10,000 cases were available for PIK3CA mutation and mRNA expression data. In 10-fold cross-validation, the model with λ = 0.01 and α = 1.0 (ridge regression) showed the best performance, in terms of area under the receiver operating characteristic (AUROC). The final model was developed with selected hyper-parameters using the entire training set. The training set AUROC was 0.93, and the test set AUROC was 0.84. The area under the precision-recall (AUPR) of the training set was 0.66, and the test set AUPR was 0.39. Cancer types were the most important predictors. Both insulin like growth factor 1 receptor (IGF1R) and the phosphatase and tensin homolog (PTEN) were the most significant genes in gene expression predictors. Our study suggests that predicting genomic alterations using gene expression data is possible, with good outcomes. Public Library of Science 2020-11-09 /pmc/articles/PMC7652327/ /pubmed/33166334 http://dx.doi.org/10.1371/journal.pone.0241514 Text en © 2020 Kang et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Kang, Jun
Lee, Ahwon
Lee, Youn Soo
Prediction of PIK3CA mutations from cancer gene expression data
title Prediction of PIK3CA mutations from cancer gene expression data
title_full Prediction of PIK3CA mutations from cancer gene expression data
title_fullStr Prediction of PIK3CA mutations from cancer gene expression data
title_full_unstemmed Prediction of PIK3CA mutations from cancer gene expression data
title_short Prediction of PIK3CA mutations from cancer gene expression data
title_sort prediction of pik3ca mutations from cancer gene expression data
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7652327/
https://www.ncbi.nlm.nih.gov/pubmed/33166334
http://dx.doi.org/10.1371/journal.pone.0241514
work_keys_str_mv AT kangjun predictionofpik3camutationsfromcancergeneexpressiondata
AT leeahwon predictionofpik3camutationsfromcancergeneexpressiondata
AT leeyounsoo predictionofpik3camutationsfromcancergeneexpressiondata