Cargando…
Deep Learning Accurately Predicts Estrogen Receptor Status in Breast Cancer Metabolomics Data
[Image: see text] Metabolomics holds the promise as a new technology to diagnose highly heterogeneous diseases. Conventionally, metabolomics data analysis for diagnosis is done using various statistical and machine learning based classification methods. However, it remains unknown if deep neural net...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
American Chemical
Society
2017
|
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5759031/ https://www.ncbi.nlm.nih.gov/pubmed/29110491 http://dx.doi.org/10.1021/acs.jproteome.7b00595 |
_version_ | 1783291117718994944 |
---|---|
author | Alakwaa, Fadhl M. Chaudhary, Kumardeep Garmire, Lana X. |
author_facet | Alakwaa, Fadhl M. Chaudhary, Kumardeep Garmire, Lana X. |
author_sort | Alakwaa, Fadhl M. |
collection | PubMed |
description | [Image: see text] Metabolomics holds the promise as a new technology to diagnose highly heterogeneous diseases. Conventionally, metabolomics data analysis for diagnosis is done using various statistical and machine learning based classification methods. However, it remains unknown if deep neural network, a class of increasingly popular machine learning methods, is suitable to classify metabolomics data. Here we use a cohort of 271 breast cancer tissues, 204 positive estrogen receptor (ER+), and 67 negative estrogen receptor (ER−) to test the accuracies of feed-forward networks, a deep learning (DL) framework, as well as six widely used machine learning models, namely random forest (RF), support vector machines (SVM), recursive partitioning and regression trees (RPART), linear discriminant analysis (LDA), prediction analysis for microarrays (PAM), and generalized boosted models (GBM). DL framework has the highest area under the curve (AUC) of 0.93 in classifying ER+/ER– patients, compared to the other six machine learning algorithms. Furthermore, the biological interpretation of the first hidden layer reveals eight commonly enriched significant metabolomics pathways (adjusted P-value <0.05) that cannot be discovered by other machine learning methods. Among them, protein digestion and absorption and ATP-binding cassette (ABC) transporters pathways are also confirmed in integrated analysis between metabolomics and gene expression data in these samples. In summary, deep learning method shows advantages for metabolomics based breast cancer ER status classification, with both the highest prediction accuracy (AUC = 0.93) and better revelation of disease biology. We encourage the adoption of feed-forward networks based deep learning method in the metabolomics research community for classification. |
format | Online Article Text |
id | pubmed-5759031 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | American Chemical
Society |
record_format | MEDLINE/PubMed |
spelling | pubmed-57590312018-01-10 Deep Learning Accurately Predicts Estrogen Receptor Status in Breast Cancer Metabolomics Data Alakwaa, Fadhl M. Chaudhary, Kumardeep Garmire, Lana X. J Proteome Res [Image: see text] Metabolomics holds the promise as a new technology to diagnose highly heterogeneous diseases. Conventionally, metabolomics data analysis for diagnosis is done using various statistical and machine learning based classification methods. However, it remains unknown if deep neural network, a class of increasingly popular machine learning methods, is suitable to classify metabolomics data. Here we use a cohort of 271 breast cancer tissues, 204 positive estrogen receptor (ER+), and 67 negative estrogen receptor (ER−) to test the accuracies of feed-forward networks, a deep learning (DL) framework, as well as six widely used machine learning models, namely random forest (RF), support vector machines (SVM), recursive partitioning and regression trees (RPART), linear discriminant analysis (LDA), prediction analysis for microarrays (PAM), and generalized boosted models (GBM). DL framework has the highest area under the curve (AUC) of 0.93 in classifying ER+/ER– patients, compared to the other six machine learning algorithms. Furthermore, the biological interpretation of the first hidden layer reveals eight commonly enriched significant metabolomics pathways (adjusted P-value <0.05) that cannot be discovered by other machine learning methods. Among them, protein digestion and absorption and ATP-binding cassette (ABC) transporters pathways are also confirmed in integrated analysis between metabolomics and gene expression data in these samples. In summary, deep learning method shows advantages for metabolomics based breast cancer ER status classification, with both the highest prediction accuracy (AUC = 0.93) and better revelation of disease biology. We encourage the adoption of feed-forward networks based deep learning method in the metabolomics research community for classification. American Chemical Society 2017-11-07 2018-01-05 /pmc/articles/PMC5759031/ /pubmed/29110491 http://dx.doi.org/10.1021/acs.jproteome.7b00595 Text en Copyright © 2017 American Chemical Society This is an open access article published under an ACS AuthorChoice License (http://pubs.acs.org/page/policy/authorchoice_termsofuse.html) , which permits copying and redistribution of the article or any adaptations for non-commercial purposes. |
spellingShingle | Alakwaa, Fadhl M. Chaudhary, Kumardeep Garmire, Lana X. Deep Learning Accurately Predicts Estrogen Receptor Status in Breast Cancer Metabolomics Data |
title | Deep Learning
Accurately Predicts Estrogen Receptor
Status in Breast Cancer Metabolomics Data |
title_full | Deep Learning
Accurately Predicts Estrogen Receptor
Status in Breast Cancer Metabolomics Data |
title_fullStr | Deep Learning
Accurately Predicts Estrogen Receptor
Status in Breast Cancer Metabolomics Data |
title_full_unstemmed | Deep Learning
Accurately Predicts Estrogen Receptor
Status in Breast Cancer Metabolomics Data |
title_short | Deep Learning
Accurately Predicts Estrogen Receptor
Status in Breast Cancer Metabolomics Data |
title_sort | deep learning
accurately predicts estrogen receptor
status in breast cancer metabolomics data |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5759031/ https://www.ncbi.nlm.nih.gov/pubmed/29110491 http://dx.doi.org/10.1021/acs.jproteome.7b00595 |
work_keys_str_mv | AT alakwaafadhlm deeplearningaccuratelypredictsestrogenreceptorstatusinbreastcancermetabolomicsdata AT chaudharykumardeep deeplearningaccuratelypredictsestrogenreceptorstatusinbreastcancermetabolomicsdata AT garmirelanax deeplearningaccuratelypredictsestrogenreceptorstatusinbreastcancermetabolomicsdata |