Cargando…

Deep Learning Accurately Predicts Estrogen Receptor Status in Breast Cancer Metabolomics Data

[Image: see text] Metabolomics holds the promise as a new technology to diagnose highly heterogeneous diseases. Conventionally, metabolomics data analysis for diagnosis is done using various statistical and machine learning based classification methods. However, it remains unknown if deep neural net...

Descripción completa

Detalles Bibliográficos
Autores principales: Alakwaa, Fadhl M., Chaudhary, Kumardeep, Garmire, Lana X.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Chemical Society 2017
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5759031/
https://www.ncbi.nlm.nih.gov/pubmed/29110491
http://dx.doi.org/10.1021/acs.jproteome.7b00595
_version_ 1783291117718994944
author Alakwaa, Fadhl M.
Chaudhary, Kumardeep
Garmire, Lana X.
author_facet Alakwaa, Fadhl M.
Chaudhary, Kumardeep
Garmire, Lana X.
author_sort Alakwaa, Fadhl M.
collection PubMed
description [Image: see text] Metabolomics holds the promise as a new technology to diagnose highly heterogeneous diseases. Conventionally, metabolomics data analysis for diagnosis is done using various statistical and machine learning based classification methods. However, it remains unknown if deep neural network, a class of increasingly popular machine learning methods, is suitable to classify metabolomics data. Here we use a cohort of 271 breast cancer tissues, 204 positive estrogen receptor (ER+), and 67 negative estrogen receptor (ER−) to test the accuracies of feed-forward networks, a deep learning (DL) framework, as well as six widely used machine learning models, namely random forest (RF), support vector machines (SVM), recursive partitioning and regression trees (RPART), linear discriminant analysis (LDA), prediction analysis for microarrays (PAM), and generalized boosted models (GBM). DL framework has the highest area under the curve (AUC) of 0.93 in classifying ER+/ER– patients, compared to the other six machine learning algorithms. Furthermore, the biological interpretation of the first hidden layer reveals eight commonly enriched significant metabolomics pathways (adjusted P-value <0.05) that cannot be discovered by other machine learning methods. Among them, protein digestion and absorption and ATP-binding cassette (ABC) transporters pathways are also confirmed in integrated analysis between metabolomics and gene expression data in these samples. In summary, deep learning method shows advantages for metabolomics based breast cancer ER status classification, with both the highest prediction accuracy (AUC = 0.93) and better revelation of disease biology. We encourage the adoption of feed-forward networks based deep learning method in the metabolomics research community for classification.
format Online
Article
Text
id pubmed-5759031
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher American Chemical Society
record_format MEDLINE/PubMed
spelling pubmed-57590312018-01-10 Deep Learning Accurately Predicts Estrogen Receptor Status in Breast Cancer Metabolomics Data Alakwaa, Fadhl M. Chaudhary, Kumardeep Garmire, Lana X. J Proteome Res [Image: see text] Metabolomics holds the promise as a new technology to diagnose highly heterogeneous diseases. Conventionally, metabolomics data analysis for diagnosis is done using various statistical and machine learning based classification methods. However, it remains unknown if deep neural network, a class of increasingly popular machine learning methods, is suitable to classify metabolomics data. Here we use a cohort of 271 breast cancer tissues, 204 positive estrogen receptor (ER+), and 67 negative estrogen receptor (ER−) to test the accuracies of feed-forward networks, a deep learning (DL) framework, as well as six widely used machine learning models, namely random forest (RF), support vector machines (SVM), recursive partitioning and regression trees (RPART), linear discriminant analysis (LDA), prediction analysis for microarrays (PAM), and generalized boosted models (GBM). DL framework has the highest area under the curve (AUC) of 0.93 in classifying ER+/ER– patients, compared to the other six machine learning algorithms. Furthermore, the biological interpretation of the first hidden layer reveals eight commonly enriched significant metabolomics pathways (adjusted P-value <0.05) that cannot be discovered by other machine learning methods. Among them, protein digestion and absorption and ATP-binding cassette (ABC) transporters pathways are also confirmed in integrated analysis between metabolomics and gene expression data in these samples. In summary, deep learning method shows advantages for metabolomics based breast cancer ER status classification, with both the highest prediction accuracy (AUC = 0.93) and better revelation of disease biology. We encourage the adoption of feed-forward networks based deep learning method in the metabolomics research community for classification. American Chemical Society 2017-11-07 2018-01-05 /pmc/articles/PMC5759031/ /pubmed/29110491 http://dx.doi.org/10.1021/acs.jproteome.7b00595 Text en Copyright © 2017 American Chemical Society This is an open access article published under an ACS AuthorChoice License (http://pubs.acs.org/page/policy/authorchoice_termsofuse.html) , which permits copying and redistribution of the article or any adaptations for non-commercial purposes.
spellingShingle Alakwaa, Fadhl M.
Chaudhary, Kumardeep
Garmire, Lana X.
Deep Learning Accurately Predicts Estrogen Receptor Status in Breast Cancer Metabolomics Data
title Deep Learning Accurately Predicts Estrogen Receptor Status in Breast Cancer Metabolomics Data
title_full Deep Learning Accurately Predicts Estrogen Receptor Status in Breast Cancer Metabolomics Data
title_fullStr Deep Learning Accurately Predicts Estrogen Receptor Status in Breast Cancer Metabolomics Data
title_full_unstemmed Deep Learning Accurately Predicts Estrogen Receptor Status in Breast Cancer Metabolomics Data
title_short Deep Learning Accurately Predicts Estrogen Receptor Status in Breast Cancer Metabolomics Data
title_sort deep learning accurately predicts estrogen receptor status in breast cancer metabolomics data
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5759031/
https://www.ncbi.nlm.nih.gov/pubmed/29110491
http://dx.doi.org/10.1021/acs.jproteome.7b00595
work_keys_str_mv AT alakwaafadhlm deeplearningaccuratelypredictsestrogenreceptorstatusinbreastcancermetabolomicsdata
AT chaudharykumardeep deeplearningaccuratelypredictsestrogenreceptorstatusinbreastcancermetabolomicsdata
AT garmirelanax deeplearningaccuratelypredictsestrogenreceptorstatusinbreastcancermetabolomicsdata