Cargando…

Breast Cancer Case Identification Based on Deep Learning and Bioinformatics Analysis

Mastering the molecular mechanism of breast cancer (BC) can provide an in-depth understanding of BC pathology. This study explored existing technologies for diagnosing BC, such as mammography, ultrasound, magnetic resonance imaging (MRI), computed tomography (CT), and positron emission tomography (P...

Descripción completa

Detalles Bibliográficos
Autores principales: Jia, Dongfang, Chen, Cheng, Chen, Chen, Chen, Fangfang, Zhang, Ningrui, Yan, Ziwei, Lv, Xiaoyi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8165442/
https://www.ncbi.nlm.nih.gov/pubmed/34079578
http://dx.doi.org/10.3389/fgene.2021.628136
_version_ 1783701322916167680
author Jia, Dongfang
Chen, Cheng
Chen, Chen
Chen, Fangfang
Zhang, Ningrui
Yan, Ziwei
Lv, Xiaoyi
author_facet Jia, Dongfang
Chen, Cheng
Chen, Chen
Chen, Fangfang
Zhang, Ningrui
Yan, Ziwei
Lv, Xiaoyi
author_sort Jia, Dongfang
collection PubMed
description Mastering the molecular mechanism of breast cancer (BC) can provide an in-depth understanding of BC pathology. This study explored existing technologies for diagnosing BC, such as mammography, ultrasound, magnetic resonance imaging (MRI), computed tomography (CT), and positron emission tomography (PET) and summarized the disadvantages of the existing cancer diagnosis. The purpose of this article is to use gene expression profiles of The Cancer Genome Atlas (TCGA) and Gene Expression Omnibus (GEO) to classify BC samples and normal samples. The method proposed in this article triumphs over some of the shortcomings of traditional diagnostic methods and can conduct BC diagnosis more rapidly with high sensitivity and have no radiation. This study first selected the genes most relevant to cancer through weighted gene co-expression network analysis (WGCNA) and differential expression analysis (DEA). Then it used the protein–protein interaction (PPI) network to screen 23 hub genes. Finally, it used the support vector machine (SVM), decision tree (DT), Bayesian network (BN), artificial neural network (ANN), convolutional neural network CNN-LeNet and CNN-AlexNet to process the expression levels of 23 hub genes. For gene expression profiles, the ANN model has the best performance in the classification of cancer samples. The ten-time average accuracy is 97.36% (±0.34%), the F1 value is 0.8535 (±0.0260), the sensitivity is 98.32% (±0.32%), the specificity is 89.59% (±3.53%) and the AUC is 0.99. In summary, this method effectively classifies cancer samples and normal samples and provides reasonable new ideas for the early diagnosis of cancer in the future.
format Online
Article
Text
id pubmed-8165442
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-81654422021-06-01 Breast Cancer Case Identification Based on Deep Learning and Bioinformatics Analysis Jia, Dongfang Chen, Cheng Chen, Chen Chen, Fangfang Zhang, Ningrui Yan, Ziwei Lv, Xiaoyi Front Genet Genetics Mastering the molecular mechanism of breast cancer (BC) can provide an in-depth understanding of BC pathology. This study explored existing technologies for diagnosing BC, such as mammography, ultrasound, magnetic resonance imaging (MRI), computed tomography (CT), and positron emission tomography (PET) and summarized the disadvantages of the existing cancer diagnosis. The purpose of this article is to use gene expression profiles of The Cancer Genome Atlas (TCGA) and Gene Expression Omnibus (GEO) to classify BC samples and normal samples. The method proposed in this article triumphs over some of the shortcomings of traditional diagnostic methods and can conduct BC diagnosis more rapidly with high sensitivity and have no radiation. This study first selected the genes most relevant to cancer through weighted gene co-expression network analysis (WGCNA) and differential expression analysis (DEA). Then it used the protein–protein interaction (PPI) network to screen 23 hub genes. Finally, it used the support vector machine (SVM), decision tree (DT), Bayesian network (BN), artificial neural network (ANN), convolutional neural network CNN-LeNet and CNN-AlexNet to process the expression levels of 23 hub genes. For gene expression profiles, the ANN model has the best performance in the classification of cancer samples. The ten-time average accuracy is 97.36% (±0.34%), the F1 value is 0.8535 (±0.0260), the sensitivity is 98.32% (±0.32%), the specificity is 89.59% (±3.53%) and the AUC is 0.99. In summary, this method effectively classifies cancer samples and normal samples and provides reasonable new ideas for the early diagnosis of cancer in the future. Frontiers Media S.A. 2021-05-17 /pmc/articles/PMC8165442/ /pubmed/34079578 http://dx.doi.org/10.3389/fgene.2021.628136 Text en Copyright © 2021 Jia, Chen, Chen, Chen, Zhang, Yan and Lv. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Jia, Dongfang
Chen, Cheng
Chen, Chen
Chen, Fangfang
Zhang, Ningrui
Yan, Ziwei
Lv, Xiaoyi
Breast Cancer Case Identification Based on Deep Learning and Bioinformatics Analysis
title Breast Cancer Case Identification Based on Deep Learning and Bioinformatics Analysis
title_full Breast Cancer Case Identification Based on Deep Learning and Bioinformatics Analysis
title_fullStr Breast Cancer Case Identification Based on Deep Learning and Bioinformatics Analysis
title_full_unstemmed Breast Cancer Case Identification Based on Deep Learning and Bioinformatics Analysis
title_short Breast Cancer Case Identification Based on Deep Learning and Bioinformatics Analysis
title_sort breast cancer case identification based on deep learning and bioinformatics analysis
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8165442/
https://www.ncbi.nlm.nih.gov/pubmed/34079578
http://dx.doi.org/10.3389/fgene.2021.628136
work_keys_str_mv AT jiadongfang breastcancercaseidentificationbasedondeeplearningandbioinformaticsanalysis
AT chencheng breastcancercaseidentificationbasedondeeplearningandbioinformaticsanalysis
AT chenchen breastcancercaseidentificationbasedondeeplearningandbioinformaticsanalysis
AT chenfangfang breastcancercaseidentificationbasedondeeplearningandbioinformaticsanalysis
AT zhangningrui breastcancercaseidentificationbasedondeeplearningandbioinformaticsanalysis
AT yanziwei breastcancercaseidentificationbasedondeeplearningandbioinformaticsanalysis
AT lvxiaoyi breastcancercaseidentificationbasedondeeplearningandbioinformaticsanalysis