Cargando…

Deep learning generates custom-made logistic regression models for explaining how breast cancer subtypes are classified

Differentiating the intrinsic subtypes of breast cancer is crucial for deciding the best treatment strategy. Deep learning can predict the subtypes from genetic information more accurately than conventional statistical methods, but to date, deep learning has not been directly utilized to examine whi...

Descripción completa

Detalles Bibliográficos
Autores principales: Shibahara, Takuma, Wada, Chisa, Yamashita, Yasuho, Fujita, Kazuhiro, Sato, Masamichi, Kuwata, Junichi, Okamoto, Atsushi, Ono, Yoshimasa
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10202302/
https://www.ncbi.nlm.nih.gov/pubmed/37216350
http://dx.doi.org/10.1371/journal.pone.0286072
_version_ 1785045413635555328
author Shibahara, Takuma
Wada, Chisa
Yamashita, Yasuho
Fujita, Kazuhiro
Sato, Masamichi
Kuwata, Junichi
Okamoto, Atsushi
Ono, Yoshimasa
author_facet Shibahara, Takuma
Wada, Chisa
Yamashita, Yasuho
Fujita, Kazuhiro
Sato, Masamichi
Kuwata, Junichi
Okamoto, Atsushi
Ono, Yoshimasa
author_sort Shibahara, Takuma
collection PubMed
description Differentiating the intrinsic subtypes of breast cancer is crucial for deciding the best treatment strategy. Deep learning can predict the subtypes from genetic information more accurately than conventional statistical methods, but to date, deep learning has not been directly utilized to examine which genes are associated with which subtypes. To clarify the mechanisms embedded in the intrinsic subtypes, we developed an explainable deep learning model called a point-wise linear (PWL) model that generates a custom-made logistic regression for each patient. Logistic regression, which is familiar to both physicians and medical informatics researchers, allows us to analyze the importance of the feature variables, and the PWL model harnesses these practical abilities of logistic regression. In this study, we show that analyzing breast cancer subtypes is clinically beneficial for patients and one of the best ways to validate the capability of the PWL model. First, we trained the PWL model with RNA-seq data to predict PAM50 intrinsic subtypes and applied it to the 41/50 genes of PAM50 through the subtype prediction task. Second, we developed a deep enrichment analysis method to reveal the relationships between the PAM50 subtypes and the copy numbers of breast cancer. Our findings showed that the PWL model utilized genes relevant to the cell cycle-related pathways. These preliminary successes in breast cancer subtype analysis demonstrate the potential of our analysis strategy to clarify the mechanisms underlying breast cancer and improve overall clinical outcomes.
format Online
Article
Text
id pubmed-10202302
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-102023022023-05-23 Deep learning generates custom-made logistic regression models for explaining how breast cancer subtypes are classified Shibahara, Takuma Wada, Chisa Yamashita, Yasuho Fujita, Kazuhiro Sato, Masamichi Kuwata, Junichi Okamoto, Atsushi Ono, Yoshimasa PLoS One Research Article Differentiating the intrinsic subtypes of breast cancer is crucial for deciding the best treatment strategy. Deep learning can predict the subtypes from genetic information more accurately than conventional statistical methods, but to date, deep learning has not been directly utilized to examine which genes are associated with which subtypes. To clarify the mechanisms embedded in the intrinsic subtypes, we developed an explainable deep learning model called a point-wise linear (PWL) model that generates a custom-made logistic regression for each patient. Logistic regression, which is familiar to both physicians and medical informatics researchers, allows us to analyze the importance of the feature variables, and the PWL model harnesses these practical abilities of logistic regression. In this study, we show that analyzing breast cancer subtypes is clinically beneficial for patients and one of the best ways to validate the capability of the PWL model. First, we trained the PWL model with RNA-seq data to predict PAM50 intrinsic subtypes and applied it to the 41/50 genes of PAM50 through the subtype prediction task. Second, we developed a deep enrichment analysis method to reveal the relationships between the PAM50 subtypes and the copy numbers of breast cancer. Our findings showed that the PWL model utilized genes relevant to the cell cycle-related pathways. These preliminary successes in breast cancer subtype analysis demonstrate the potential of our analysis strategy to clarify the mechanisms underlying breast cancer and improve overall clinical outcomes. Public Library of Science 2023-05-22 /pmc/articles/PMC10202302/ /pubmed/37216350 http://dx.doi.org/10.1371/journal.pone.0286072 Text en © 2023 Shibahara et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Shibahara, Takuma
Wada, Chisa
Yamashita, Yasuho
Fujita, Kazuhiro
Sato, Masamichi
Kuwata, Junichi
Okamoto, Atsushi
Ono, Yoshimasa
Deep learning generates custom-made logistic regression models for explaining how breast cancer subtypes are classified
title Deep learning generates custom-made logistic regression models for explaining how breast cancer subtypes are classified
title_full Deep learning generates custom-made logistic regression models for explaining how breast cancer subtypes are classified
title_fullStr Deep learning generates custom-made logistic regression models for explaining how breast cancer subtypes are classified
title_full_unstemmed Deep learning generates custom-made logistic regression models for explaining how breast cancer subtypes are classified
title_short Deep learning generates custom-made logistic regression models for explaining how breast cancer subtypes are classified
title_sort deep learning generates custom-made logistic regression models for explaining how breast cancer subtypes are classified
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10202302/
https://www.ncbi.nlm.nih.gov/pubmed/37216350
http://dx.doi.org/10.1371/journal.pone.0286072
work_keys_str_mv AT shibaharatakuma deeplearninggeneratescustommadelogisticregressionmodelsforexplaininghowbreastcancersubtypesareclassified
AT wadachisa deeplearninggeneratescustommadelogisticregressionmodelsforexplaininghowbreastcancersubtypesareclassified
AT yamashitayasuho deeplearninggeneratescustommadelogisticregressionmodelsforexplaininghowbreastcancersubtypesareclassified
AT fujitakazuhiro deeplearninggeneratescustommadelogisticregressionmodelsforexplaininghowbreastcancersubtypesareclassified
AT satomasamichi deeplearninggeneratescustommadelogisticregressionmodelsforexplaininghowbreastcancersubtypesareclassified
AT kuwatajunichi deeplearninggeneratescustommadelogisticregressionmodelsforexplaininghowbreastcancersubtypesareclassified
AT okamotoatsushi deeplearninggeneratescustommadelogisticregressionmodelsforexplaininghowbreastcancersubtypesareclassified
AT onoyoshimasa deeplearninggeneratescustommadelogisticregressionmodelsforexplaininghowbreastcancersubtypesareclassified