Cargando…
Identification of Differentially Expressed Genes between Original Breast Cancer and Xenograft Using Machine Learning Algorithms
Breast cancer is one of the most common malignancies in women. Patient-derived tumor xenograft (PDX) model is a cutting-edge approach for drug research on breast cancer. However, PDX still exhibits differences from original human tumors, thereby challenging the molecular understanding of tumorigenes...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5867876/ https://www.ncbi.nlm.nih.gov/pubmed/29534550 http://dx.doi.org/10.3390/genes9030155 |
_version_ | 1783309043194920960 |
---|---|
author | Wang, Deling Li, Jia-Rui Zhang, Yu-Hang Chen, Lei Huang, Tao Cai, Yu-Dong |
author_facet | Wang, Deling Li, Jia-Rui Zhang, Yu-Hang Chen, Lei Huang, Tao Cai, Yu-Dong |
author_sort | Wang, Deling |
collection | PubMed |
description | Breast cancer is one of the most common malignancies in women. Patient-derived tumor xenograft (PDX) model is a cutting-edge approach for drug research on breast cancer. However, PDX still exhibits differences from original human tumors, thereby challenging the molecular understanding of tumorigenesis. In particular, gene expression changes after tissues are transplanted from human to mouse model. In this study, we propose a novel computational method by incorporating several machine learning algorithms, including Monte Carlo feature selection (MCFS), random forest (RF), and rough set-based rule learning, to identify genes with significant expression differences between PDX and original human tumors. First, 831 breast tumors, including 657 PDX and 174 human tumors, were collected. Based on MCFS and RF, 32 genes were then identified to be informative for the prediction of PDX and human tumors and can be used to construct a prediction model. The prediction model exhibits a Matthews coefficient correlation value of 0.777. Seven interpretable interactions within the informative gene were detected based on the rough set-based rule learning. Furthermore, the seven interpretable interactions can be well supported by previous experimental studies. Our study not only presents a method for identifying informative genes with differential expression but also provides insights into the mechanism through which gene expression changes after being transplanted from human tumor into mouse model. This work would be helpful for research and drug development for breast cancer. |
format | Online Article Text |
id | pubmed-5867876 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-58678762018-03-27 Identification of Differentially Expressed Genes between Original Breast Cancer and Xenograft Using Machine Learning Algorithms Wang, Deling Li, Jia-Rui Zhang, Yu-Hang Chen, Lei Huang, Tao Cai, Yu-Dong Genes (Basel) Article Breast cancer is one of the most common malignancies in women. Patient-derived tumor xenograft (PDX) model is a cutting-edge approach for drug research on breast cancer. However, PDX still exhibits differences from original human tumors, thereby challenging the molecular understanding of tumorigenesis. In particular, gene expression changes after tissues are transplanted from human to mouse model. In this study, we propose a novel computational method by incorporating several machine learning algorithms, including Monte Carlo feature selection (MCFS), random forest (RF), and rough set-based rule learning, to identify genes with significant expression differences between PDX and original human tumors. First, 831 breast tumors, including 657 PDX and 174 human tumors, were collected. Based on MCFS and RF, 32 genes were then identified to be informative for the prediction of PDX and human tumors and can be used to construct a prediction model. The prediction model exhibits a Matthews coefficient correlation value of 0.777. Seven interpretable interactions within the informative gene were detected based on the rough set-based rule learning. Furthermore, the seven interpretable interactions can be well supported by previous experimental studies. Our study not only presents a method for identifying informative genes with differential expression but also provides insights into the mechanism through which gene expression changes after being transplanted from human tumor into mouse model. This work would be helpful for research and drug development for breast cancer. MDPI 2018-03-12 /pmc/articles/PMC5867876/ /pubmed/29534550 http://dx.doi.org/10.3390/genes9030155 Text en © 2018 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Wang, Deling Li, Jia-Rui Zhang, Yu-Hang Chen, Lei Huang, Tao Cai, Yu-Dong Identification of Differentially Expressed Genes between Original Breast Cancer and Xenograft Using Machine Learning Algorithms |
title | Identification of Differentially Expressed Genes between Original Breast Cancer and Xenograft Using Machine Learning Algorithms |
title_full | Identification of Differentially Expressed Genes between Original Breast Cancer and Xenograft Using Machine Learning Algorithms |
title_fullStr | Identification of Differentially Expressed Genes between Original Breast Cancer and Xenograft Using Machine Learning Algorithms |
title_full_unstemmed | Identification of Differentially Expressed Genes between Original Breast Cancer and Xenograft Using Machine Learning Algorithms |
title_short | Identification of Differentially Expressed Genes between Original Breast Cancer and Xenograft Using Machine Learning Algorithms |
title_sort | identification of differentially expressed genes between original breast cancer and xenograft using machine learning algorithms |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5867876/ https://www.ncbi.nlm.nih.gov/pubmed/29534550 http://dx.doi.org/10.3390/genes9030155 |
work_keys_str_mv | AT wangdeling identificationofdifferentiallyexpressedgenesbetweenoriginalbreastcancerandxenograftusingmachinelearningalgorithms AT lijiarui identificationofdifferentiallyexpressedgenesbetweenoriginalbreastcancerandxenograftusingmachinelearningalgorithms AT zhangyuhang identificationofdifferentiallyexpressedgenesbetweenoriginalbreastcancerandxenograftusingmachinelearningalgorithms AT chenlei identificationofdifferentiallyexpressedgenesbetweenoriginalbreastcancerandxenograftusingmachinelearningalgorithms AT huangtao identificationofdifferentiallyexpressedgenesbetweenoriginalbreastcancerandxenograftusingmachinelearningalgorithms AT caiyudong identificationofdifferentiallyexpressedgenesbetweenoriginalbreastcancerandxenograftusingmachinelearningalgorithms |