Cargando…

Identification of Differentially Expressed Genes between Original Breast Cancer and Xenograft Using Machine Learning Algorithms

Breast cancer is one of the most common malignancies in women. Patient-derived tumor xenograft (PDX) model is a cutting-edge approach for drug research on breast cancer. However, PDX still exhibits differences from original human tumors, thereby challenging the molecular understanding of tumorigenes...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Deling, Li, Jia-Rui, Zhang, Yu-Hang, Chen, Lei, Huang, Tao, Cai, Yu-Dong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5867876/
https://www.ncbi.nlm.nih.gov/pubmed/29534550
http://dx.doi.org/10.3390/genes9030155
_version_ 1783309043194920960
author Wang, Deling
Li, Jia-Rui
Zhang, Yu-Hang
Chen, Lei
Huang, Tao
Cai, Yu-Dong
author_facet Wang, Deling
Li, Jia-Rui
Zhang, Yu-Hang
Chen, Lei
Huang, Tao
Cai, Yu-Dong
author_sort Wang, Deling
collection PubMed
description Breast cancer is one of the most common malignancies in women. Patient-derived tumor xenograft (PDX) model is a cutting-edge approach for drug research on breast cancer. However, PDX still exhibits differences from original human tumors, thereby challenging the molecular understanding of tumorigenesis. In particular, gene expression changes after tissues are transplanted from human to mouse model. In this study, we propose a novel computational method by incorporating several machine learning algorithms, including Monte Carlo feature selection (MCFS), random forest (RF), and rough set-based rule learning, to identify genes with significant expression differences between PDX and original human tumors. First, 831 breast tumors, including 657 PDX and 174 human tumors, were collected. Based on MCFS and RF, 32 genes were then identified to be informative for the prediction of PDX and human tumors and can be used to construct a prediction model. The prediction model exhibits a Matthews coefficient correlation value of 0.777. Seven interpretable interactions within the informative gene were detected based on the rough set-based rule learning. Furthermore, the seven interpretable interactions can be well supported by previous experimental studies. Our study not only presents a method for identifying informative genes with differential expression but also provides insights into the mechanism through which gene expression changes after being transplanted from human tumor into mouse model. This work would be helpful for research and drug development for breast cancer.
format Online
Article
Text
id pubmed-5867876
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-58678762018-03-27 Identification of Differentially Expressed Genes between Original Breast Cancer and Xenograft Using Machine Learning Algorithms Wang, Deling Li, Jia-Rui Zhang, Yu-Hang Chen, Lei Huang, Tao Cai, Yu-Dong Genes (Basel) Article Breast cancer is one of the most common malignancies in women. Patient-derived tumor xenograft (PDX) model is a cutting-edge approach for drug research on breast cancer. However, PDX still exhibits differences from original human tumors, thereby challenging the molecular understanding of tumorigenesis. In particular, gene expression changes after tissues are transplanted from human to mouse model. In this study, we propose a novel computational method by incorporating several machine learning algorithms, including Monte Carlo feature selection (MCFS), random forest (RF), and rough set-based rule learning, to identify genes with significant expression differences between PDX and original human tumors. First, 831 breast tumors, including 657 PDX and 174 human tumors, were collected. Based on MCFS and RF, 32 genes were then identified to be informative for the prediction of PDX and human tumors and can be used to construct a prediction model. The prediction model exhibits a Matthews coefficient correlation value of 0.777. Seven interpretable interactions within the informative gene were detected based on the rough set-based rule learning. Furthermore, the seven interpretable interactions can be well supported by previous experimental studies. Our study not only presents a method for identifying informative genes with differential expression but also provides insights into the mechanism through which gene expression changes after being transplanted from human tumor into mouse model. This work would be helpful for research and drug development for breast cancer. MDPI 2018-03-12 /pmc/articles/PMC5867876/ /pubmed/29534550 http://dx.doi.org/10.3390/genes9030155 Text en © 2018 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Wang, Deling
Li, Jia-Rui
Zhang, Yu-Hang
Chen, Lei
Huang, Tao
Cai, Yu-Dong
Identification of Differentially Expressed Genes between Original Breast Cancer and Xenograft Using Machine Learning Algorithms
title Identification of Differentially Expressed Genes between Original Breast Cancer and Xenograft Using Machine Learning Algorithms
title_full Identification of Differentially Expressed Genes between Original Breast Cancer and Xenograft Using Machine Learning Algorithms
title_fullStr Identification of Differentially Expressed Genes between Original Breast Cancer and Xenograft Using Machine Learning Algorithms
title_full_unstemmed Identification of Differentially Expressed Genes between Original Breast Cancer and Xenograft Using Machine Learning Algorithms
title_short Identification of Differentially Expressed Genes between Original Breast Cancer and Xenograft Using Machine Learning Algorithms
title_sort identification of differentially expressed genes between original breast cancer and xenograft using machine learning algorithms
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5867876/
https://www.ncbi.nlm.nih.gov/pubmed/29534550
http://dx.doi.org/10.3390/genes9030155
work_keys_str_mv AT wangdeling identificationofdifferentiallyexpressedgenesbetweenoriginalbreastcancerandxenograftusingmachinelearningalgorithms
AT lijiarui identificationofdifferentiallyexpressedgenesbetweenoriginalbreastcancerandxenograftusingmachinelearningalgorithms
AT zhangyuhang identificationofdifferentiallyexpressedgenesbetweenoriginalbreastcancerandxenograftusingmachinelearningalgorithms
AT chenlei identificationofdifferentiallyexpressedgenesbetweenoriginalbreastcancerandxenograftusingmachinelearningalgorithms
AT huangtao identificationofdifferentiallyexpressedgenesbetweenoriginalbreastcancerandxenograftusingmachinelearningalgorithms
AT caiyudong identificationofdifferentiallyexpressedgenesbetweenoriginalbreastcancerandxenograftusingmachinelearningalgorithms