Cargando…
A Computational Method for Classifying Different Human Tissues with Quantitatively Tissue-Specific Expressed Genes
Tissue-specific gene expression has long been recognized as a crucial key for understanding tissue development and function. Efforts have been made in the past decade to identify tissue-specific expression profiles, such as the Human Proteome Atlas and FANTOM5. However, these studies mainly focused...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6162521/ https://www.ncbi.nlm.nih.gov/pubmed/30205473 http://dx.doi.org/10.3390/genes9090449 |
_version_ | 1783359166179442688 |
---|---|
author | Li, JiaRui Chen, Lei Zhang, Yu-Hang Kong, XiangYin Huang, Tao Cai, Yu-Dong |
author_facet | Li, JiaRui Chen, Lei Zhang, Yu-Hang Kong, XiangYin Huang, Tao Cai, Yu-Dong |
author_sort | Li, JiaRui |
collection | PubMed |
description | Tissue-specific gene expression has long been recognized as a crucial key for understanding tissue development and function. Efforts have been made in the past decade to identify tissue-specific expression profiles, such as the Human Proteome Atlas and FANTOM5. However, these studies mainly focused on “qualitatively tissue-specific expressed genes” which are highly enriched in one or a group of tissues but paid less attention to “quantitatively tissue-specific expressed genes”, which are expressed in all or most tissues but with differential expression levels. In this study, we applied machine learning algorithms to build a computational method for identifying “quantitatively tissue-specific expressed genes” capable of distinguishing 25 human tissues from their expression patterns. Our results uncovered the expression of 432 genes as optimal features for tissue classification, which were obtained with a Matthews Correlation Coefficient (MCC) of more than 0.99 yielded by a support vector machine (SVM). This constructed model was superior to the SVM model using tissue enriched genes and yielded MCC of 0.985 on an independent test dataset, indicating its good generalization ability. These 432 genes were proven to be widely expressed in multiple tissues and a literature review of the top 23 genes found that most of them support their discriminating powers. As a complement to previous studies, our discovery of these quantitatively tissue-specific genes provides insights into the detailed understanding of tissue development and function. |
format | Online Article Text |
id | pubmed-6162521 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-61625212018-10-10 A Computational Method for Classifying Different Human Tissues with Quantitatively Tissue-Specific Expressed Genes Li, JiaRui Chen, Lei Zhang, Yu-Hang Kong, XiangYin Huang, Tao Cai, Yu-Dong Genes (Basel) Article Tissue-specific gene expression has long been recognized as a crucial key for understanding tissue development and function. Efforts have been made in the past decade to identify tissue-specific expression profiles, such as the Human Proteome Atlas and FANTOM5. However, these studies mainly focused on “qualitatively tissue-specific expressed genes” which are highly enriched in one or a group of tissues but paid less attention to “quantitatively tissue-specific expressed genes”, which are expressed in all or most tissues but with differential expression levels. In this study, we applied machine learning algorithms to build a computational method for identifying “quantitatively tissue-specific expressed genes” capable of distinguishing 25 human tissues from their expression patterns. Our results uncovered the expression of 432 genes as optimal features for tissue classification, which were obtained with a Matthews Correlation Coefficient (MCC) of more than 0.99 yielded by a support vector machine (SVM). This constructed model was superior to the SVM model using tissue enriched genes and yielded MCC of 0.985 on an independent test dataset, indicating its good generalization ability. These 432 genes were proven to be widely expressed in multiple tissues and a literature review of the top 23 genes found that most of them support their discriminating powers. As a complement to previous studies, our discovery of these quantitatively tissue-specific genes provides insights into the detailed understanding of tissue development and function. MDPI 2018-09-07 /pmc/articles/PMC6162521/ /pubmed/30205473 http://dx.doi.org/10.3390/genes9090449 Text en © 2018 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Li, JiaRui Chen, Lei Zhang, Yu-Hang Kong, XiangYin Huang, Tao Cai, Yu-Dong A Computational Method for Classifying Different Human Tissues with Quantitatively Tissue-Specific Expressed Genes |
title | A Computational Method for Classifying Different Human Tissues with Quantitatively Tissue-Specific Expressed Genes |
title_full | A Computational Method for Classifying Different Human Tissues with Quantitatively Tissue-Specific Expressed Genes |
title_fullStr | A Computational Method for Classifying Different Human Tissues with Quantitatively Tissue-Specific Expressed Genes |
title_full_unstemmed | A Computational Method for Classifying Different Human Tissues with Quantitatively Tissue-Specific Expressed Genes |
title_short | A Computational Method for Classifying Different Human Tissues with Quantitatively Tissue-Specific Expressed Genes |
title_sort | computational method for classifying different human tissues with quantitatively tissue-specific expressed genes |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6162521/ https://www.ncbi.nlm.nih.gov/pubmed/30205473 http://dx.doi.org/10.3390/genes9090449 |
work_keys_str_mv | AT lijiarui acomputationalmethodforclassifyingdifferenthumantissueswithquantitativelytissuespecificexpressedgenes AT chenlei acomputationalmethodforclassifyingdifferenthumantissueswithquantitativelytissuespecificexpressedgenes AT zhangyuhang acomputationalmethodforclassifyingdifferenthumantissueswithquantitativelytissuespecificexpressedgenes AT kongxiangyin acomputationalmethodforclassifyingdifferenthumantissueswithquantitativelytissuespecificexpressedgenes AT huangtao acomputationalmethodforclassifyingdifferenthumantissueswithquantitativelytissuespecificexpressedgenes AT caiyudong acomputationalmethodforclassifyingdifferenthumantissueswithquantitativelytissuespecificexpressedgenes AT lijiarui computationalmethodforclassifyingdifferenthumantissueswithquantitativelytissuespecificexpressedgenes AT chenlei computationalmethodforclassifyingdifferenthumantissueswithquantitativelytissuespecificexpressedgenes AT zhangyuhang computationalmethodforclassifyingdifferenthumantissueswithquantitativelytissuespecificexpressedgenes AT kongxiangyin computationalmethodforclassifyingdifferenthumantissueswithquantitativelytissuespecificexpressedgenes AT huangtao computationalmethodforclassifyingdifferenthumantissueswithquantitativelytissuespecificexpressedgenes AT caiyudong computationalmethodforclassifyingdifferenthumantissueswithquantitativelytissuespecificexpressedgenes |