Cargando…

A Computational Method for Classifying Different Human Tissues with Quantitatively Tissue-Specific Expressed Genes

Tissue-specific gene expression has long been recognized as a crucial key for understanding tissue development and function. Efforts have been made in the past decade to identify tissue-specific expression profiles, such as the Human Proteome Atlas and FANTOM5. However, these studies mainly focused...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, JiaRui, Chen, Lei, Zhang, Yu-Hang, Kong, XiangYin, Huang, Tao, Cai, Yu-Dong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6162521/
https://www.ncbi.nlm.nih.gov/pubmed/30205473
http://dx.doi.org/10.3390/genes9090449
_version_ 1783359166179442688
author Li, JiaRui
Chen, Lei
Zhang, Yu-Hang
Kong, XiangYin
Huang, Tao
Cai, Yu-Dong
author_facet Li, JiaRui
Chen, Lei
Zhang, Yu-Hang
Kong, XiangYin
Huang, Tao
Cai, Yu-Dong
author_sort Li, JiaRui
collection PubMed
description Tissue-specific gene expression has long been recognized as a crucial key for understanding tissue development and function. Efforts have been made in the past decade to identify tissue-specific expression profiles, such as the Human Proteome Atlas and FANTOM5. However, these studies mainly focused on “qualitatively tissue-specific expressed genes” which are highly enriched in one or a group of tissues but paid less attention to “quantitatively tissue-specific expressed genes”, which are expressed in all or most tissues but with differential expression levels. In this study, we applied machine learning algorithms to build a computational method for identifying “quantitatively tissue-specific expressed genes” capable of distinguishing 25 human tissues from their expression patterns. Our results uncovered the expression of 432 genes as optimal features for tissue classification, which were obtained with a Matthews Correlation Coefficient (MCC) of more than 0.99 yielded by a support vector machine (SVM). This constructed model was superior to the SVM model using tissue enriched genes and yielded MCC of 0.985 on an independent test dataset, indicating its good generalization ability. These 432 genes were proven to be widely expressed in multiple tissues and a literature review of the top 23 genes found that most of them support their discriminating powers. As a complement to previous studies, our discovery of these quantitatively tissue-specific genes provides insights into the detailed understanding of tissue development and function.
format Online
Article
Text
id pubmed-6162521
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-61625212018-10-10 A Computational Method for Classifying Different Human Tissues with Quantitatively Tissue-Specific Expressed Genes Li, JiaRui Chen, Lei Zhang, Yu-Hang Kong, XiangYin Huang, Tao Cai, Yu-Dong Genes (Basel) Article Tissue-specific gene expression has long been recognized as a crucial key for understanding tissue development and function. Efforts have been made in the past decade to identify tissue-specific expression profiles, such as the Human Proteome Atlas and FANTOM5. However, these studies mainly focused on “qualitatively tissue-specific expressed genes” which are highly enriched in one or a group of tissues but paid less attention to “quantitatively tissue-specific expressed genes”, which are expressed in all or most tissues but with differential expression levels. In this study, we applied machine learning algorithms to build a computational method for identifying “quantitatively tissue-specific expressed genes” capable of distinguishing 25 human tissues from their expression patterns. Our results uncovered the expression of 432 genes as optimal features for tissue classification, which were obtained with a Matthews Correlation Coefficient (MCC) of more than 0.99 yielded by a support vector machine (SVM). This constructed model was superior to the SVM model using tissue enriched genes and yielded MCC of 0.985 on an independent test dataset, indicating its good generalization ability. These 432 genes were proven to be widely expressed in multiple tissues and a literature review of the top 23 genes found that most of them support their discriminating powers. As a complement to previous studies, our discovery of these quantitatively tissue-specific genes provides insights into the detailed understanding of tissue development and function. MDPI 2018-09-07 /pmc/articles/PMC6162521/ /pubmed/30205473 http://dx.doi.org/10.3390/genes9090449 Text en © 2018 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Li, JiaRui
Chen, Lei
Zhang, Yu-Hang
Kong, XiangYin
Huang, Tao
Cai, Yu-Dong
A Computational Method for Classifying Different Human Tissues with Quantitatively Tissue-Specific Expressed Genes
title A Computational Method for Classifying Different Human Tissues with Quantitatively Tissue-Specific Expressed Genes
title_full A Computational Method for Classifying Different Human Tissues with Quantitatively Tissue-Specific Expressed Genes
title_fullStr A Computational Method for Classifying Different Human Tissues with Quantitatively Tissue-Specific Expressed Genes
title_full_unstemmed A Computational Method for Classifying Different Human Tissues with Quantitatively Tissue-Specific Expressed Genes
title_short A Computational Method for Classifying Different Human Tissues with Quantitatively Tissue-Specific Expressed Genes
title_sort computational method for classifying different human tissues with quantitatively tissue-specific expressed genes
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6162521/
https://www.ncbi.nlm.nih.gov/pubmed/30205473
http://dx.doi.org/10.3390/genes9090449
work_keys_str_mv AT lijiarui acomputationalmethodforclassifyingdifferenthumantissueswithquantitativelytissuespecificexpressedgenes
AT chenlei acomputationalmethodforclassifyingdifferenthumantissueswithquantitativelytissuespecificexpressedgenes
AT zhangyuhang acomputationalmethodforclassifyingdifferenthumantissueswithquantitativelytissuespecificexpressedgenes
AT kongxiangyin acomputationalmethodforclassifyingdifferenthumantissueswithquantitativelytissuespecificexpressedgenes
AT huangtao acomputationalmethodforclassifyingdifferenthumantissueswithquantitativelytissuespecificexpressedgenes
AT caiyudong acomputationalmethodforclassifyingdifferenthumantissueswithquantitativelytissuespecificexpressedgenes
AT lijiarui computationalmethodforclassifyingdifferenthumantissueswithquantitativelytissuespecificexpressedgenes
AT chenlei computationalmethodforclassifyingdifferenthumantissueswithquantitativelytissuespecificexpressedgenes
AT zhangyuhang computationalmethodforclassifyingdifferenthumantissueswithquantitativelytissuespecificexpressedgenes
AT kongxiangyin computationalmethodforclassifyingdifferenthumantissueswithquantitativelytissuespecificexpressedgenes
AT huangtao computationalmethodforclassifyingdifferenthumantissueswithquantitativelytissuespecificexpressedgenes
AT caiyudong computationalmethodforclassifyingdifferenthumantissueswithquantitativelytissuespecificexpressedgenes