Cargando…

Prediction and analysis of essential genes using the enrichments of gene ontology and KEGG pathways

Identifying essential genes in a given organism is important for research on their fundamental roles in organism survival. Furthermore, if possible, uncovering the links between core functions or pathways with these essential genes will further help us obtain deep insight into the key roles of these...

Descripción completa

Detalles Bibliográficos
Autores principales: Chen, Lei, Zhang, Yu-Hang, Wang, ShaoPeng, Zhang, YunHua, Huang, Tao, Cai, Yu-Dong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5584762/
https://www.ncbi.nlm.nih.gov/pubmed/28873455
http://dx.doi.org/10.1371/journal.pone.0184129
_version_ 1783261501905174528
author Chen, Lei
Zhang, Yu-Hang
Wang, ShaoPeng
Zhang, YunHua
Huang, Tao
Cai, Yu-Dong
author_facet Chen, Lei
Zhang, Yu-Hang
Wang, ShaoPeng
Zhang, YunHua
Huang, Tao
Cai, Yu-Dong
author_sort Chen, Lei
collection PubMed
description Identifying essential genes in a given organism is important for research on their fundamental roles in organism survival. Furthermore, if possible, uncovering the links between core functions or pathways with these essential genes will further help us obtain deep insight into the key roles of these genes. In this study, we investigated the essential and non-essential genes reported in a previous study and extracted gene ontology (GO) terms and biological pathways that are important for the determination of essential genes. Through the enrichment theory of GO and KEGG pathways, we encoded each essential/non-essential gene into a vector in which each component represented the relationship between the gene and one GO term or KEGG pathway. To analyze these relationships, the maximum relevance minimum redundancy (mRMR) was adopted. Then, the incremental feature selection (IFS) and support vector machine (SVM) were employed to extract important GO terms and KEGG pathways. A prediction model was built simultaneously using the extracted GO terms and KEGG pathways, which yielded nearly perfect performance, with a Matthews correlation coefficient of 0.951, for distinguishing essential and non-essential genes. To fully investigate the key factors influencing the fundamental roles of essential genes, the 21 most important GO terms and three KEGG pathways were analyzed in detail. In addition, several genes was provided in this study, which were predicted to be essential genes by our prediction model. We suggest that this study provides more functional and pathway information on the essential genes and provides a new way to investigate related problems.
format Online
Article
Text
id pubmed-5584762
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-55847622017-09-15 Prediction and analysis of essential genes using the enrichments of gene ontology and KEGG pathways Chen, Lei Zhang, Yu-Hang Wang, ShaoPeng Zhang, YunHua Huang, Tao Cai, Yu-Dong PLoS One Research Article Identifying essential genes in a given organism is important for research on their fundamental roles in organism survival. Furthermore, if possible, uncovering the links between core functions or pathways with these essential genes will further help us obtain deep insight into the key roles of these genes. In this study, we investigated the essential and non-essential genes reported in a previous study and extracted gene ontology (GO) terms and biological pathways that are important for the determination of essential genes. Through the enrichment theory of GO and KEGG pathways, we encoded each essential/non-essential gene into a vector in which each component represented the relationship between the gene and one GO term or KEGG pathway. To analyze these relationships, the maximum relevance minimum redundancy (mRMR) was adopted. Then, the incremental feature selection (IFS) and support vector machine (SVM) were employed to extract important GO terms and KEGG pathways. A prediction model was built simultaneously using the extracted GO terms and KEGG pathways, which yielded nearly perfect performance, with a Matthews correlation coefficient of 0.951, for distinguishing essential and non-essential genes. To fully investigate the key factors influencing the fundamental roles of essential genes, the 21 most important GO terms and three KEGG pathways were analyzed in detail. In addition, several genes was provided in this study, which were predicted to be essential genes by our prediction model. We suggest that this study provides more functional and pathway information on the essential genes and provides a new way to investigate related problems. Public Library of Science 2017-09-05 /pmc/articles/PMC5584762/ /pubmed/28873455 http://dx.doi.org/10.1371/journal.pone.0184129 Text en © 2017 Chen et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Chen, Lei
Zhang, Yu-Hang
Wang, ShaoPeng
Zhang, YunHua
Huang, Tao
Cai, Yu-Dong
Prediction and analysis of essential genes using the enrichments of gene ontology and KEGG pathways
title Prediction and analysis of essential genes using the enrichments of gene ontology and KEGG pathways
title_full Prediction and analysis of essential genes using the enrichments of gene ontology and KEGG pathways
title_fullStr Prediction and analysis of essential genes using the enrichments of gene ontology and KEGG pathways
title_full_unstemmed Prediction and analysis of essential genes using the enrichments of gene ontology and KEGG pathways
title_short Prediction and analysis of essential genes using the enrichments of gene ontology and KEGG pathways
title_sort prediction and analysis of essential genes using the enrichments of gene ontology and kegg pathways
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5584762/
https://www.ncbi.nlm.nih.gov/pubmed/28873455
http://dx.doi.org/10.1371/journal.pone.0184129
work_keys_str_mv AT chenlei predictionandanalysisofessentialgenesusingtheenrichmentsofgeneontologyandkeggpathways
AT zhangyuhang predictionandanalysisofessentialgenesusingtheenrichmentsofgeneontologyandkeggpathways
AT wangshaopeng predictionandanalysisofessentialgenesusingtheenrichmentsofgeneontologyandkeggpathways
AT zhangyunhua predictionandanalysisofessentialgenesusingtheenrichmentsofgeneontologyandkeggpathways
AT huangtao predictionandanalysisofessentialgenesusingtheenrichmentsofgeneontologyandkeggpathways
AT caiyudong predictionandanalysisofessentialgenesusingtheenrichmentsofgeneontologyandkeggpathways