Cargando…

Gene Function Prediction Based on the Gene Ontology Hierarchical Structure

The information of the Gene Ontology annotation is helpful in the explanation of life science phenomena, and can provide great support for the research of the biomedical field. The use of the Gene Ontology is gradually affecting the way people store and understand bioinformatic data. To facilitate t...

Descripción completa

Detalles Bibliográficos
Autores principales: Cheng, Liangxi, Lin, Hongfei, Hu, Yuncui, Wang, Jian, Yang, Zhihao
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4156439/
https://www.ncbi.nlm.nih.gov/pubmed/25192339
http://dx.doi.org/10.1371/journal.pone.0107187
_version_ 1782333743687532544
author Cheng, Liangxi
Lin, Hongfei
Hu, Yuncui
Wang, Jian
Yang, Zhihao
author_facet Cheng, Liangxi
Lin, Hongfei
Hu, Yuncui
Wang, Jian
Yang, Zhihao
author_sort Cheng, Liangxi
collection PubMed
description The information of the Gene Ontology annotation is helpful in the explanation of life science phenomena, and can provide great support for the research of the biomedical field. The use of the Gene Ontology is gradually affecting the way people store and understand bioinformatic data. To facilitate the prediction of gene functions with the aid of text mining methods and existing resources, we transform it into a multi-label top-down classification problem and develop a method that uses the hierarchical relationships in the Gene Ontology structure to relieve the quantitative imbalance of positive and negative training samples. Meanwhile the method enhances the discriminating ability of classifiers by retaining and highlighting the key training samples. Additionally, the top-down classifier based on a tree structure takes the relationship of target classes into consideration and thus solves the incompatibility between the classification results and the Gene Ontology structure. Our experiment on the Gene Ontology annotation corpus achieves an F-value performance of 50.7% (precision: 52.7% recall: 48.9%). The experimental results demonstrate that when the size of training set is small, it can be expanded via topological propagation of associated documents between the parent and child nodes in the tree structure. The top-down classification model applies to the set of texts in an ontology structure or with a hierarchical relationship.
format Online
Article
Text
id pubmed-4156439
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-41564392014-09-09 Gene Function Prediction Based on the Gene Ontology Hierarchical Structure Cheng, Liangxi Lin, Hongfei Hu, Yuncui Wang, Jian Yang, Zhihao PLoS One Research Article The information of the Gene Ontology annotation is helpful in the explanation of life science phenomena, and can provide great support for the research of the biomedical field. The use of the Gene Ontology is gradually affecting the way people store and understand bioinformatic data. To facilitate the prediction of gene functions with the aid of text mining methods and existing resources, we transform it into a multi-label top-down classification problem and develop a method that uses the hierarchical relationships in the Gene Ontology structure to relieve the quantitative imbalance of positive and negative training samples. Meanwhile the method enhances the discriminating ability of classifiers by retaining and highlighting the key training samples. Additionally, the top-down classifier based on a tree structure takes the relationship of target classes into consideration and thus solves the incompatibility between the classification results and the Gene Ontology structure. Our experiment on the Gene Ontology annotation corpus achieves an F-value performance of 50.7% (precision: 52.7% recall: 48.9%). The experimental results demonstrate that when the size of training set is small, it can be expanded via topological propagation of associated documents between the parent and child nodes in the tree structure. The top-down classification model applies to the set of texts in an ontology structure or with a hierarchical relationship. Public Library of Science 2014-09-05 /pmc/articles/PMC4156439/ /pubmed/25192339 http://dx.doi.org/10.1371/journal.pone.0107187 Text en © 2014 Cheng et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Cheng, Liangxi
Lin, Hongfei
Hu, Yuncui
Wang, Jian
Yang, Zhihao
Gene Function Prediction Based on the Gene Ontology Hierarchical Structure
title Gene Function Prediction Based on the Gene Ontology Hierarchical Structure
title_full Gene Function Prediction Based on the Gene Ontology Hierarchical Structure
title_fullStr Gene Function Prediction Based on the Gene Ontology Hierarchical Structure
title_full_unstemmed Gene Function Prediction Based on the Gene Ontology Hierarchical Structure
title_short Gene Function Prediction Based on the Gene Ontology Hierarchical Structure
title_sort gene function prediction based on the gene ontology hierarchical structure
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4156439/
https://www.ncbi.nlm.nih.gov/pubmed/25192339
http://dx.doi.org/10.1371/journal.pone.0107187
work_keys_str_mv AT chengliangxi genefunctionpredictionbasedonthegeneontologyhierarchicalstructure
AT linhongfei genefunctionpredictionbasedonthegeneontologyhierarchicalstructure
AT huyuncui genefunctionpredictionbasedonthegeneontologyhierarchicalstructure
AT wangjian genefunctionpredictionbasedonthegeneontologyhierarchicalstructure
AT yangzhihao genefunctionpredictionbasedonthegeneontologyhierarchicalstructure