Cargando…

Gene function prediction based on combining gene ontology hierarchy with multi-instance multi-label learning

Gene function annotation is the main challenge in the post genome era, which is an important part of the genome annotation. The sequencing of the human genome project produces a whole genome data, providing abundant biological information for the study of gene function annotation. However, to obtain...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Zejun, Liao, Bo, Li, Yun, Liu, Wenhua, Chen, Min, Cai, Lijun
Formato: Online Artículo Texto
Lenguaje:English
Publicado: The Royal Society of Chemistry 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9083914/
https://www.ncbi.nlm.nih.gov/pubmed/35542493
http://dx.doi.org/10.1039/c8ra05122d
Descripción
Sumario:Gene function annotation is the main challenge in the post genome era, which is an important part of the genome annotation. The sequencing of the human genome project produces a whole genome data, providing abundant biological information for the study of gene function annotation. However, to obtain useful knowledge from a large amount of data, a potential strategy is to apply machine learning methods to mine these data and predict gene function. In this study, we improved multi-instance hierarchical clustering by using gene ontology hierarchy to annotate gene function, which combines gene ontology hierarchy with multi-instance multi-label learning frame structure. Then, we used multi-label support vector machine (MLSVM) and multi-label k-nearest neighbor (MLKNN) algorithm to predict the function of gene. Finally, we verified our method in four yeast expression datasets. The performance of the simulated experiments proved that our method is efficient.