Cargando…

Partial AUC maximization for essential gene prediction using genetic algorithms

Identifying genes indispensable for an organism‘s life and their characteristics is one of the central questions in current biological research, and hence it would be helpful to develop computational approaches towards the prediction of essential genes. The performance of a predictor is usually meas...

Descripción completa

Detalles Bibliográficos
Autores principales: Hwang, Kyu-Baek, Ha, Beom-Yong, Ju, Sanghun, Kim, Sangsoo
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Korean Society for Biochemistry and Molecular Biology 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4133830/
https://www.ncbi.nlm.nih.gov/pubmed/23351383
http://dx.doi.org/10.5483/BMBRep.2013.46.1.159
_version_ 1782330795767103488
author Hwang, Kyu-Baek
Ha, Beom-Yong
Ju, Sanghun
Kim, Sangsoo
author_facet Hwang, Kyu-Baek
Ha, Beom-Yong
Ju, Sanghun
Kim, Sangsoo
author_sort Hwang, Kyu-Baek
collection PubMed
description Identifying genes indispensable for an organism‘s life and their characteristics is one of the central questions in current biological research, and hence it would be helpful to develop computational approaches towards the prediction of essential genes. The performance of a predictor is usually measured by the area under the receiver operating characteristic curve (AUC). We propose a novel method by implementing genetic algorithms to maximize the partial AUC that is restricted to a specific interval of lower false positive rate (FPR), the region relevant to follow-up experimental validation. Our predictor uses various features based on sequence information, proteinprotein interaction network topology, and gene expression profiles. A feature selection wrapper was developed to alleviate the over-fitting problem and to weigh each feature’s relevance to prediction. We evaluated our method using the proteome of budding yeast. Our implementation of genetic algorithms maximizing the partial AUC below 0.05 or 0.10 of FPR outperformed other popular classification methods. [BMB Reports 2013; 46(1): 41-46]
format Online
Article
Text
id pubmed-4133830
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Korean Society for Biochemistry and Molecular Biology
record_format MEDLINE/PubMed
spelling pubmed-41338302014-09-16 Partial AUC maximization for essential gene prediction using genetic algorithms Hwang, Kyu-Baek Ha, Beom-Yong Ju, Sanghun Kim, Sangsoo BMB Rep Research Articles Identifying genes indispensable for an organism‘s life and their characteristics is one of the central questions in current biological research, and hence it would be helpful to develop computational approaches towards the prediction of essential genes. The performance of a predictor is usually measured by the area under the receiver operating characteristic curve (AUC). We propose a novel method by implementing genetic algorithms to maximize the partial AUC that is restricted to a specific interval of lower false positive rate (FPR), the region relevant to follow-up experimental validation. Our predictor uses various features based on sequence information, proteinprotein interaction network topology, and gene expression profiles. A feature selection wrapper was developed to alleviate the over-fitting problem and to weigh each feature’s relevance to prediction. We evaluated our method using the proteome of budding yeast. Our implementation of genetic algorithms maximizing the partial AUC below 0.05 or 0.10 of FPR outperformed other popular classification methods. [BMB Reports 2013; 46(1): 41-46] Korean Society for Biochemistry and Molecular Biology 2013-01 /pmc/articles/PMC4133830/ /pubmed/23351383 http://dx.doi.org/10.5483/BMBRep.2013.46.1.159 Text en Copyright © 2013, Korean Society for Biochemistry and Molecular Biology http://creativecommons.org/licenses/by-nc/3.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Articles
Hwang, Kyu-Baek
Ha, Beom-Yong
Ju, Sanghun
Kim, Sangsoo
Partial AUC maximization for essential gene prediction using genetic algorithms
title Partial AUC maximization for essential gene prediction using genetic algorithms
title_full Partial AUC maximization for essential gene prediction using genetic algorithms
title_fullStr Partial AUC maximization for essential gene prediction using genetic algorithms
title_full_unstemmed Partial AUC maximization for essential gene prediction using genetic algorithms
title_short Partial AUC maximization for essential gene prediction using genetic algorithms
title_sort partial auc maximization for essential gene prediction using genetic algorithms
topic Research Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4133830/
https://www.ncbi.nlm.nih.gov/pubmed/23351383
http://dx.doi.org/10.5483/BMBRep.2013.46.1.159
work_keys_str_mv AT hwangkyubaek partialaucmaximizationforessentialgenepredictionusinggeneticalgorithms
AT habeomyong partialaucmaximizationforessentialgenepredictionusinggeneticalgorithms
AT jusanghun partialaucmaximizationforessentialgenepredictionusinggeneticalgorithms
AT kimsangsoo partialaucmaximizationforessentialgenepredictionusinggeneticalgorithms