Cargando…

Drug-target interaction prediction via class imbalance-aware ensemble learning

BACKGROUND: Multiple computational methods for predicting drug-target interactions have been developed to facilitate the drug discovery process. These methods use available data on known drug-target interactions to train classifiers with the purpose of predicting new undiscovered interactions. Howev...

Descripción completa

Detalles Bibliográficos
Autores principales:	Ezzat, Ali, Wu, Min, Li, Xiao-Li, Kwoh, Chee-Keong
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	BioMed Central 2016
Materias:	Research
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5259867/ https://www.ncbi.nlm.nih.gov/pubmed/28155697 http://dx.doi.org/10.1186/s12859-016-1377-y

_version_	1782499290676985856
author	Ezzat, Ali Wu, Min Li, Xiao-Li Kwoh, Chee-Keong
author_facet	Ezzat, Ali Wu, Min Li, Xiao-Li Kwoh, Chee-Keong
author_sort	Ezzat, Ali
collection	PubMed
description	BACKGROUND: Multiple computational methods for predicting drug-target interactions have been developed to facilitate the drug discovery process. These methods use available data on known drug-target interactions to train classifiers with the purpose of predicting new undiscovered interactions. However, a key challenge regarding this data that has not yet been addressed by these methods, namely class imbalance, is potentially degrading the prediction performance. Class imbalance can be divided into two sub-problems. Firstly, the number of known interacting drug-target pairs is much smaller than that of non-interacting drug-target pairs. This imbalance ratio between interacting and non-interacting drug-target pairs is referred to as the between-class imbalance. Between-class imbalance degrades prediction performance due to the bias in prediction results towards the majority class (i.e. the non-interacting pairs), leading to more prediction errors in the minority class (i.e. the interacting pairs). Secondly, there are multiple types of drug-target interactions in the data with some types having relatively fewer members (or are less represented) than others. This variation in representation of the different interaction types leads to another kind of imbalance referred to as the within-class imbalance. In within-class imbalance, prediction results are biased towards the better represented interaction types, leading to more prediction errors in the less represented interaction types. RESULTS: We propose an ensemble learning method that incorporates techniques to address the issues of between-class imbalance and within-class imbalance. Experiments show that the proposed method improves results over 4 state-of-the-art methods. In addition, we simulated cases for new drugs and targets to see how our method would perform in predicting their interactions. New drugs and targets are those for which no prior interactions are known. Our method displayed satisfactory prediction performance and was able to predict many of the interactions successfully. CONCLUSIONS: Our proposed method has improved the prediction performance over the existing work, thus proving the importance of addressing problems pertaining to class imbalance in the data. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-016-1377-y) contains supplementary material, which is available to authorized users.
format	Online Article Text
id	pubmed-5259867
institution	National Center for Biotechnology Information
language	English
publishDate	2016
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-52598672017-01-26 Drug-target interaction prediction via class imbalance-aware ensemble learning Ezzat, Ali Wu, Min Li, Xiao-Li Kwoh, Chee-Keong BMC Bioinformatics Research BACKGROUND: Multiple computational methods for predicting drug-target interactions have been developed to facilitate the drug discovery process. These methods use available data on known drug-target interactions to train classifiers with the purpose of predicting new undiscovered interactions. However, a key challenge regarding this data that has not yet been addressed by these methods, namely class imbalance, is potentially degrading the prediction performance. Class imbalance can be divided into two sub-problems. Firstly, the number of known interacting drug-target pairs is much smaller than that of non-interacting drug-target pairs. This imbalance ratio between interacting and non-interacting drug-target pairs is referred to as the between-class imbalance. Between-class imbalance degrades prediction performance due to the bias in prediction results towards the majority class (i.e. the non-interacting pairs), leading to more prediction errors in the minority class (i.e. the interacting pairs). Secondly, there are multiple types of drug-target interactions in the data with some types having relatively fewer members (or are less represented) than others. This variation in representation of the different interaction types leads to another kind of imbalance referred to as the within-class imbalance. In within-class imbalance, prediction results are biased towards the better represented interaction types, leading to more prediction errors in the less represented interaction types. RESULTS: We propose an ensemble learning method that incorporates techniques to address the issues of between-class imbalance and within-class imbalance. Experiments show that the proposed method improves results over 4 state-of-the-art methods. In addition, we simulated cases for new drugs and targets to see how our method would perform in predicting their interactions. New drugs and targets are those for which no prior interactions are known. Our method displayed satisfactory prediction performance and was able to predict many of the interactions successfully. CONCLUSIONS: Our proposed method has improved the prediction performance over the existing work, thus proving the importance of addressing problems pertaining to class imbalance in the data. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-016-1377-y) contains supplementary material, which is available to authorized users. BioMed Central 2016-12-22 /pmc/articles/PMC5259867/ /pubmed/28155697 http://dx.doi.org/10.1186/s12859-016-1377-y Text en © The Author(s) 2016 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle	Research Ezzat, Ali Wu, Min Li, Xiao-Li Kwoh, Chee-Keong Drug-target interaction prediction via class imbalance-aware ensemble learning
title	Drug-target interaction prediction via class imbalance-aware ensemble learning
title_full	Drug-target interaction prediction via class imbalance-aware ensemble learning
title_fullStr	Drug-target interaction prediction via class imbalance-aware ensemble learning
title_full_unstemmed	Drug-target interaction prediction via class imbalance-aware ensemble learning
title_short	Drug-target interaction prediction via class imbalance-aware ensemble learning
title_sort	drug-target interaction prediction via class imbalance-aware ensemble learning
topic	Research
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5259867/ https://www.ncbi.nlm.nih.gov/pubmed/28155697 http://dx.doi.org/10.1186/s12859-016-1377-y
work_keys_str_mv	AT ezzatali drugtargetinteractionpredictionviaclassimbalanceawareensemblelearning AT wumin drugtargetinteractionpredictionviaclassimbalanceawareensemblelearning AT lixiaoli drugtargetinteractionpredictionviaclassimbalanceawareensemblelearning AT kwohcheekeong drugtargetinteractionpredictionviaclassimbalanceawareensemblelearning

Drug-target interaction prediction via class imbalance-aware ensemble learning

Ejemplares similares