Cargando…

Virtual screening by a new Clustering-based Weighted Similarity Extreme Learning Machine approach

Machine learning techniques are becoming popular in virtual screening tasks. One of the powerful machine learning algorithms is Extreme Learning Machine (ELM) which has been applied to many applications and has recently been applied to virtual screening. We propose the Weighted Similarity ELM (WS-EL...

Descripción completa

Detalles Bibliográficos
Autores principales: Pasupa, Kitsuchart, Kudisthalert, Wasu
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5898726/
https://www.ncbi.nlm.nih.gov/pubmed/29652912
http://dx.doi.org/10.1371/journal.pone.0195478
_version_ 1783314178079981568
author Pasupa, Kitsuchart
Kudisthalert, Wasu
author_facet Pasupa, Kitsuchart
Kudisthalert, Wasu
author_sort Pasupa, Kitsuchart
collection PubMed
description Machine learning techniques are becoming popular in virtual screening tasks. One of the powerful machine learning algorithms is Extreme Learning Machine (ELM) which has been applied to many applications and has recently been applied to virtual screening. We propose the Weighted Similarity ELM (WS-ELM) which is based on a single layer feed-forward neural network in a conjunction of 16 different similarity coefficients as activation function in the hidden layer. It is known that the performance of conventional ELM is not robust due to random weight selection in the hidden layer. Thus, we propose a Clustering-based WS-ELM (CWS-ELM) that deterministically assigns weights by utilising clustering algorithms i.e. k-means clustering and support vector clustering. The experiments were conducted on one of the most challenging datasets–Maximum Unbiased Validation Dataset–which contains 17 activity classes carefully selected from PubChem. The proposed algorithms were then compared with other machine learning techniques such as support vector machine, random forest, and similarity searching. The results show that CWS-ELM in conjunction with support vector clustering yields the best performance when utilised together with Sokal/Sneath(1) coefficient. Furthermore, ECFP_6 fingerprint presents the best results in our framework compared to the other types of fingerprints, namely ECFP_4, FCFP_4, and FCFP_6.
format Online
Article
Text
id pubmed-5898726
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-58987262018-04-27 Virtual screening by a new Clustering-based Weighted Similarity Extreme Learning Machine approach Pasupa, Kitsuchart Kudisthalert, Wasu PLoS One Research Article Machine learning techniques are becoming popular in virtual screening tasks. One of the powerful machine learning algorithms is Extreme Learning Machine (ELM) which has been applied to many applications and has recently been applied to virtual screening. We propose the Weighted Similarity ELM (WS-ELM) which is based on a single layer feed-forward neural network in a conjunction of 16 different similarity coefficients as activation function in the hidden layer. It is known that the performance of conventional ELM is not robust due to random weight selection in the hidden layer. Thus, we propose a Clustering-based WS-ELM (CWS-ELM) that deterministically assigns weights by utilising clustering algorithms i.e. k-means clustering and support vector clustering. The experiments were conducted on one of the most challenging datasets–Maximum Unbiased Validation Dataset–which contains 17 activity classes carefully selected from PubChem. The proposed algorithms were then compared with other machine learning techniques such as support vector machine, random forest, and similarity searching. The results show that CWS-ELM in conjunction with support vector clustering yields the best performance when utilised together with Sokal/Sneath(1) coefficient. Furthermore, ECFP_6 fingerprint presents the best results in our framework compared to the other types of fingerprints, namely ECFP_4, FCFP_4, and FCFP_6. Public Library of Science 2018-04-13 /pmc/articles/PMC5898726/ /pubmed/29652912 http://dx.doi.org/10.1371/journal.pone.0195478 Text en © 2018 Pasupa, Kudisthalert http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Pasupa, Kitsuchart
Kudisthalert, Wasu
Virtual screening by a new Clustering-based Weighted Similarity Extreme Learning Machine approach
title Virtual screening by a new Clustering-based Weighted Similarity Extreme Learning Machine approach
title_full Virtual screening by a new Clustering-based Weighted Similarity Extreme Learning Machine approach
title_fullStr Virtual screening by a new Clustering-based Weighted Similarity Extreme Learning Machine approach
title_full_unstemmed Virtual screening by a new Clustering-based Weighted Similarity Extreme Learning Machine approach
title_short Virtual screening by a new Clustering-based Weighted Similarity Extreme Learning Machine approach
title_sort virtual screening by a new clustering-based weighted similarity extreme learning machine approach
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5898726/
https://www.ncbi.nlm.nih.gov/pubmed/29652912
http://dx.doi.org/10.1371/journal.pone.0195478
work_keys_str_mv AT pasupakitsuchart virtualscreeningbyanewclusteringbasedweightedsimilarityextremelearningmachineapproach
AT kudisthalertwasu virtualscreeningbyanewclusteringbasedweightedsimilarityextremelearningmachineapproach