Voting-based consensus clustering for combining multiple clusterings of chemical structures
BACKGROUND: Although many consensus clustering methods have been successfully used for combining multiple classifiers in many areas such as machine learning, applied statistics, pattern recognition and bioinformatics, few consensus clustering methods have been applied for combining multiple clusteri...
Main Authors: | Saeed, Faisal; Salim, Naomie; Abdo, Ammar |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | BioMed Central 2012 |
Subjects: | |
Online Access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3541359/ https://www.ncbi.nlm.nih.gov/pubmed/23244782 http://dx.doi.org/10.1186/1758-2946-4-37 |
_version_ | 1782255349639675904 |
---|---|
author | Saeed, Faisal Salim, Naomie Abdo, Ammar |
author_facet | Saeed, Faisal Salim, Naomie Abdo, Ammar |
author_sort | Saeed, Faisal |
collection | PubMed |
description | BACKGROUND: Although many consensus clustering methods have been successfully used for combining multiple classifiers in areas such as machine learning, applied statistics, pattern recognition and bioinformatics, few have been applied to combining multiple clusterings of chemical structures. It is known that no individual clustering method always gives the best results for all types of applications. In this paper, therefore, three voting- and graph-based consensus clustering methods were used to combine multiple clusterings of chemical structures, with the aim of better separating biologically active molecules from inactive ones in each cluster. RESULTS: The cumulative voting-based aggregation algorithm (CVAA), cluster-based similarity partitioning algorithm (CSPA) and hyper-graph partitioning algorithm (HGPA) were examined. The F-measure and the Quality Partition Index (QPI) were used to evaluate the clusterings, and the results were compared with those of Ward’s clustering method. The MDL Drug Data Report (MDDR) dataset was used for the experiments and was represented by two 2D fingerprints, ALOGP and ECFP_4. The voting-based consensus clustering method outperformed Ward’s method on both the F-measure and QPI for both the ALOGP and ECFP_4 fingerprints, while the graph-based consensus clustering methods outperformed Ward’s method only for ALOGP using QPI. The Jaccard and Euclidean distance measures were the methods of choice for generating the ensembles, as they gave the highest values for both criteria. CONCLUSIONS: The experimental results show that consensus clustering methods can improve the effectiveness of chemical structure clustering. The cumulative voting-based aggregation algorithm (CVAA) was the method of choice among the consensus clustering methods. |
format | Online Article Text |
id | pubmed-3541359 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2012 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-3541359 2013-01-11 Voting-based consensus clustering for combining multiple clusterings of chemical structures Saeed, Faisal Salim, Naomie Abdo, Ammar J Cheminform Research Article BioMed Central 2012-12-17 /pmc/articles/PMC3541359/ /pubmed/23244782 http://dx.doi.org/10.1186/1758-2946-4-37 Text en Copyright ©2012 Saeed et al.; licensee Chemistry Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Saeed, Faisal Salim, Naomie Abdo, Ammar Voting-based consensus clustering for combining multiple clusterings of chemical structures |
title | Voting-based consensus clustering for combining multiple clusterings of chemical structures |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3541359/ https://www.ncbi.nlm.nih.gov/pubmed/23244782 http://dx.doi.org/10.1186/1758-2946-4-37 |
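
The description above names the cluster-based similarity partitioning algorithm (CSPA) as one of the consensus methods examined. As a rough illustration of the idea behind it, the sketch below builds a co-association matrix (the fraction of input clusterings in which two objects share a cluster) and then groups objects from that matrix. This is not the authors' implementation: CSPA as usually described partitions the similarity graph with a graph partitioner, whereas this sketch substitutes a plain threshold plus connected components. The function names, the 0.5 threshold and the toy partitions are all assumptions made for illustration.

```python
import numpy as np


def co_association(partitions):
    """Return an (n, n) matrix: fraction of partitions in which i and j share a cluster."""
    labels = np.asarray(partitions)  # shape: (n_partitions, n_objects)
    same = labels[:, :, None] == labels[:, None, :]
    return same.mean(axis=0)


def consensus_labels(similarity, threshold=0.5):
    """Label connected components of the graph linking pairs with similarity > threshold.

    Assumption: this simple step stands in for the graph partitioning used by CSPA.
    """
    n = similarity.shape[0]
    labels = np.full(n, -1)
    current = 0
    for start in range(n):
        if labels[start] != -1:
            continue  # already assigned to a component
        labels[start] = current
        stack = [start]
        while stack:
            i = stack.pop()
            for j in np.flatnonzero(similarity[i] > threshold):
                if labels[j] == -1:
                    labels[j] = current
                    stack.append(j)
        current += 1
    return labels


# Hypothetical ensemble: three clusterings of six molecules (cluster ids are arbitrary).
partitions = [
    [0, 0, 1, 1, 2, 2],
    [1, 1, 0, 0, 2, 2],
    [0, 0, 0, 1, 2, 2],
]
S = co_association(partitions)
consensus = consensus_labels(S)
print(consensus)
```

Because the co-association matrix depends only on whether two objects share a cluster, the arbitrary numbering of clusters in each input clustering does not matter, which is why the first two toy partitions count as agreeing.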