Cargando…
Divisive hierarchical maximum likelihood clustering
BACKGROUND: Biological data comprises various topologies or a mixture of forms, which makes its analysis extremely complicated. With this data increasing in a daily basis, the design and development of efficient and accurate statistical methods has become absolutely necessary. Specific analyses, suc...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5751574/ https://www.ncbi.nlm.nih.gov/pubmed/29297297 http://dx.doi.org/10.1186/s12859-017-1965-5 |
_version_ | 1783289975192682496 |
---|---|
author | Sharma, Alok López, Yosvany Tsunoda, Tatsuhiko |
author_facet | Sharma, Alok López, Yosvany Tsunoda, Tatsuhiko |
author_sort | Sharma, Alok |
collection | PubMed |
description | BACKGROUND: Biological data comprises various topologies or a mixture of forms, which makes its analysis extremely complicated. With this data increasing in a daily basis, the design and development of efficient and accurate statistical methods has become absolutely necessary. Specific analyses, such as those related to genome-wide association studies and multi-omics information, are often aimed at clustering sub-conditions of cancers and other diseases. Hierarchical clustering methods, which can be categorized into agglomerative and divisive, have been widely used in such situations. However, unlike agglomerative methods divisive clustering approaches have consistently proved to be computationally expensive. RESULTS: The proposed clustering algorithm (DRAGON) was verified on mutation and microarray data, and was gauged against standard clustering methods in the literature. Its validation included synthetic and significant biological data. When validated on mixed-lineage leukemia data, DRAGON achieved the highest clustering accuracy with data of four different dimensions. Consequently, DRAGON outperformed previous methods with 3-,4- and 5-dimensional acute leukemia data. When tested on mutation data, DRAGON achieved the best performance with 2-dimensional information. CONCLUSIONS: This work proposes a computationally efficient divisive hierarchical clustering method, which can compete equally with agglomerative approaches. The proposed method turned out to correctly cluster data with distinct topologies. A MATLAB implementation can be extraced from http://www.riken.jp/en/research/labs/ims/med_sci_math/ or http://www.alok-ai-lab.com ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-017-1965-5) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-5751574 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-57515742018-01-05 Divisive hierarchical maximum likelihood clustering Sharma, Alok López, Yosvany Tsunoda, Tatsuhiko BMC Bioinformatics Research BACKGROUND: Biological data comprises various topologies or a mixture of forms, which makes its analysis extremely complicated. With this data increasing in a daily basis, the design and development of efficient and accurate statistical methods has become absolutely necessary. Specific analyses, such as those related to genome-wide association studies and multi-omics information, are often aimed at clustering sub-conditions of cancers and other diseases. Hierarchical clustering methods, which can be categorized into agglomerative and divisive, have been widely used in such situations. However, unlike agglomerative methods divisive clustering approaches have consistently proved to be computationally expensive. RESULTS: The proposed clustering algorithm (DRAGON) was verified on mutation and microarray data, and was gauged against standard clustering methods in the literature. Its validation included synthetic and significant biological data. When validated on mixed-lineage leukemia data, DRAGON achieved the highest clustering accuracy with data of four different dimensions. Consequently, DRAGON outperformed previous methods with 3-,4- and 5-dimensional acute leukemia data. When tested on mutation data, DRAGON achieved the best performance with 2-dimensional information. CONCLUSIONS: This work proposes a computationally efficient divisive hierarchical clustering method, which can compete equally with agglomerative approaches. The proposed method turned out to correctly cluster data with distinct topologies. A MATLAB implementation can be extraced from http://www.riken.jp/en/research/labs/ims/med_sci_math/ or http://www.alok-ai-lab.com ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-017-1965-5) contains supplementary material, which is available to authorized users. BioMed Central 2017-12-28 /pmc/articles/PMC5751574/ /pubmed/29297297 http://dx.doi.org/10.1186/s12859-017-1965-5 Text en © The Author(s). 2017 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Sharma, Alok López, Yosvany Tsunoda, Tatsuhiko Divisive hierarchical maximum likelihood clustering |
title | Divisive hierarchical maximum likelihood clustering |
title_full | Divisive hierarchical maximum likelihood clustering |
title_fullStr | Divisive hierarchical maximum likelihood clustering |
title_full_unstemmed | Divisive hierarchical maximum likelihood clustering |
title_short | Divisive hierarchical maximum likelihood clustering |
title_sort | divisive hierarchical maximum likelihood clustering |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5751574/ https://www.ncbi.nlm.nih.gov/pubmed/29297297 http://dx.doi.org/10.1186/s12859-017-1965-5 |
work_keys_str_mv | AT sharmaalok divisivehierarchicalmaximumlikelihoodclustering AT lopezyosvany divisivehierarchicalmaximumlikelihoodclustering AT tsunodatatsuhiko divisivehierarchicalmaximumlikelihoodclustering |