Cargando…

Divisive hierarchical maximum likelihood clustering

BACKGROUND: Biological data comprises various topologies or a mixture of forms, which makes its analysis extremely complicated. With this data increasing in a daily basis, the design and development of efficient and accurate statistical methods has become absolutely necessary. Specific analyses, suc...

Descripción completa

Detalles Bibliográficos
Autores principales: Sharma, Alok, López, Yosvany, Tsunoda, Tatsuhiko
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5751574/
https://www.ncbi.nlm.nih.gov/pubmed/29297297
http://dx.doi.org/10.1186/s12859-017-1965-5
_version_ 1783289975192682496
author Sharma, Alok
López, Yosvany
Tsunoda, Tatsuhiko
author_facet Sharma, Alok
López, Yosvany
Tsunoda, Tatsuhiko
author_sort Sharma, Alok
collection PubMed
description BACKGROUND: Biological data comprises various topologies or a mixture of forms, which makes its analysis extremely complicated. With this data increasing in a daily basis, the design and development of efficient and accurate statistical methods has become absolutely necessary. Specific analyses, such as those related to genome-wide association studies and multi-omics information, are often aimed at clustering sub-conditions of cancers and other diseases. Hierarchical clustering methods, which can be categorized into agglomerative and divisive, have been widely used in such situations. However, unlike agglomerative methods divisive clustering approaches have consistently proved to be computationally expensive. RESULTS: The proposed clustering algorithm (DRAGON) was verified on mutation and microarray data, and was gauged against standard clustering methods in the literature. Its validation included synthetic and significant biological data. When validated on mixed-lineage leukemia data, DRAGON achieved the highest clustering accuracy with data of four different dimensions. Consequently, DRAGON outperformed previous methods with 3-,4- and 5-dimensional acute leukemia data. When tested on mutation data, DRAGON achieved the best performance with 2-dimensional information. CONCLUSIONS: This work proposes a computationally efficient divisive hierarchical clustering method, which can compete equally with agglomerative approaches. The proposed method turned out to correctly cluster data with distinct topologies. A MATLAB implementation can be extraced from http://www.riken.jp/en/research/labs/ims/med_sci_math/ or http://www.alok-ai-lab.com ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-017-1965-5) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-5751574
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-57515742018-01-05 Divisive hierarchical maximum likelihood clustering Sharma, Alok López, Yosvany Tsunoda, Tatsuhiko BMC Bioinformatics Research BACKGROUND: Biological data comprises various topologies or a mixture of forms, which makes its analysis extremely complicated. With this data increasing in a daily basis, the design and development of efficient and accurate statistical methods has become absolutely necessary. Specific analyses, such as those related to genome-wide association studies and multi-omics information, are often aimed at clustering sub-conditions of cancers and other diseases. Hierarchical clustering methods, which can be categorized into agglomerative and divisive, have been widely used in such situations. However, unlike agglomerative methods divisive clustering approaches have consistently proved to be computationally expensive. RESULTS: The proposed clustering algorithm (DRAGON) was verified on mutation and microarray data, and was gauged against standard clustering methods in the literature. Its validation included synthetic and significant biological data. When validated on mixed-lineage leukemia data, DRAGON achieved the highest clustering accuracy with data of four different dimensions. Consequently, DRAGON outperformed previous methods with 3-,4- and 5-dimensional acute leukemia data. When tested on mutation data, DRAGON achieved the best performance with 2-dimensional information. CONCLUSIONS: This work proposes a computationally efficient divisive hierarchical clustering method, which can compete equally with agglomerative approaches. The proposed method turned out to correctly cluster data with distinct topologies. A MATLAB implementation can be extraced from http://www.riken.jp/en/research/labs/ims/med_sci_math/ or http://www.alok-ai-lab.com ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-017-1965-5) contains supplementary material, which is available to authorized users. BioMed Central 2017-12-28 /pmc/articles/PMC5751574/ /pubmed/29297297 http://dx.doi.org/10.1186/s12859-017-1965-5 Text en © The Author(s). 2017 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Sharma, Alok
López, Yosvany
Tsunoda, Tatsuhiko
Divisive hierarchical maximum likelihood clustering
title Divisive hierarchical maximum likelihood clustering
title_full Divisive hierarchical maximum likelihood clustering
title_fullStr Divisive hierarchical maximum likelihood clustering
title_full_unstemmed Divisive hierarchical maximum likelihood clustering
title_short Divisive hierarchical maximum likelihood clustering
title_sort divisive hierarchical maximum likelihood clustering
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5751574/
https://www.ncbi.nlm.nih.gov/pubmed/29297297
http://dx.doi.org/10.1186/s12859-017-1965-5
work_keys_str_mv AT sharmaalok divisivehierarchicalmaximumlikelihoodclustering
AT lopezyosvany divisivehierarchicalmaximumlikelihoodclustering
AT tsunodatatsuhiko divisivehierarchicalmaximumlikelihoodclustering