Cargando…

Fuzzy clustering of physicochemical and biochemical properties of amino Acids

In this article, we categorize presently available experimental and theoretical knowledge of various physicochemical and biochemical features of amino acids, as collected in the AAindex database of known 544 amino acid (AA) indices. Previously reported 402 indices were categorized into six groups us...

Descripción completa

Detalles Bibliográficos
Autores principales: Saha, Indrajit, Maulik, Ujjwal, Bandyopadhyay, Sanghamitra, Plewczynski, Dariusz
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer Vienna 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3397137/
https://www.ncbi.nlm.nih.gov/pubmed/21993537
http://dx.doi.org/10.1007/s00726-011-1106-9
_version_ 1782238153820602368
author Saha, Indrajit
Maulik, Ujjwal
Bandyopadhyay, Sanghamitra
Plewczynski, Dariusz
author_facet Saha, Indrajit
Maulik, Ujjwal
Bandyopadhyay, Sanghamitra
Plewczynski, Dariusz
author_sort Saha, Indrajit
collection PubMed
description In this article, we categorize presently available experimental and theoretical knowledge of various physicochemical and biochemical features of amino acids, as collected in the AAindex database of known 544 amino acid (AA) indices. Previously reported 402 indices were categorized into six groups using hierarchical clustering technique and 142 were left unclustered. However, due to the increasing diversity of the database these indices are overlapping, therefore crisp clustering method may not provide optimal results. Moreover, in various large-scale bioinformatics analyses of whole proteomes, the proper selection of amino acid indices representing their biological significance is crucial for efficient and error-prone encoding of the short functional sequence motifs. In most cases, researchers perform exhaustive manual selection of the most informative indices. These two facts motivated us to analyse the widely used AA indices. The main goal of this article is twofold. First, we present a novel method of partitioning the bioinformatics data using consensus fuzzy clustering, where the recently proposed fuzzy clustering techniques are exploited. Second, we prepare three high quality subsets of all available indices. Superiority of the consensus fuzzy clustering method is demonstrated quantitatively, visually and statistically by comparing it with the previously proposed hierarchical clustered results. The processed AAindex1 database, supplementary material and the software are available at http://sysbio.icm.edu.pl/aaindex/.
format Online
Article
Text
id pubmed-3397137
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher Springer Vienna
record_format MEDLINE/PubMed
spelling pubmed-33971372012-07-19 Fuzzy clustering of physicochemical and biochemical properties of amino Acids Saha, Indrajit Maulik, Ujjwal Bandyopadhyay, Sanghamitra Plewczynski, Dariusz Amino Acids Original Article In this article, we categorize presently available experimental and theoretical knowledge of various physicochemical and biochemical features of amino acids, as collected in the AAindex database of known 544 amino acid (AA) indices. Previously reported 402 indices were categorized into six groups using hierarchical clustering technique and 142 were left unclustered. However, due to the increasing diversity of the database these indices are overlapping, therefore crisp clustering method may not provide optimal results. Moreover, in various large-scale bioinformatics analyses of whole proteomes, the proper selection of amino acid indices representing their biological significance is crucial for efficient and error-prone encoding of the short functional sequence motifs. In most cases, researchers perform exhaustive manual selection of the most informative indices. These two facts motivated us to analyse the widely used AA indices. The main goal of this article is twofold. First, we present a novel method of partitioning the bioinformatics data using consensus fuzzy clustering, where the recently proposed fuzzy clustering techniques are exploited. Second, we prepare three high quality subsets of all available indices. Superiority of the consensus fuzzy clustering method is demonstrated quantitatively, visually and statistically by comparing it with the previously proposed hierarchical clustered results. The processed AAindex1 database, supplementary material and the software are available at http://sysbio.icm.edu.pl/aaindex/. Springer Vienna 2011-10-13 2012 /pmc/articles/PMC3397137/ /pubmed/21993537 http://dx.doi.org/10.1007/s00726-011-1106-9 Text en © The Author(s) 2011 https://creativecommons.org/licenses/by-nc/4.0/ This article is distributed under the terms of the Creative Commons Attribution Noncommercial License which permits any noncommercial use, distribution, and reproduction in any medium, provided the original author(s) and source are credited.
spellingShingle Original Article
Saha, Indrajit
Maulik, Ujjwal
Bandyopadhyay, Sanghamitra
Plewczynski, Dariusz
Fuzzy clustering of physicochemical and biochemical properties of amino Acids
title Fuzzy clustering of physicochemical and biochemical properties of amino Acids
title_full Fuzzy clustering of physicochemical and biochemical properties of amino Acids
title_fullStr Fuzzy clustering of physicochemical and biochemical properties of amino Acids
title_full_unstemmed Fuzzy clustering of physicochemical and biochemical properties of amino Acids
title_short Fuzzy clustering of physicochemical and biochemical properties of amino Acids
title_sort fuzzy clustering of physicochemical and biochemical properties of amino acids
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3397137/
https://www.ncbi.nlm.nih.gov/pubmed/21993537
http://dx.doi.org/10.1007/s00726-011-1106-9
work_keys_str_mv AT sahaindrajit fuzzyclusteringofphysicochemicalandbiochemicalpropertiesofaminoacids
AT maulikujjwal fuzzyclusteringofphysicochemicalandbiochemicalpropertiesofaminoacids
AT bandyopadhyaysanghamitra fuzzyclusteringofphysicochemicalandbiochemicalpropertiesofaminoacids
AT plewczynskidariusz fuzzyclusteringofphysicochemicalandbiochemicalpropertiesofaminoacids