Cargando…
Fuzzy clustering of physicochemical and biochemical properties of amino Acids
In this article, we categorize presently available experimental and theoretical knowledge of various physicochemical and biochemical features of amino acids, as collected in the AAindex database of known 544 amino acid (AA) indices. Previously reported 402 indices were categorized into six groups us...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Springer Vienna
2011
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3397137/ https://www.ncbi.nlm.nih.gov/pubmed/21993537 http://dx.doi.org/10.1007/s00726-011-1106-9 |
_version_ | 1782238153820602368 |
---|---|
author | Saha, Indrajit Maulik, Ujjwal Bandyopadhyay, Sanghamitra Plewczynski, Dariusz |
author_facet | Saha, Indrajit Maulik, Ujjwal Bandyopadhyay, Sanghamitra Plewczynski, Dariusz |
author_sort | Saha, Indrajit |
collection | PubMed |
description | In this article, we categorize presently available experimental and theoretical knowledge of various physicochemical and biochemical features of amino acids, as collected in the AAindex database of known 544 amino acid (AA) indices. Previously reported 402 indices were categorized into six groups using hierarchical clustering technique and 142 were left unclustered. However, due to the increasing diversity of the database these indices are overlapping, therefore crisp clustering method may not provide optimal results. Moreover, in various large-scale bioinformatics analyses of whole proteomes, the proper selection of amino acid indices representing their biological significance is crucial for efficient and error-prone encoding of the short functional sequence motifs. In most cases, researchers perform exhaustive manual selection of the most informative indices. These two facts motivated us to analyse the widely used AA indices. The main goal of this article is twofold. First, we present a novel method of partitioning the bioinformatics data using consensus fuzzy clustering, where the recently proposed fuzzy clustering techniques are exploited. Second, we prepare three high quality subsets of all available indices. Superiority of the consensus fuzzy clustering method is demonstrated quantitatively, visually and statistically by comparing it with the previously proposed hierarchical clustered results. The processed AAindex1 database, supplementary material and the software are available at http://sysbio.icm.edu.pl/aaindex/. |
format | Online Article Text |
id | pubmed-3397137 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2011 |
publisher | Springer Vienna |
record_format | MEDLINE/PubMed |
spelling | pubmed-33971372012-07-19 Fuzzy clustering of physicochemical and biochemical properties of amino Acids Saha, Indrajit Maulik, Ujjwal Bandyopadhyay, Sanghamitra Plewczynski, Dariusz Amino Acids Original Article In this article, we categorize presently available experimental and theoretical knowledge of various physicochemical and biochemical features of amino acids, as collected in the AAindex database of known 544 amino acid (AA) indices. Previously reported 402 indices were categorized into six groups using hierarchical clustering technique and 142 were left unclustered. However, due to the increasing diversity of the database these indices are overlapping, therefore crisp clustering method may not provide optimal results. Moreover, in various large-scale bioinformatics analyses of whole proteomes, the proper selection of amino acid indices representing their biological significance is crucial for efficient and error-prone encoding of the short functional sequence motifs. In most cases, researchers perform exhaustive manual selection of the most informative indices. These two facts motivated us to analyse the widely used AA indices. The main goal of this article is twofold. First, we present a novel method of partitioning the bioinformatics data using consensus fuzzy clustering, where the recently proposed fuzzy clustering techniques are exploited. Second, we prepare three high quality subsets of all available indices. Superiority of the consensus fuzzy clustering method is demonstrated quantitatively, visually and statistically by comparing it with the previously proposed hierarchical clustered results. The processed AAindex1 database, supplementary material and the software are available at http://sysbio.icm.edu.pl/aaindex/. Springer Vienna 2011-10-13 2012 /pmc/articles/PMC3397137/ /pubmed/21993537 http://dx.doi.org/10.1007/s00726-011-1106-9 Text en © The Author(s) 2011 https://creativecommons.org/licenses/by-nc/4.0/ This article is distributed under the terms of the Creative Commons Attribution Noncommercial License which permits any noncommercial use, distribution, and reproduction in any medium, provided the original author(s) and source are credited. |
spellingShingle | Original Article Saha, Indrajit Maulik, Ujjwal Bandyopadhyay, Sanghamitra Plewczynski, Dariusz Fuzzy clustering of physicochemical and biochemical properties of amino Acids |
title | Fuzzy clustering of physicochemical and biochemical properties of amino Acids |
title_full | Fuzzy clustering of physicochemical and biochemical properties of amino Acids |
title_fullStr | Fuzzy clustering of physicochemical and biochemical properties of amino Acids |
title_full_unstemmed | Fuzzy clustering of physicochemical and biochemical properties of amino Acids |
title_short | Fuzzy clustering of physicochemical and biochemical properties of amino Acids |
title_sort | fuzzy clustering of physicochemical and biochemical properties of amino acids |
topic | Original Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3397137/ https://www.ncbi.nlm.nih.gov/pubmed/21993537 http://dx.doi.org/10.1007/s00726-011-1106-9 |
work_keys_str_mv | AT sahaindrajit fuzzyclusteringofphysicochemicalandbiochemicalpropertiesofaminoacids AT maulikujjwal fuzzyclusteringofphysicochemicalandbiochemicalpropertiesofaminoacids AT bandyopadhyaysanghamitra fuzzyclusteringofphysicochemicalandbiochemicalpropertiesofaminoacids AT plewczynskidariusz fuzzyclusteringofphysicochemicalandbiochemicalpropertiesofaminoacids |