Cargando…

Mathematical Characterization of Protein Sequences Using Patterns as Chemical Group Combinations of Amino Acids

Comparison of amino acid sequence similarity is the fundamental concept behind the protein phylogenetic tree formation. By virtue of this method, we can explain the evolutionary relationships, but further explanations are not possible unless sequences are studied through the chemical nature of indiv...

Descripción completa

Detalles Bibliográficos
Autores principales: Das, Jayanta Kumar, Das, Provas, Ray, Korak Kumar, Choudhury, Pabitra Pal, Jana, Siddhartha Sankar
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5145171/
https://www.ncbi.nlm.nih.gov/pubmed/27930687
http://dx.doi.org/10.1371/journal.pone.0167651
_version_ 1782473248761446400
author Das, Jayanta Kumar
Das, Provas
Ray, Korak Kumar
Choudhury, Pabitra Pal
Jana, Siddhartha Sankar
author_facet Das, Jayanta Kumar
Das, Provas
Ray, Korak Kumar
Choudhury, Pabitra Pal
Jana, Siddhartha Sankar
author_sort Das, Jayanta Kumar
collection PubMed
description Comparison of amino acid sequence similarity is the fundamental concept behind the protein phylogenetic tree formation. By virtue of this method, we can explain the evolutionary relationships, but further explanations are not possible unless sequences are studied through the chemical nature of individual amino acids. Here we develop a new methodology to characterize the protein sequences on the basis of the chemical nature of the amino acids. We design various algorithms for studying the variation of chemical group transitions and various chemical group combinations as patterns in the protein sequences. The amino acid sequence of conventional myosin II head domain of 14 family members are taken to illustrate this new approach. We find two blocks of maximum length 6 aa as ‘FPKATD’ and ‘Y/FTNEKL’ without repeating the same chemical nature and one block of maximum length 20 aa with the repetition of chemical nature which are common among all 14 members. We also check commonality with another motor protein sub-family kinesin, KIF1A. Based on our analysis we find a common block of length 8 aa both in myosin II and KIF1A. This motif is located in the neck linker region which could be responsible for the generation of mechanical force, enabling us to find the unique blocks which remain chemically conserved across the family. We also validate our methodology with different protein families such as MYOI, Myosin light chain kinase (MLCK) and Rho-associated protein kinase (ROCK), Na(+)/K(+)-ATPase and Ca(2+)-ATPase. Altogether, our studies provide a new methodology for investigating the conserved amino acids’ pattern in different proteins.
format Online
Article
Text
id pubmed-5145171
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-51451712016-12-22 Mathematical Characterization of Protein Sequences Using Patterns as Chemical Group Combinations of Amino Acids Das, Jayanta Kumar Das, Provas Ray, Korak Kumar Choudhury, Pabitra Pal Jana, Siddhartha Sankar PLoS One Research Article Comparison of amino acid sequence similarity is the fundamental concept behind the protein phylogenetic tree formation. By virtue of this method, we can explain the evolutionary relationships, but further explanations are not possible unless sequences are studied through the chemical nature of individual amino acids. Here we develop a new methodology to characterize the protein sequences on the basis of the chemical nature of the amino acids. We design various algorithms for studying the variation of chemical group transitions and various chemical group combinations as patterns in the protein sequences. The amino acid sequence of conventional myosin II head domain of 14 family members are taken to illustrate this new approach. We find two blocks of maximum length 6 aa as ‘FPKATD’ and ‘Y/FTNEKL’ without repeating the same chemical nature and one block of maximum length 20 aa with the repetition of chemical nature which are common among all 14 members. We also check commonality with another motor protein sub-family kinesin, KIF1A. Based on our analysis we find a common block of length 8 aa both in myosin II and KIF1A. This motif is located in the neck linker region which could be responsible for the generation of mechanical force, enabling us to find the unique blocks which remain chemically conserved across the family. We also validate our methodology with different protein families such as MYOI, Myosin light chain kinase (MLCK) and Rho-associated protein kinase (ROCK), Na(+)/K(+)-ATPase and Ca(2+)-ATPase. Altogether, our studies provide a new methodology for investigating the conserved amino acids’ pattern in different proteins. Public Library of Science 2016-12-08 /pmc/articles/PMC5145171/ /pubmed/27930687 http://dx.doi.org/10.1371/journal.pone.0167651 Text en © 2016 Das et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Das, Jayanta Kumar
Das, Provas
Ray, Korak Kumar
Choudhury, Pabitra Pal
Jana, Siddhartha Sankar
Mathematical Characterization of Protein Sequences Using Patterns as Chemical Group Combinations of Amino Acids
title Mathematical Characterization of Protein Sequences Using Patterns as Chemical Group Combinations of Amino Acids
title_full Mathematical Characterization of Protein Sequences Using Patterns as Chemical Group Combinations of Amino Acids
title_fullStr Mathematical Characterization of Protein Sequences Using Patterns as Chemical Group Combinations of Amino Acids
title_full_unstemmed Mathematical Characterization of Protein Sequences Using Patterns as Chemical Group Combinations of Amino Acids
title_short Mathematical Characterization of Protein Sequences Using Patterns as Chemical Group Combinations of Amino Acids
title_sort mathematical characterization of protein sequences using patterns as chemical group combinations of amino acids
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5145171/
https://www.ncbi.nlm.nih.gov/pubmed/27930687
http://dx.doi.org/10.1371/journal.pone.0167651
work_keys_str_mv AT dasjayantakumar mathematicalcharacterizationofproteinsequencesusingpatternsaschemicalgroupcombinationsofaminoacids
AT dasprovas mathematicalcharacterizationofproteinsequencesusingpatternsaschemicalgroupcombinationsofaminoacids
AT raykorakkumar mathematicalcharacterizationofproteinsequencesusingpatternsaschemicalgroupcombinationsofaminoacids
AT choudhurypabitrapal mathematicalcharacterizationofproteinsequencesusingpatternsaschemicalgroupcombinationsofaminoacids
AT janasiddharthasankar mathematicalcharacterizationofproteinsequencesusingpatternsaschemicalgroupcombinationsofaminoacids