Cargando…
Understanding the Underlying Mechanism of HA-Subtyping in the Level of Physic-Chemical Characteristics of Protein
The evolution of the influenza A virus to increase its host range is a major concern worldwide. Molecular mechanisms of increasing host range are largely unknown. Influenza surface proteins play determining roles in reorganization of host-sialic acid receptors and host range. In an attempt to uncove...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4014573/ https://www.ncbi.nlm.nih.gov/pubmed/24809455 http://dx.doi.org/10.1371/journal.pone.0096984 |
_version_ | 1782315199226707968 |
---|---|
author | Ebrahimi, Mansour Aghagolzadeh, Parisa Shamabadi, Narges Tahmasebi, Ahmad Alsharifi, Mohammed Adelson, David L. Hemmatzadeh, Farhid Ebrahimie, Esmaeil |
author_facet | Ebrahimi, Mansour Aghagolzadeh, Parisa Shamabadi, Narges Tahmasebi, Ahmad Alsharifi, Mohammed Adelson, David L. Hemmatzadeh, Farhid Ebrahimie, Esmaeil |
author_sort | Ebrahimi, Mansour |
collection | PubMed |
description | The evolution of the influenza A virus to increase its host range is a major concern worldwide. Molecular mechanisms of increasing host range are largely unknown. Influenza surface proteins play determining roles in reorganization of host-sialic acid receptors and host range. In an attempt to uncover the physic-chemical attributes which govern HA subtyping, we performed a large scale functional analysis of over 7000 sequences of 16 different HA subtypes. Large number (896) of physic-chemical protein characteristics were calculated for each HA sequence. Then, 10 different attribute weighting algorithms were used to find the key characteristics distinguishing HA subtypes. Furthermore, to discover machine leaning models which can predict HA subtypes, various Decision Tree, Support Vector Machine, Naïve Bayes, and Neural Network models were trained on calculated protein characteristics dataset as well as 10 trimmed datasets generated by attribute weighting algorithms. The prediction accuracies of the machine learning methods were evaluated by 10-fold cross validation. The results highlighted the frequency of Gln (selected by 80% of attribute weighting algorithms), percentage/frequency of Tyr, percentage of Cys, and frequencies of Try and Glu (selected by 70% of attribute weighting algorithms) as the key features that are associated with HA subtyping. Random Forest tree induction algorithm and RBF kernel function of SVM (scaled by grid search) showed high accuracy of 98% in clustering and predicting HA subtypes based on protein attributes. Decision tree models were successful in monitoring the short mutation/reassortment paths by which influenza virus can gain the key protein structure of another HA subtype and increase its host range in a short period of time with less energy consumption. Extracting and mining a large number of amino acid attributes of HA subtypes of influenza A virus through supervised algorithms represent a new avenue for understanding and predicting possible future structure of influenza pandemics. |
format | Online Article Text |
id | pubmed-4014573 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-40145732014-05-14 Understanding the Underlying Mechanism of HA-Subtyping in the Level of Physic-Chemical Characteristics of Protein Ebrahimi, Mansour Aghagolzadeh, Parisa Shamabadi, Narges Tahmasebi, Ahmad Alsharifi, Mohammed Adelson, David L. Hemmatzadeh, Farhid Ebrahimie, Esmaeil PLoS One Research Article The evolution of the influenza A virus to increase its host range is a major concern worldwide. Molecular mechanisms of increasing host range are largely unknown. Influenza surface proteins play determining roles in reorganization of host-sialic acid receptors and host range. In an attempt to uncover the physic-chemical attributes which govern HA subtyping, we performed a large scale functional analysis of over 7000 sequences of 16 different HA subtypes. Large number (896) of physic-chemical protein characteristics were calculated for each HA sequence. Then, 10 different attribute weighting algorithms were used to find the key characteristics distinguishing HA subtypes. Furthermore, to discover machine leaning models which can predict HA subtypes, various Decision Tree, Support Vector Machine, Naïve Bayes, and Neural Network models were trained on calculated protein characteristics dataset as well as 10 trimmed datasets generated by attribute weighting algorithms. The prediction accuracies of the machine learning methods were evaluated by 10-fold cross validation. The results highlighted the frequency of Gln (selected by 80% of attribute weighting algorithms), percentage/frequency of Tyr, percentage of Cys, and frequencies of Try and Glu (selected by 70% of attribute weighting algorithms) as the key features that are associated with HA subtyping. Random Forest tree induction algorithm and RBF kernel function of SVM (scaled by grid search) showed high accuracy of 98% in clustering and predicting HA subtypes based on protein attributes. Decision tree models were successful in monitoring the short mutation/reassortment paths by which influenza virus can gain the key protein structure of another HA subtype and increase its host range in a short period of time with less energy consumption. Extracting and mining a large number of amino acid attributes of HA subtypes of influenza A virus through supervised algorithms represent a new avenue for understanding and predicting possible future structure of influenza pandemics. Public Library of Science 2014-05-08 /pmc/articles/PMC4014573/ /pubmed/24809455 http://dx.doi.org/10.1371/journal.pone.0096984 Text en © 2014 Ebrahimi et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited. |
spellingShingle | Research Article Ebrahimi, Mansour Aghagolzadeh, Parisa Shamabadi, Narges Tahmasebi, Ahmad Alsharifi, Mohammed Adelson, David L. Hemmatzadeh, Farhid Ebrahimie, Esmaeil Understanding the Underlying Mechanism of HA-Subtyping in the Level of Physic-Chemical Characteristics of Protein |
title | Understanding the Underlying Mechanism of HA-Subtyping in the Level of Physic-Chemical Characteristics of Protein |
title_full | Understanding the Underlying Mechanism of HA-Subtyping in the Level of Physic-Chemical Characteristics of Protein |
title_fullStr | Understanding the Underlying Mechanism of HA-Subtyping in the Level of Physic-Chemical Characteristics of Protein |
title_full_unstemmed | Understanding the Underlying Mechanism of HA-Subtyping in the Level of Physic-Chemical Characteristics of Protein |
title_short | Understanding the Underlying Mechanism of HA-Subtyping in the Level of Physic-Chemical Characteristics of Protein |
title_sort | understanding the underlying mechanism of ha-subtyping in the level of physic-chemical characteristics of protein |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4014573/ https://www.ncbi.nlm.nih.gov/pubmed/24809455 http://dx.doi.org/10.1371/journal.pone.0096984 |
work_keys_str_mv | AT ebrahimimansour understandingtheunderlyingmechanismofhasubtypinginthelevelofphysicchemicalcharacteristicsofprotein AT aghagolzadehparisa understandingtheunderlyingmechanismofhasubtypinginthelevelofphysicchemicalcharacteristicsofprotein AT shamabadinarges understandingtheunderlyingmechanismofhasubtypinginthelevelofphysicchemicalcharacteristicsofprotein AT tahmasebiahmad understandingtheunderlyingmechanismofhasubtypinginthelevelofphysicchemicalcharacteristicsofprotein AT alsharifimohammed understandingtheunderlyingmechanismofhasubtypinginthelevelofphysicchemicalcharacteristicsofprotein AT adelsondavidl understandingtheunderlyingmechanismofhasubtypinginthelevelofphysicchemicalcharacteristicsofprotein AT hemmatzadehfarhid understandingtheunderlyingmechanismofhasubtypinginthelevelofphysicchemicalcharacteristicsofprotein AT ebrahimieesmaeil understandingtheunderlyingmechanismofhasubtypinginthelevelofphysicchemicalcharacteristicsofprotein |