Cargando…

Understanding the Underlying Mechanism of HA-Subtyping in the Level of Physic-Chemical Characteristics of Protein

The evolution of the influenza A virus to increase its host range is a major concern worldwide. Molecular mechanisms of increasing host range are largely unknown. Influenza surface proteins play determining roles in reorganization of host-sialic acid receptors and host range. In an attempt to uncove...

Descripción completa

Detalles Bibliográficos
Autores principales: Ebrahimi, Mansour, Aghagolzadeh, Parisa, Shamabadi, Narges, Tahmasebi, Ahmad, Alsharifi, Mohammed, Adelson, David L., Hemmatzadeh, Farhid, Ebrahimie, Esmaeil
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4014573/
https://www.ncbi.nlm.nih.gov/pubmed/24809455
http://dx.doi.org/10.1371/journal.pone.0096984
_version_ 1782315199226707968
author Ebrahimi, Mansour
Aghagolzadeh, Parisa
Shamabadi, Narges
Tahmasebi, Ahmad
Alsharifi, Mohammed
Adelson, David L.
Hemmatzadeh, Farhid
Ebrahimie, Esmaeil
author_facet Ebrahimi, Mansour
Aghagolzadeh, Parisa
Shamabadi, Narges
Tahmasebi, Ahmad
Alsharifi, Mohammed
Adelson, David L.
Hemmatzadeh, Farhid
Ebrahimie, Esmaeil
author_sort Ebrahimi, Mansour
collection PubMed
description The evolution of the influenza A virus to increase its host range is a major concern worldwide. Molecular mechanisms of increasing host range are largely unknown. Influenza surface proteins play determining roles in reorganization of host-sialic acid receptors and host range. In an attempt to uncover the physic-chemical attributes which govern HA subtyping, we performed a large scale functional analysis of over 7000 sequences of 16 different HA subtypes. Large number (896) of physic-chemical protein characteristics were calculated for each HA sequence. Then, 10 different attribute weighting algorithms were used to find the key characteristics distinguishing HA subtypes. Furthermore, to discover machine leaning models which can predict HA subtypes, various Decision Tree, Support Vector Machine, Naïve Bayes, and Neural Network models were trained on calculated protein characteristics dataset as well as 10 trimmed datasets generated by attribute weighting algorithms. The prediction accuracies of the machine learning methods were evaluated by 10-fold cross validation. The results highlighted the frequency of Gln (selected by 80% of attribute weighting algorithms), percentage/frequency of Tyr, percentage of Cys, and frequencies of Try and Glu (selected by 70% of attribute weighting algorithms) as the key features that are associated with HA subtyping. Random Forest tree induction algorithm and RBF kernel function of SVM (scaled by grid search) showed high accuracy of 98% in clustering and predicting HA subtypes based on protein attributes. Decision tree models were successful in monitoring the short mutation/reassortment paths by which influenza virus can gain the key protein structure of another HA subtype and increase its host range in a short period of time with less energy consumption. Extracting and mining a large number of amino acid attributes of HA subtypes of influenza A virus through supervised algorithms represent a new avenue for understanding and predicting possible future structure of influenza pandemics.
format Online
Article
Text
id pubmed-4014573
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-40145732014-05-14 Understanding the Underlying Mechanism of HA-Subtyping in the Level of Physic-Chemical Characteristics of Protein Ebrahimi, Mansour Aghagolzadeh, Parisa Shamabadi, Narges Tahmasebi, Ahmad Alsharifi, Mohammed Adelson, David L. Hemmatzadeh, Farhid Ebrahimie, Esmaeil PLoS One Research Article The evolution of the influenza A virus to increase its host range is a major concern worldwide. Molecular mechanisms of increasing host range are largely unknown. Influenza surface proteins play determining roles in reorganization of host-sialic acid receptors and host range. In an attempt to uncover the physic-chemical attributes which govern HA subtyping, we performed a large scale functional analysis of over 7000 sequences of 16 different HA subtypes. Large number (896) of physic-chemical protein characteristics were calculated for each HA sequence. Then, 10 different attribute weighting algorithms were used to find the key characteristics distinguishing HA subtypes. Furthermore, to discover machine leaning models which can predict HA subtypes, various Decision Tree, Support Vector Machine, Naïve Bayes, and Neural Network models were trained on calculated protein characteristics dataset as well as 10 trimmed datasets generated by attribute weighting algorithms. The prediction accuracies of the machine learning methods were evaluated by 10-fold cross validation. The results highlighted the frequency of Gln (selected by 80% of attribute weighting algorithms), percentage/frequency of Tyr, percentage of Cys, and frequencies of Try and Glu (selected by 70% of attribute weighting algorithms) as the key features that are associated with HA subtyping. Random Forest tree induction algorithm and RBF kernel function of SVM (scaled by grid search) showed high accuracy of 98% in clustering and predicting HA subtypes based on protein attributes. Decision tree models were successful in monitoring the short mutation/reassortment paths by which influenza virus can gain the key protein structure of another HA subtype and increase its host range in a short period of time with less energy consumption. Extracting and mining a large number of amino acid attributes of HA subtypes of influenza A virus through supervised algorithms represent a new avenue for understanding and predicting possible future structure of influenza pandemics. Public Library of Science 2014-05-08 /pmc/articles/PMC4014573/ /pubmed/24809455 http://dx.doi.org/10.1371/journal.pone.0096984 Text en © 2014 Ebrahimi et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Ebrahimi, Mansour
Aghagolzadeh, Parisa
Shamabadi, Narges
Tahmasebi, Ahmad
Alsharifi, Mohammed
Adelson, David L.
Hemmatzadeh, Farhid
Ebrahimie, Esmaeil
Understanding the Underlying Mechanism of HA-Subtyping in the Level of Physic-Chemical Characteristics of Protein
title Understanding the Underlying Mechanism of HA-Subtyping in the Level of Physic-Chemical Characteristics of Protein
title_full Understanding the Underlying Mechanism of HA-Subtyping in the Level of Physic-Chemical Characteristics of Protein
title_fullStr Understanding the Underlying Mechanism of HA-Subtyping in the Level of Physic-Chemical Characteristics of Protein
title_full_unstemmed Understanding the Underlying Mechanism of HA-Subtyping in the Level of Physic-Chemical Characteristics of Protein
title_short Understanding the Underlying Mechanism of HA-Subtyping in the Level of Physic-Chemical Characteristics of Protein
title_sort understanding the underlying mechanism of ha-subtyping in the level of physic-chemical characteristics of protein
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4014573/
https://www.ncbi.nlm.nih.gov/pubmed/24809455
http://dx.doi.org/10.1371/journal.pone.0096984
work_keys_str_mv AT ebrahimimansour understandingtheunderlyingmechanismofhasubtypinginthelevelofphysicchemicalcharacteristicsofprotein
AT aghagolzadehparisa understandingtheunderlyingmechanismofhasubtypinginthelevelofphysicchemicalcharacteristicsofprotein
AT shamabadinarges understandingtheunderlyingmechanismofhasubtypinginthelevelofphysicchemicalcharacteristicsofprotein
AT tahmasebiahmad understandingtheunderlyingmechanismofhasubtypinginthelevelofphysicchemicalcharacteristicsofprotein
AT alsharifimohammed understandingtheunderlyingmechanismofhasubtypinginthelevelofphysicchemicalcharacteristicsofprotein
AT adelsondavidl understandingtheunderlyingmechanismofhasubtypinginthelevelofphysicchemicalcharacteristicsofprotein
AT hemmatzadehfarhid understandingtheunderlyingmechanismofhasubtypinginthelevelofphysicchemicalcharacteristicsofprotein
AT ebrahimieesmaeil understandingtheunderlyingmechanismofhasubtypinginthelevelofphysicchemicalcharacteristicsofprotein