Cargando…

Predicting host tropism of influenza A virus proteins using random forest

BACKGROUND: Majority of influenza A viruses reside and circulate among animal populations, seldom infecting humans due to host range restriction. Yet when some avian strains do acquire the ability to overcome species barrier, they might become adapted to humans, replicating efficiently and causing d...

Descripción completa

Detalles Bibliográficos
Autores principales: Eng, Christine LP, Tong, Joo Chuan, Tan, Tin Wee
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4290784/
https://www.ncbi.nlm.nih.gov/pubmed/25521718
http://dx.doi.org/10.1186/1755-8794-7-S3-S1
_version_ 1782352302650163200
author Eng, Christine LP
Tong, Joo Chuan
Tan, Tin Wee
author_facet Eng, Christine LP
Tong, Joo Chuan
Tan, Tin Wee
author_sort Eng, Christine LP
collection PubMed
description BACKGROUND: Majority of influenza A viruses reside and circulate among animal populations, seldom infecting humans due to host range restriction. Yet when some avian strains do acquire the ability to overcome species barrier, they might become adapted to humans, replicating efficiently and causing diseases, leading to potential pandemic. With the huge influenza A virus reservoir in wild birds, it is a cause for concern when a new influenza strain emerges with the ability to cross host species barrier, as shown in light of the recent H7N9 outbreak in China. Several influenza proteins have been shown to be major determinants in host tropism. Further understanding and determining host tropism would be important in identifying zoonotic influenza virus strains capable of crossing species barrier and infecting humans. RESULTS: In this study, computational models for 11 influenza proteins have been constructed using the machine learning algorithm random forest for prediction of host tropism. The prediction models were trained on influenza protein sequences isolated from both avian and human samples, which were transformed into amino acid physicochemical properties feature vectors. The results were highly accurate prediction models (ACC>96.57; AUC>0.980; MCC>0.916) capable of determining host tropism of individual influenza proteins. In addition, features from all 11 proteins were used to construct a combined model to predict host tropism of influenza virus strains. This would help assess a novel influenza strain's host range capability. CONCLUSIONS: From the prediction models constructed, all achieved high prediction performance, indicating clear distinctions in both avian and human proteins. When used together as a host tropism prediction system, zoonotic strains could potentially be identified based on different protein prediction results. Understanding and predicting host tropism of influenza proteins lay an important foundation for future work in constructing computation models capable of directly predicting interspecies transmission of influenza viruses. The models are available for prediction at http://fluleap.bic.nus.edu.sg.
format Online
Article
Text
id pubmed-4290784
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-42907842015-01-15 Predicting host tropism of influenza A virus proteins using random forest Eng, Christine LP Tong, Joo Chuan Tan, Tin Wee BMC Med Genomics Research BACKGROUND: Majority of influenza A viruses reside and circulate among animal populations, seldom infecting humans due to host range restriction. Yet when some avian strains do acquire the ability to overcome species barrier, they might become adapted to humans, replicating efficiently and causing diseases, leading to potential pandemic. With the huge influenza A virus reservoir in wild birds, it is a cause for concern when a new influenza strain emerges with the ability to cross host species barrier, as shown in light of the recent H7N9 outbreak in China. Several influenza proteins have been shown to be major determinants in host tropism. Further understanding and determining host tropism would be important in identifying zoonotic influenza virus strains capable of crossing species barrier and infecting humans. RESULTS: In this study, computational models for 11 influenza proteins have been constructed using the machine learning algorithm random forest for prediction of host tropism. The prediction models were trained on influenza protein sequences isolated from both avian and human samples, which were transformed into amino acid physicochemical properties feature vectors. The results were highly accurate prediction models (ACC>96.57; AUC>0.980; MCC>0.916) capable of determining host tropism of individual influenza proteins. In addition, features from all 11 proteins were used to construct a combined model to predict host tropism of influenza virus strains. This would help assess a novel influenza strain's host range capability. CONCLUSIONS: From the prediction models constructed, all achieved high prediction performance, indicating clear distinctions in both avian and human proteins. When used together as a host tropism prediction system, zoonotic strains could potentially be identified based on different protein prediction results. Understanding and predicting host tropism of influenza proteins lay an important foundation for future work in constructing computation models capable of directly predicting interspecies transmission of influenza viruses. The models are available for prediction at http://fluleap.bic.nus.edu.sg. BioMed Central 2014-12-08 /pmc/articles/PMC4290784/ /pubmed/25521718 http://dx.doi.org/10.1186/1755-8794-7-S3-S1 Text en Copyright © 2014 Eng et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/4.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Eng, Christine LP
Tong, Joo Chuan
Tan, Tin Wee
Predicting host tropism of influenza A virus proteins using random forest
title Predicting host tropism of influenza A virus proteins using random forest
title_full Predicting host tropism of influenza A virus proteins using random forest
title_fullStr Predicting host tropism of influenza A virus proteins using random forest
title_full_unstemmed Predicting host tropism of influenza A virus proteins using random forest
title_short Predicting host tropism of influenza A virus proteins using random forest
title_sort predicting host tropism of influenza a virus proteins using random forest
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4290784/
https://www.ncbi.nlm.nih.gov/pubmed/25521718
http://dx.doi.org/10.1186/1755-8794-7-S3-S1
work_keys_str_mv AT engchristinelp predictinghosttropismofinfluenzaavirusproteinsusingrandomforest
AT tongjoochuan predictinghosttropismofinfluenzaavirusproteinsusingrandomforest
AT tantinwee predictinghosttropismofinfluenzaavirusproteinsusingrandomforest