Cargando…

Identifying host-specific amino acid signatures for influenza A viruses using an adjusted entropy measure

BACKGROUND: Influenza A viruses (IAV) exhibit vast genetic mutability and have great zoonotic potential to infect avian and mammalian hosts and are known to be responsible for a number of pandemics. A key computational issue in influenza prevention and control is the identification of molecular sign...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Yixiang, Eskridge, Kent M., Zhang, Shunpu, Lu, Guoqing
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9372975/
https://www.ncbi.nlm.nih.gov/pubmed/35962315
http://dx.doi.org/10.1186/s12859-022-04885-7
_version_ 1784767503237382144
author Zhang, Yixiang
Eskridge, Kent M.
Zhang, Shunpu
Lu, Guoqing
author_facet Zhang, Yixiang
Eskridge, Kent M.
Zhang, Shunpu
Lu, Guoqing
author_sort Zhang, Yixiang
collection PubMed
description BACKGROUND: Influenza A viruses (IAV) exhibit vast genetic mutability and have great zoonotic potential to infect avian and mammalian hosts and are known to be responsible for a number of pandemics. A key computational issue in influenza prevention and control is the identification of molecular signatures with cross-species transmission potential. We propose an adjusted entropy-based host-specific signature identification method that uses a similarity coefficient to incorporate the amino acid substitution information and improve the identification performance. Mutations in the polymerase genes (e.g., PB2) are known to play a major role in avian influenza virus adaptation to mammalian hosts. We thus focus on the analysis of PB2 protein sequences and identify host specific PB2 amino acid signatures. RESULTS: Validation with a set of H5N1 PB2 sequences from 1996 to 2006 results in adjusted entropy having a 40% false negative discovery rate compared to a 60% false negative rate using unadjusted entropy. Simulations across different levels of sequence divergence show a false negative rate of no higher than 10% while unadjusted entropy ranged from 9 to 100%. In addition, under all levels of divergence adjusted entropy never had a false positive rate higher than 9%. Adjusted entropy also identifies important mutations in H1N1pdm PB2 previously identified in the literature that explain changes in divergence between 2008 and 2009 which unadjusted entropy could not identify. CONCLUSIONS: Based on these results, adjusted entropy provides a reliable and widely applicable host signature identification approach useful for IAV monitoring and vaccine development.
format Online
Article
Text
id pubmed-9372975
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-93729752022-08-12 Identifying host-specific amino acid signatures for influenza A viruses using an adjusted entropy measure Zhang, Yixiang Eskridge, Kent M. Zhang, Shunpu Lu, Guoqing BMC Bioinformatics Research BACKGROUND: Influenza A viruses (IAV) exhibit vast genetic mutability and have great zoonotic potential to infect avian and mammalian hosts and are known to be responsible for a number of pandemics. A key computational issue in influenza prevention and control is the identification of molecular signatures with cross-species transmission potential. We propose an adjusted entropy-based host-specific signature identification method that uses a similarity coefficient to incorporate the amino acid substitution information and improve the identification performance. Mutations in the polymerase genes (e.g., PB2) are known to play a major role in avian influenza virus adaptation to mammalian hosts. We thus focus on the analysis of PB2 protein sequences and identify host specific PB2 amino acid signatures. RESULTS: Validation with a set of H5N1 PB2 sequences from 1996 to 2006 results in adjusted entropy having a 40% false negative discovery rate compared to a 60% false negative rate using unadjusted entropy. Simulations across different levels of sequence divergence show a false negative rate of no higher than 10% while unadjusted entropy ranged from 9 to 100%. In addition, under all levels of divergence adjusted entropy never had a false positive rate higher than 9%. Adjusted entropy also identifies important mutations in H1N1pdm PB2 previously identified in the literature that explain changes in divergence between 2008 and 2009 which unadjusted entropy could not identify. CONCLUSIONS: Based on these results, adjusted entropy provides a reliable and widely applicable host signature identification approach useful for IAV monitoring and vaccine development. BioMed Central 2022-08-12 /pmc/articles/PMC9372975/ /pubmed/35962315 http://dx.doi.org/10.1186/s12859-022-04885-7 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Zhang, Yixiang
Eskridge, Kent M.
Zhang, Shunpu
Lu, Guoqing
Identifying host-specific amino acid signatures for influenza A viruses using an adjusted entropy measure
title Identifying host-specific amino acid signatures for influenza A viruses using an adjusted entropy measure
title_full Identifying host-specific amino acid signatures for influenza A viruses using an adjusted entropy measure
title_fullStr Identifying host-specific amino acid signatures for influenza A viruses using an adjusted entropy measure
title_full_unstemmed Identifying host-specific amino acid signatures for influenza A viruses using an adjusted entropy measure
title_short Identifying host-specific amino acid signatures for influenza A viruses using an adjusted entropy measure
title_sort identifying host-specific amino acid signatures for influenza a viruses using an adjusted entropy measure
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9372975/
https://www.ncbi.nlm.nih.gov/pubmed/35962315
http://dx.doi.org/10.1186/s12859-022-04885-7
work_keys_str_mv AT zhangyixiang identifyinghostspecificaminoacidsignaturesforinfluenzaavirusesusinganadjustedentropymeasure
AT eskridgekentm identifyinghostspecificaminoacidsignaturesforinfluenzaavirusesusinganadjustedentropymeasure
AT zhangshunpu identifyinghostspecificaminoacidsignaturesforinfluenzaavirusesusinganadjustedentropymeasure
AT luguoqing identifyinghostspecificaminoacidsignaturesforinfluenzaavirusesusinganadjustedentropymeasure