Cargando…

Machine Learning Prediction and Experimental Validation of Antigenic Drift in H3 Influenza A Viruses in Swine

The antigenic diversity of influenza A viruses (IAV) circulating in swine challenges the development of effective vaccines, increasing zoonotic threat and pandemic potential. High-throughput sequencing technologies can quantify IAV genetic diversity, but there are no accurate approaches to adequatel...

Descripción completa

Detalles Bibliográficos
Autores principales: Zeller, Michael A., Gauger, Phillip C., Arendsee, Zebulun W., Souza, Carine K., Vincent, Amy L., Anderson, Tavis K.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Society for Microbiology 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8546707/
https://www.ncbi.nlm.nih.gov/pubmed/33731472
http://dx.doi.org/10.1128/mSphere.00920-20
_version_ 1784590242414592000
author Zeller, Michael A.
Gauger, Phillip C.
Arendsee, Zebulun W.
Souza, Carine K.
Vincent, Amy L.
Anderson, Tavis K.
author_facet Zeller, Michael A.
Gauger, Phillip C.
Arendsee, Zebulun W.
Souza, Carine K.
Vincent, Amy L.
Anderson, Tavis K.
author_sort Zeller, Michael A.
collection PubMed
description The antigenic diversity of influenza A viruses (IAV) circulating in swine challenges the development of effective vaccines, increasing zoonotic threat and pandemic potential. High-throughput sequencing technologies can quantify IAV genetic diversity, but there are no accurate approaches to adequately describe antigenic phenotypes. This study evaluated an ensemble of nonlinear regression models to estimate virus phenotype from genotype. Regression models were trained with a phenotypic data set of pairwise hemagglutination inhibition (HI) assays, using genetic sequence identity and pairwise amino acid mutations as predictor features. The model identified amino acid identity, ranked the relative importance of mutations in the hemagglutinin (HA) protein, and demonstrated good prediction accuracy. Four previously untested IAV strains were selected to experimentally validate model predictions by HI assays. Errors between predicted and measured distances of uncharacterized strains were 0.35, 0.61, 1.69, and 0.13 antigenic units. These empirically trained regression models can be used to estimate antigenic distances between different strains of IAV in swine by using sequence data. By ranking the importance of mutations in the HA, we provide criteria for identifying antigenically advanced IAV strains that may not be controlled by existing vaccines and can inform strain updates to vaccines to better control this pathogen. IMPORTANCE Influenza A viruses (IAV) in swine constitute a major economic burden to an important global agricultural sector, impact food security, and are a public health threat. Despite significant improvement in surveillance for IAV in swine over the past 10 years, sequence data have not been integrated into a systematic vaccine strain selection process for predicting antigenic phenotype and identifying determinants of antigenic drift. To overcome this, we developed nonlinear regression models that predict antigenic phenotype from genetic sequence data by training the model on hemagglutination inhibition assay results. We used these models to predict antigenic phenotype for previously uncharacterized IAV, ranked the importance of genetic features for antigenic phenotype, and experimentally validated our predictions. Our model predicted virus antigenic characteristics from genetic sequence data and provides a rapid and accurate method linking genetic sequence data to antigenic characteristics. This approach also provides support for public health by identifying viruses that are antigenically advanced from strains used as pandemic preparedness candidate vaccine viruses.
format Online
Article
Text
id pubmed-8546707
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher American Society for Microbiology
record_format MEDLINE/PubMed
spelling pubmed-85467072021-11-04 Machine Learning Prediction and Experimental Validation of Antigenic Drift in H3 Influenza A Viruses in Swine Zeller, Michael A. Gauger, Phillip C. Arendsee, Zebulun W. Souza, Carine K. Vincent, Amy L. Anderson, Tavis K. mSphere Research Article The antigenic diversity of influenza A viruses (IAV) circulating in swine challenges the development of effective vaccines, increasing zoonotic threat and pandemic potential. High-throughput sequencing technologies can quantify IAV genetic diversity, but there are no accurate approaches to adequately describe antigenic phenotypes. This study evaluated an ensemble of nonlinear regression models to estimate virus phenotype from genotype. Regression models were trained with a phenotypic data set of pairwise hemagglutination inhibition (HI) assays, using genetic sequence identity and pairwise amino acid mutations as predictor features. The model identified amino acid identity, ranked the relative importance of mutations in the hemagglutinin (HA) protein, and demonstrated good prediction accuracy. Four previously untested IAV strains were selected to experimentally validate model predictions by HI assays. Errors between predicted and measured distances of uncharacterized strains were 0.35, 0.61, 1.69, and 0.13 antigenic units. These empirically trained regression models can be used to estimate antigenic distances between different strains of IAV in swine by using sequence data. By ranking the importance of mutations in the HA, we provide criteria for identifying antigenically advanced IAV strains that may not be controlled by existing vaccines and can inform strain updates to vaccines to better control this pathogen. IMPORTANCE Influenza A viruses (IAV) in swine constitute a major economic burden to an important global agricultural sector, impact food security, and are a public health threat. Despite significant improvement in surveillance for IAV in swine over the past 10 years, sequence data have not been integrated into a systematic vaccine strain selection process for predicting antigenic phenotype and identifying determinants of antigenic drift. To overcome this, we developed nonlinear regression models that predict antigenic phenotype from genetic sequence data by training the model on hemagglutination inhibition assay results. We used these models to predict antigenic phenotype for previously uncharacterized IAV, ranked the importance of genetic features for antigenic phenotype, and experimentally validated our predictions. Our model predicted virus antigenic characteristics from genetic sequence data and provides a rapid and accurate method linking genetic sequence data to antigenic characteristics. This approach also provides support for public health by identifying viruses that are antigenically advanced from strains used as pandemic preparedness candidate vaccine viruses. American Society for Microbiology 2021-03-17 /pmc/articles/PMC8546707/ /pubmed/33731472 http://dx.doi.org/10.1128/mSphere.00920-20 Text en https://doi.org/10.1128/AuthorWarrantyLicense.v1This is a work of the U.S. Government and is not subject to copyright protection in the United States. Foreign copyrights may apply.
spellingShingle Research Article
Zeller, Michael A.
Gauger, Phillip C.
Arendsee, Zebulun W.
Souza, Carine K.
Vincent, Amy L.
Anderson, Tavis K.
Machine Learning Prediction and Experimental Validation of Antigenic Drift in H3 Influenza A Viruses in Swine
title Machine Learning Prediction and Experimental Validation of Antigenic Drift in H3 Influenza A Viruses in Swine
title_full Machine Learning Prediction and Experimental Validation of Antigenic Drift in H3 Influenza A Viruses in Swine
title_fullStr Machine Learning Prediction and Experimental Validation of Antigenic Drift in H3 Influenza A Viruses in Swine
title_full_unstemmed Machine Learning Prediction and Experimental Validation of Antigenic Drift in H3 Influenza A Viruses in Swine
title_short Machine Learning Prediction and Experimental Validation of Antigenic Drift in H3 Influenza A Viruses in Swine
title_sort machine learning prediction and experimental validation of antigenic drift in h3 influenza a viruses in swine
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8546707/
https://www.ncbi.nlm.nih.gov/pubmed/33731472
http://dx.doi.org/10.1128/mSphere.00920-20
work_keys_str_mv AT zellermichaela machinelearningpredictionandexperimentalvalidationofantigenicdriftinh3influenzaavirusesinswine
AT gaugerphillipc machinelearningpredictionandexperimentalvalidationofantigenicdriftinh3influenzaavirusesinswine
AT arendseezebulunw machinelearningpredictionandexperimentalvalidationofantigenicdriftinh3influenzaavirusesinswine
AT souzacarinek machinelearningpredictionandexperimentalvalidationofantigenicdriftinh3influenzaavirusesinswine
AT vincentamyl machinelearningpredictionandexperimentalvalidationofantigenicdriftinh3influenzaavirusesinswine
AT andersontavisk machinelearningpredictionandexperimentalvalidationofantigenicdriftinh3influenzaavirusesinswine