Cargando…
Applying machine learning to predict viral assembly for adeno-associated virus capsid libraries
Machine learning (ML) can aid in novel discoveries in the field of viral gene therapy. Specifically, big data gathered through next-generation sequencing (NGS) of complex capsid libraries is an especially prominent source of lost potential in data analysis and prediction. Furthermore, adeno-associat...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
American Society of Gene & Cell Therapy
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7809249/ https://www.ncbi.nlm.nih.gov/pubmed/33511242 http://dx.doi.org/10.1016/j.omtm.2020.11.017 |
_version_ | 1783637081003655168 |
---|---|
author | Marques, Andrew D. Kummer, Michael Kondratov, Oleksandr Banerjee, Arunava Moskalenko, Oleksandr Zolotukhin, Sergei |
author_facet | Marques, Andrew D. Kummer, Michael Kondratov, Oleksandr Banerjee, Arunava Moskalenko, Oleksandr Zolotukhin, Sergei |
author_sort | Marques, Andrew D. |
collection | PubMed |
description | Machine learning (ML) can aid in novel discoveries in the field of viral gene therapy. Specifically, big data gathered through next-generation sequencing (NGS) of complex capsid libraries is an especially prominent source of lost potential in data analysis and prediction. Furthermore, adeno-associated virus (AAV)-based capsid libraries are becoming increasingly popular as a tool to select candidates for gene therapy vectors. These higher complexity AAV capsid libraries have previously been created and selected in vivo; however, in silico analysis using ML computer algorithms may augment smarter and more robust libraries for selection. In this study, data of AAV capsid libraries gathered before and after viral assembly are used to train ML algorithms. We found that two ML computer algorithms, artificial neural networks (ANNs), and support vector machines (SVMs), can be trained to predict whether unknown capsid variants may assemble into viable virus-like structures. Using the most accurate models constructed, hypothetical mutation patterns in library construction were simulated to suggest the importance of N495, G546, and I554 in AAV2-derived capsids. Finally, two comparative libraries were generated using ML-derived data to biologically validate these findings and demonstrate the predictive power of ML in vector design. |
format | Online Article Text |
id | pubmed-7809249 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | American Society of Gene & Cell Therapy |
record_format | MEDLINE/PubMed |
spelling | pubmed-78092492021-01-27 Applying machine learning to predict viral assembly for adeno-associated virus capsid libraries Marques, Andrew D. Kummer, Michael Kondratov, Oleksandr Banerjee, Arunava Moskalenko, Oleksandr Zolotukhin, Sergei Mol Ther Methods Clin Dev Original Article Machine learning (ML) can aid in novel discoveries in the field of viral gene therapy. Specifically, big data gathered through next-generation sequencing (NGS) of complex capsid libraries is an especially prominent source of lost potential in data analysis and prediction. Furthermore, adeno-associated virus (AAV)-based capsid libraries are becoming increasingly popular as a tool to select candidates for gene therapy vectors. These higher complexity AAV capsid libraries have previously been created and selected in vivo; however, in silico analysis using ML computer algorithms may augment smarter and more robust libraries for selection. In this study, data of AAV capsid libraries gathered before and after viral assembly are used to train ML algorithms. We found that two ML computer algorithms, artificial neural networks (ANNs), and support vector machines (SVMs), can be trained to predict whether unknown capsid variants may assemble into viable virus-like structures. Using the most accurate models constructed, hypothetical mutation patterns in library construction were simulated to suggest the importance of N495, G546, and I554 in AAV2-derived capsids. Finally, two comparative libraries were generated using ML-derived data to biologically validate these findings and demonstrate the predictive power of ML in vector design. American Society of Gene & Cell Therapy 2020-12-03 /pmc/articles/PMC7809249/ /pubmed/33511242 http://dx.doi.org/10.1016/j.omtm.2020.11.017 Text en © 2020 The Authors http://creativecommons.org/licenses/by-nc-nd/4.0/ This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/). |
spellingShingle | Original Article Marques, Andrew D. Kummer, Michael Kondratov, Oleksandr Banerjee, Arunava Moskalenko, Oleksandr Zolotukhin, Sergei Applying machine learning to predict viral assembly for adeno-associated virus capsid libraries |
title | Applying machine learning to predict viral assembly for adeno-associated virus capsid libraries |
title_full | Applying machine learning to predict viral assembly for adeno-associated virus capsid libraries |
title_fullStr | Applying machine learning to predict viral assembly for adeno-associated virus capsid libraries |
title_full_unstemmed | Applying machine learning to predict viral assembly for adeno-associated virus capsid libraries |
title_short | Applying machine learning to predict viral assembly for adeno-associated virus capsid libraries |
title_sort | applying machine learning to predict viral assembly for adeno-associated virus capsid libraries |
topic | Original Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7809249/ https://www.ncbi.nlm.nih.gov/pubmed/33511242 http://dx.doi.org/10.1016/j.omtm.2020.11.017 |
work_keys_str_mv | AT marquesandrewd applyingmachinelearningtopredictviralassemblyforadenoassociatedviruscapsidlibraries AT kummermichael applyingmachinelearningtopredictviralassemblyforadenoassociatedviruscapsidlibraries AT kondratovoleksandr applyingmachinelearningtopredictviralassemblyforadenoassociatedviruscapsidlibraries AT banerjeearunava applyingmachinelearningtopredictviralassemblyforadenoassociatedviruscapsidlibraries AT moskalenkooleksandr applyingmachinelearningtopredictviralassemblyforadenoassociatedviruscapsidlibraries AT zolotukhinsergei applyingmachinelearningtopredictviralassemblyforadenoassociatedviruscapsidlibraries |