Cargando…
IDEPI: Rapid Prediction of HIV-1 Antibody Epitopes and Other Phenotypic Features from Sequence Data Using a Flexible Machine Learning Platform
Since its identification in 1983, HIV-1 has been the focus of a research effort unprecedented in scope and difficulty, whose ultimate goals — a cure and a vaccine – remain elusive. One of the fundamental challenges in accomplishing these goals is the tremendous genetic variability of the virus, with...
Autores principales: | , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4177671/ https://www.ncbi.nlm.nih.gov/pubmed/25254639 http://dx.doi.org/10.1371/journal.pcbi.1003842 |
_version_ | 1782336810090758144 |
---|---|
author | Hepler, N. Lance Scheffler, Konrad Weaver, Steven Murrell, Ben Richman, Douglas D. Burton, Dennis R. Poignard, Pascal Smith, Davey M. Kosakovsky Pond, Sergei L. |
author_facet | Hepler, N. Lance Scheffler, Konrad Weaver, Steven Murrell, Ben Richman, Douglas D. Burton, Dennis R. Poignard, Pascal Smith, Davey M. Kosakovsky Pond, Sergei L. |
author_sort | Hepler, N. Lance |
collection | PubMed |
description | Since its identification in 1983, HIV-1 has been the focus of a research effort unprecedented in scope and difficulty, whose ultimate goals — a cure and a vaccine – remain elusive. One of the fundamental challenges in accomplishing these goals is the tremendous genetic variability of the virus, with some genes differing at as many as 40% of nucleotide positions among circulating strains. Because of this, the genetic bases of many viral phenotypes, most notably the susceptibility to neutralization by a particular antibody, are difficult to identify computationally. Drawing upon open-source general-purpose machine learning algorithms and libraries, we have developed a software package IDEPI (IDentify EPItopes) for learning genotype-to-phenotype predictive models from sequences with known phenotypes. IDEPI can apply learned models to classify sequences of unknown phenotypes, and also identify specific sequence features which contribute to a particular phenotype. We demonstrate that IDEPI achieves performance similar to or better than that of previously published approaches on four well-studied problems: finding the epitopes of broadly neutralizing antibodies (bNab), determining coreceptor tropism of the virus, identifying compartment-specific genetic signatures of the virus, and deducing drug-resistance associated mutations. The cross-platform Python source code (released under the GPL 3.0 license), documentation, issue tracking, and a pre-configured virtual machine for IDEPI can be found at https://github.com/veg/idepi. |
format | Online Article Text |
id | pubmed-4177671 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-41776712014-10-02 IDEPI: Rapid Prediction of HIV-1 Antibody Epitopes and Other Phenotypic Features from Sequence Data Using a Flexible Machine Learning Platform Hepler, N. Lance Scheffler, Konrad Weaver, Steven Murrell, Ben Richman, Douglas D. Burton, Dennis R. Poignard, Pascal Smith, Davey M. Kosakovsky Pond, Sergei L. PLoS Comput Biol Research Article Since its identification in 1983, HIV-1 has been the focus of a research effort unprecedented in scope and difficulty, whose ultimate goals — a cure and a vaccine – remain elusive. One of the fundamental challenges in accomplishing these goals is the tremendous genetic variability of the virus, with some genes differing at as many as 40% of nucleotide positions among circulating strains. Because of this, the genetic bases of many viral phenotypes, most notably the susceptibility to neutralization by a particular antibody, are difficult to identify computationally. Drawing upon open-source general-purpose machine learning algorithms and libraries, we have developed a software package IDEPI (IDentify EPItopes) for learning genotype-to-phenotype predictive models from sequences with known phenotypes. IDEPI can apply learned models to classify sequences of unknown phenotypes, and also identify specific sequence features which contribute to a particular phenotype. We demonstrate that IDEPI achieves performance similar to or better than that of previously published approaches on four well-studied problems: finding the epitopes of broadly neutralizing antibodies (bNab), determining coreceptor tropism of the virus, identifying compartment-specific genetic signatures of the virus, and deducing drug-resistance associated mutations. The cross-platform Python source code (released under the GPL 3.0 license), documentation, issue tracking, and a pre-configured virtual machine for IDEPI can be found at https://github.com/veg/idepi. Public Library of Science 2014-09-25 /pmc/articles/PMC4177671/ /pubmed/25254639 http://dx.doi.org/10.1371/journal.pcbi.1003842 Text en © 2014 Hepler et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited. |
spellingShingle | Research Article Hepler, N. Lance Scheffler, Konrad Weaver, Steven Murrell, Ben Richman, Douglas D. Burton, Dennis R. Poignard, Pascal Smith, Davey M. Kosakovsky Pond, Sergei L. IDEPI: Rapid Prediction of HIV-1 Antibody Epitopes and Other Phenotypic Features from Sequence Data Using a Flexible Machine Learning Platform |
title | IDEPI: Rapid Prediction of HIV-1 Antibody Epitopes and Other Phenotypic Features from Sequence Data Using a Flexible Machine Learning Platform |
title_full | IDEPI: Rapid Prediction of HIV-1 Antibody Epitopes and Other Phenotypic Features from Sequence Data Using a Flexible Machine Learning Platform |
title_fullStr | IDEPI: Rapid Prediction of HIV-1 Antibody Epitopes and Other Phenotypic Features from Sequence Data Using a Flexible Machine Learning Platform |
title_full_unstemmed | IDEPI: Rapid Prediction of HIV-1 Antibody Epitopes and Other Phenotypic Features from Sequence Data Using a Flexible Machine Learning Platform |
title_short | IDEPI: Rapid Prediction of HIV-1 Antibody Epitopes and Other Phenotypic Features from Sequence Data Using a Flexible Machine Learning Platform |
title_sort | idepi: rapid prediction of hiv-1 antibody epitopes and other phenotypic features from sequence data using a flexible machine learning platform |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4177671/ https://www.ncbi.nlm.nih.gov/pubmed/25254639 http://dx.doi.org/10.1371/journal.pcbi.1003842 |
work_keys_str_mv | AT heplernlance idepirapidpredictionofhiv1antibodyepitopesandotherphenotypicfeaturesfromsequencedatausingaflexiblemachinelearningplatform AT schefflerkonrad idepirapidpredictionofhiv1antibodyepitopesandotherphenotypicfeaturesfromsequencedatausingaflexiblemachinelearningplatform AT weaversteven idepirapidpredictionofhiv1antibodyepitopesandotherphenotypicfeaturesfromsequencedatausingaflexiblemachinelearningplatform AT murrellben idepirapidpredictionofhiv1antibodyepitopesandotherphenotypicfeaturesfromsequencedatausingaflexiblemachinelearningplatform AT richmandouglasd idepirapidpredictionofhiv1antibodyepitopesandotherphenotypicfeaturesfromsequencedatausingaflexiblemachinelearningplatform AT burtondennisr idepirapidpredictionofhiv1antibodyepitopesandotherphenotypicfeaturesfromsequencedatausingaflexiblemachinelearningplatform AT poignardpascal idepirapidpredictionofhiv1antibodyepitopesandotherphenotypicfeaturesfromsequencedatausingaflexiblemachinelearningplatform AT smithdaveym idepirapidpredictionofhiv1antibodyepitopesandotherphenotypicfeaturesfromsequencedatausingaflexiblemachinelearningplatform AT kosakovskypondsergeil idepirapidpredictionofhiv1antibodyepitopesandotherphenotypicfeaturesfromsequencedatausingaflexiblemachinelearningplatform |