Cargando…

Revisiting the recombinant history of HIV-1 group M with dynamic network community detection

The prevailing abundance of full-length HIV type 1 (HIV-1) genome sequences provides an opportunity to revisit the standard model of HIV-1 group M (HIV-1/M) diversity that clusters genomes into largely nonrecombinant subtypes, which is not consistent with recent evidence of deep recombinant historie...

Descripción completa

Detalles Bibliográficos
Autores principales: Olabode, Abayomi S., Ng, Garway T., Wade, Kaitlyn E., Salnikov, Mikhail, Grant, Heather E., Dick, David W., Poon, Art F. Y.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: National Academy of Sciences 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9171507/
https://www.ncbi.nlm.nih.gov/pubmed/35500121
http://dx.doi.org/10.1073/pnas.2108815119
_version_ 1784721682358861824
author Olabode, Abayomi S.
Ng, Garway T.
Wade, Kaitlyn E.
Salnikov, Mikhail
Grant, Heather E.
Dick, David W.
Poon, Art F. Y.
author_facet Olabode, Abayomi S.
Ng, Garway T.
Wade, Kaitlyn E.
Salnikov, Mikhail
Grant, Heather E.
Dick, David W.
Poon, Art F. Y.
author_sort Olabode, Abayomi S.
collection PubMed
description The prevailing abundance of full-length HIV type 1 (HIV-1) genome sequences provides an opportunity to revisit the standard model of HIV-1 group M (HIV-1/M) diversity that clusters genomes into largely nonrecombinant subtypes, which is not consistent with recent evidence of deep recombinant histories for simian immunodeficiency virus (SIV) and other HIV-1 groups. Here we develop an unsupervised nonparametric clustering approach, which does not rely on predefined nonrecombinant genomes, by adapting a community detection method developed for dynamic social network analysis. We show that this method (dynamic stochastic block model [DSBM]) attains a significantly lower mean error rate in detecting recombinant breakpoints in simulated data (quasibinomial generalized linear model (GLM), [Formula: see text]), compared to other reference-free recombination detection programs (genetic algorithm for recombination detection [GARD], recombination detection program 4 [RDP4], and RDP5). When this method was applied to a representative sample of n = 525 actual HIV-1 genomes, we determined k = 29 as the optimal number of DSBM clusters and used change-point detection to estimate that at least 95% of these genomes are recombinant. Further, we identified both known and undocumented recombination hotspots in the HIV-1 genome and evidence of intersubtype recombination in HIV-1 subtype reference genomes. We propose that clusters generated by DSBM can provide an informative framework for HIV-1 classification.
format Online
Article
Text
id pubmed-9171507
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher National Academy of Sciences
record_format MEDLINE/PubMed
spelling pubmed-91715072022-11-02 Revisiting the recombinant history of HIV-1 group M with dynamic network community detection Olabode, Abayomi S. Ng, Garway T. Wade, Kaitlyn E. Salnikov, Mikhail Grant, Heather E. Dick, David W. Poon, Art F. Y. Proc Natl Acad Sci U S A Biological Sciences The prevailing abundance of full-length HIV type 1 (HIV-1) genome sequences provides an opportunity to revisit the standard model of HIV-1 group M (HIV-1/M) diversity that clusters genomes into largely nonrecombinant subtypes, which is not consistent with recent evidence of deep recombinant histories for simian immunodeficiency virus (SIV) and other HIV-1 groups. Here we develop an unsupervised nonparametric clustering approach, which does not rely on predefined nonrecombinant genomes, by adapting a community detection method developed for dynamic social network analysis. We show that this method (dynamic stochastic block model [DSBM]) attains a significantly lower mean error rate in detecting recombinant breakpoints in simulated data (quasibinomial generalized linear model (GLM), [Formula: see text]), compared to other reference-free recombination detection programs (genetic algorithm for recombination detection [GARD], recombination detection program 4 [RDP4], and RDP5). When this method was applied to a representative sample of n = 525 actual HIV-1 genomes, we determined k = 29 as the optimal number of DSBM clusters and used change-point detection to estimate that at least 95% of these genomes are recombinant. Further, we identified both known and undocumented recombination hotspots in the HIV-1 genome and evidence of intersubtype recombination in HIV-1 subtype reference genomes. We propose that clusters generated by DSBM can provide an informative framework for HIV-1 classification. National Academy of Sciences 2022-05-02 2022-05-10 /pmc/articles/PMC9171507/ /pubmed/35500121 http://dx.doi.org/10.1073/pnas.2108815119 Text en Copyright © 2022 the Author(s). Published by PNAS. https://creativecommons.org/licenses/by-nc-nd/4.0/This article is distributed under Creative Commons Attribution-NonCommercial-NoDerivatives License 4.0 (CC BY-NC-ND) (https://creativecommons.org/licenses/by-nc-nd/4.0/) .
spellingShingle Biological Sciences
Olabode, Abayomi S.
Ng, Garway T.
Wade, Kaitlyn E.
Salnikov, Mikhail
Grant, Heather E.
Dick, David W.
Poon, Art F. Y.
Revisiting the recombinant history of HIV-1 group M with dynamic network community detection
title Revisiting the recombinant history of HIV-1 group M with dynamic network community detection
title_full Revisiting the recombinant history of HIV-1 group M with dynamic network community detection
title_fullStr Revisiting the recombinant history of HIV-1 group M with dynamic network community detection
title_full_unstemmed Revisiting the recombinant history of HIV-1 group M with dynamic network community detection
title_short Revisiting the recombinant history of HIV-1 group M with dynamic network community detection
title_sort revisiting the recombinant history of hiv-1 group m with dynamic network community detection
topic Biological Sciences
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9171507/
https://www.ncbi.nlm.nih.gov/pubmed/35500121
http://dx.doi.org/10.1073/pnas.2108815119
work_keys_str_mv AT olabodeabayomis revisitingtherecombinanthistoryofhiv1groupmwithdynamicnetworkcommunitydetection
AT nggarwayt revisitingtherecombinanthistoryofhiv1groupmwithdynamicnetworkcommunitydetection
AT wadekaitlyne revisitingtherecombinanthistoryofhiv1groupmwithdynamicnetworkcommunitydetection
AT salnikovmikhail revisitingtherecombinanthistoryofhiv1groupmwithdynamicnetworkcommunitydetection
AT grantheathere revisitingtherecombinanthistoryofhiv1groupmwithdynamicnetworkcommunitydetection
AT dickdavidw revisitingtherecombinanthistoryofhiv1groupmwithdynamicnetworkcommunitydetection
AT poonartfy revisitingtherecombinanthistoryofhiv1groupmwithdynamicnetworkcommunitydetection