Cargando…

Streamlined Subpopulation, Subtype, and Recombination Analysis of HIV-1 Half-Genome Sequences Generated by High-Throughput Sequencing

High-throughput sequencing (HTS) has been widely used to characterize HIV-1 genome sequences. There are no algorithms currently that can directly determine genotype and quasispecies population using short HTS reads generated from long genome sequences without additional software. To establish a robu...

Descripción completa

Detalles Bibliográficos
Autores principales: Hora, Bhavna, Gulzar, Naila, Chen, Yue, Karagiannis, Konstantinos, Cai, Fangping, Su, Chang, Smith, Krista, Simonyan, Vahan, Shah, Sharaf Ali, Ahmed, Manzoor, Sanchez, Ana M., Stone, Mars, Cohen, Myron S., Denny, Thomas N., Mazumder, Raja, Gao, Feng
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Society for Microbiology 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7565892/
https://www.ncbi.nlm.nih.gov/pubmed/33055255
http://dx.doi.org/10.1128/mSphere.00551-20
_version_ 1783596032978845696
author Hora, Bhavna
Gulzar, Naila
Chen, Yue
Karagiannis, Konstantinos
Cai, Fangping
Su, Chang
Smith, Krista
Simonyan, Vahan
Shah, Sharaf Ali
Ahmed, Manzoor
Sanchez, Ana M.
Stone, Mars
Cohen, Myron S.
Denny, Thomas N.
Mazumder, Raja
Gao, Feng
author_facet Hora, Bhavna
Gulzar, Naila
Chen, Yue
Karagiannis, Konstantinos
Cai, Fangping
Su, Chang
Smith, Krista
Simonyan, Vahan
Shah, Sharaf Ali
Ahmed, Manzoor
Sanchez, Ana M.
Stone, Mars
Cohen, Myron S.
Denny, Thomas N.
Mazumder, Raja
Gao, Feng
author_sort Hora, Bhavna
collection PubMed
description High-throughput sequencing (HTS) has been widely used to characterize HIV-1 genome sequences. There are no algorithms currently that can directly determine genotype and quasispecies population using short HTS reads generated from long genome sequences without additional software. To establish a robust subpopulation, subtype, and recombination analysis workflow, we amplified the HIV-1 3′-half genome from plasma samples of 65 HIV-1-infected individuals and sequenced the entire amplicon (∼4,500 bp) by HTS. With direct analysis of raw reads using HIVE-hexahedron, we showed that 48% of samples harbored 2 to 13 subpopulations. We identified various subtypes (17 A1s, 4 Bs, 27 Cs, 6 CRF02_AGs, and 11 unique recombinant forms) and defined recombinant breakpoints of 10 recombinants. These results were validated with viral genome sequences generated by single genome sequencing (SGS) or the analysis of consensus sequence of the HTS reads. The HIVE-hexahedron workflow is more sensitive and accurate than just evaluating the consensus sequence and also more cost-effective than SGS. IMPORTANCE The highly recombinogenic nature of human immunodeficiency virus type 1 (HIV-1) leads to recombination and emergence of quasispecies. It is important to reliably identify subpopulations to understand the complexity of a viral population for drug resistance surveillance and vaccine development. High-throughput sequencing (HTS) provides improved resolution over Sanger sequencing for the analysis of heterogeneous viral subpopulations. However, current methods of analysis of HTS reads are unable to fully address accurate population reconstruction. Hence, there is a dire need for a more sensitive, accurate, user-friendly, and cost-effective method to analyze viral quasispecies. For this purpose, we have improved the HIVE-hexahedron algorithm that we previously developed with in silico short sequences to analyze raw HTS short reads. The significance of this study is that our standalone algorithm enables a streamlined analysis of quasispecies, subtype, and recombination patterns from long HIV-1 genome regions without the need of additional sequence analysis tools. Distinct viral populations and recombination patterns identified by HIVE-hexahedron are further validated by comparison with sequences obtained by single genome sequencing (SGS).
format Online
Article
Text
id pubmed-7565892
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher American Society for Microbiology
record_format MEDLINE/PubMed
spelling pubmed-75658922020-10-27 Streamlined Subpopulation, Subtype, and Recombination Analysis of HIV-1 Half-Genome Sequences Generated by High-Throughput Sequencing Hora, Bhavna Gulzar, Naila Chen, Yue Karagiannis, Konstantinos Cai, Fangping Su, Chang Smith, Krista Simonyan, Vahan Shah, Sharaf Ali Ahmed, Manzoor Sanchez, Ana M. Stone, Mars Cohen, Myron S. Denny, Thomas N. Mazumder, Raja Gao, Feng mSphere Research Article High-throughput sequencing (HTS) has been widely used to characterize HIV-1 genome sequences. There are no algorithms currently that can directly determine genotype and quasispecies population using short HTS reads generated from long genome sequences without additional software. To establish a robust subpopulation, subtype, and recombination analysis workflow, we amplified the HIV-1 3′-half genome from plasma samples of 65 HIV-1-infected individuals and sequenced the entire amplicon (∼4,500 bp) by HTS. With direct analysis of raw reads using HIVE-hexahedron, we showed that 48% of samples harbored 2 to 13 subpopulations. We identified various subtypes (17 A1s, 4 Bs, 27 Cs, 6 CRF02_AGs, and 11 unique recombinant forms) and defined recombinant breakpoints of 10 recombinants. These results were validated with viral genome sequences generated by single genome sequencing (SGS) or the analysis of consensus sequence of the HTS reads. The HIVE-hexahedron workflow is more sensitive and accurate than just evaluating the consensus sequence and also more cost-effective than SGS. IMPORTANCE The highly recombinogenic nature of human immunodeficiency virus type 1 (HIV-1) leads to recombination and emergence of quasispecies. It is important to reliably identify subpopulations to understand the complexity of a viral population for drug resistance surveillance and vaccine development. High-throughput sequencing (HTS) provides improved resolution over Sanger sequencing for the analysis of heterogeneous viral subpopulations. However, current methods of analysis of HTS reads are unable to fully address accurate population reconstruction. Hence, there is a dire need for a more sensitive, accurate, user-friendly, and cost-effective method to analyze viral quasispecies. For this purpose, we have improved the HIVE-hexahedron algorithm that we previously developed with in silico short sequences to analyze raw HTS short reads. The significance of this study is that our standalone algorithm enables a streamlined analysis of quasispecies, subtype, and recombination patterns from long HIV-1 genome regions without the need of additional sequence analysis tools. Distinct viral populations and recombination patterns identified by HIVE-hexahedron are further validated by comparison with sequences obtained by single genome sequencing (SGS). American Society for Microbiology 2020-10-14 /pmc/articles/PMC7565892/ /pubmed/33055255 http://dx.doi.org/10.1128/mSphere.00551-20 Text en Copyright © 2020 Hora et al. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution 4.0 International license (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Research Article
Hora, Bhavna
Gulzar, Naila
Chen, Yue
Karagiannis, Konstantinos
Cai, Fangping
Su, Chang
Smith, Krista
Simonyan, Vahan
Shah, Sharaf Ali
Ahmed, Manzoor
Sanchez, Ana M.
Stone, Mars
Cohen, Myron S.
Denny, Thomas N.
Mazumder, Raja
Gao, Feng
Streamlined Subpopulation, Subtype, and Recombination Analysis of HIV-1 Half-Genome Sequences Generated by High-Throughput Sequencing
title Streamlined Subpopulation, Subtype, and Recombination Analysis of HIV-1 Half-Genome Sequences Generated by High-Throughput Sequencing
title_full Streamlined Subpopulation, Subtype, and Recombination Analysis of HIV-1 Half-Genome Sequences Generated by High-Throughput Sequencing
title_fullStr Streamlined Subpopulation, Subtype, and Recombination Analysis of HIV-1 Half-Genome Sequences Generated by High-Throughput Sequencing
title_full_unstemmed Streamlined Subpopulation, Subtype, and Recombination Analysis of HIV-1 Half-Genome Sequences Generated by High-Throughput Sequencing
title_short Streamlined Subpopulation, Subtype, and Recombination Analysis of HIV-1 Half-Genome Sequences Generated by High-Throughput Sequencing
title_sort streamlined subpopulation, subtype, and recombination analysis of hiv-1 half-genome sequences generated by high-throughput sequencing
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7565892/
https://www.ncbi.nlm.nih.gov/pubmed/33055255
http://dx.doi.org/10.1128/mSphere.00551-20
work_keys_str_mv AT horabhavna streamlinedsubpopulationsubtypeandrecombinationanalysisofhiv1halfgenomesequencesgeneratedbyhighthroughputsequencing
AT gulzarnaila streamlinedsubpopulationsubtypeandrecombinationanalysisofhiv1halfgenomesequencesgeneratedbyhighthroughputsequencing
AT chenyue streamlinedsubpopulationsubtypeandrecombinationanalysisofhiv1halfgenomesequencesgeneratedbyhighthroughputsequencing
AT karagianniskonstantinos streamlinedsubpopulationsubtypeandrecombinationanalysisofhiv1halfgenomesequencesgeneratedbyhighthroughputsequencing
AT caifangping streamlinedsubpopulationsubtypeandrecombinationanalysisofhiv1halfgenomesequencesgeneratedbyhighthroughputsequencing
AT suchang streamlinedsubpopulationsubtypeandrecombinationanalysisofhiv1halfgenomesequencesgeneratedbyhighthroughputsequencing
AT smithkrista streamlinedsubpopulationsubtypeandrecombinationanalysisofhiv1halfgenomesequencesgeneratedbyhighthroughputsequencing
AT simonyanvahan streamlinedsubpopulationsubtypeandrecombinationanalysisofhiv1halfgenomesequencesgeneratedbyhighthroughputsequencing
AT shahsharafali streamlinedsubpopulationsubtypeandrecombinationanalysisofhiv1halfgenomesequencesgeneratedbyhighthroughputsequencing
AT ahmedmanzoor streamlinedsubpopulationsubtypeandrecombinationanalysisofhiv1halfgenomesequencesgeneratedbyhighthroughputsequencing
AT sanchezanam streamlinedsubpopulationsubtypeandrecombinationanalysisofhiv1halfgenomesequencesgeneratedbyhighthroughputsequencing
AT stonemars streamlinedsubpopulationsubtypeandrecombinationanalysisofhiv1halfgenomesequencesgeneratedbyhighthroughputsequencing
AT cohenmyrons streamlinedsubpopulationsubtypeandrecombinationanalysisofhiv1halfgenomesequencesgeneratedbyhighthroughputsequencing
AT dennythomasn streamlinedsubpopulationsubtypeandrecombinationanalysisofhiv1halfgenomesequencesgeneratedbyhighthroughputsequencing
AT mazumderraja streamlinedsubpopulationsubtypeandrecombinationanalysisofhiv1halfgenomesequencesgeneratedbyhighthroughputsequencing
AT gaofeng streamlinedsubpopulationsubtypeandrecombinationanalysisofhiv1halfgenomesequencesgeneratedbyhighthroughputsequencing