Cargando…

Detection of viral sequence fragments of HIV-1 subfamilies yet unknown

BACKGROUND: Methods of determining whether or not any particular HIV-1 sequence stems - completely or in part - from some unknown HIV-1 subtype are important for the design of vaccines and molecular detection systems, as well as for epidemiological monitoring. Nevertheless, a single algorithm only,...

Descripción completa

Detalles Bibliográficos
Autores principales: Unterthiner, Thomas, Schultz, Anne-Kathrin, Bulla, Jan, Morgenstern, Burkhard, Stanke, Mario, Bulla, Ingo
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3086866/
https://www.ncbi.nlm.nih.gov/pubmed/21481263
http://dx.doi.org/10.1186/1471-2105-12-93
_version_ 1782202717903519744
author Unterthiner, Thomas
Schultz, Anne-Kathrin
Bulla, Jan
Morgenstern, Burkhard
Stanke, Mario
Bulla, Ingo
author_facet Unterthiner, Thomas
Schultz, Anne-Kathrin
Bulla, Jan
Morgenstern, Burkhard
Stanke, Mario
Bulla, Ingo
author_sort Unterthiner, Thomas
collection PubMed
description BACKGROUND: Methods of determining whether or not any particular HIV-1 sequence stems - completely or in part - from some unknown HIV-1 subtype are important for the design of vaccines and molecular detection systems, as well as for epidemiological monitoring. Nevertheless, a single algorithm only, the Branching Index (BI), has been developed for this task so far. Moving along the genome of a query sequence in a sliding window, the BI computes a ratio quantifying how closely the query sequence clusters with a subtype clade. In its current version, however, the BI does not provide predicted boundaries of unknown fragments. RESULTS: We have developed Unknown Subtype Finder (USF), an algorithm based on a probabilistic model, which automatically determines which parts of an input sequence originate from a subtype yet unknown. The underlying model is based on a simple profile hidden Markov model (pHMM) for each known subtype and an additional pHMM for an unknown subtype. The emission probabilities of the latter are estimated using the emission frequencies of the known subtypes by means of a (position-wise) probabilistic model for the emergence of new subtypes. We have applied USF to SIV and HIV-1 sequences formerly classified as having emerged from an unknown subtype. Moreover, we have evaluated its performance on artificial HIV-1 recombinants and non-recombinant HIV-1 sequences. The results have been compared with the corresponding results of the BI. CONCLUSIONS: Our results demonstrate that USF is suitable for detecting segments in HIV-1 sequences stemming from yet unknown subtypes. Comparing USF with the BI shows that our algorithm performs as good as the BI or better.
format Text
id pubmed-3086866
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-30868662011-05-04 Detection of viral sequence fragments of HIV-1 subfamilies yet unknown Unterthiner, Thomas Schultz, Anne-Kathrin Bulla, Jan Morgenstern, Burkhard Stanke, Mario Bulla, Ingo BMC Bioinformatics Research Article BACKGROUND: Methods of determining whether or not any particular HIV-1 sequence stems - completely or in part - from some unknown HIV-1 subtype are important for the design of vaccines and molecular detection systems, as well as for epidemiological monitoring. Nevertheless, a single algorithm only, the Branching Index (BI), has been developed for this task so far. Moving along the genome of a query sequence in a sliding window, the BI computes a ratio quantifying how closely the query sequence clusters with a subtype clade. In its current version, however, the BI does not provide predicted boundaries of unknown fragments. RESULTS: We have developed Unknown Subtype Finder (USF), an algorithm based on a probabilistic model, which automatically determines which parts of an input sequence originate from a subtype yet unknown. The underlying model is based on a simple profile hidden Markov model (pHMM) for each known subtype and an additional pHMM for an unknown subtype. The emission probabilities of the latter are estimated using the emission frequencies of the known subtypes by means of a (position-wise) probabilistic model for the emergence of new subtypes. We have applied USF to SIV and HIV-1 sequences formerly classified as having emerged from an unknown subtype. Moreover, we have evaluated its performance on artificial HIV-1 recombinants and non-recombinant HIV-1 sequences. The results have been compared with the corresponding results of the BI. CONCLUSIONS: Our results demonstrate that USF is suitable for detecting segments in HIV-1 sequences stemming from yet unknown subtypes. Comparing USF with the BI shows that our algorithm performs as good as the BI or better. BioMed Central 2011-04-11 /pmc/articles/PMC3086866/ /pubmed/21481263 http://dx.doi.org/10.1186/1471-2105-12-93 Text en Copyright ©2011 Unterthiner et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Unterthiner, Thomas
Schultz, Anne-Kathrin
Bulla, Jan
Morgenstern, Burkhard
Stanke, Mario
Bulla, Ingo
Detection of viral sequence fragments of HIV-1 subfamilies yet unknown
title Detection of viral sequence fragments of HIV-1 subfamilies yet unknown
title_full Detection of viral sequence fragments of HIV-1 subfamilies yet unknown
title_fullStr Detection of viral sequence fragments of HIV-1 subfamilies yet unknown
title_full_unstemmed Detection of viral sequence fragments of HIV-1 subfamilies yet unknown
title_short Detection of viral sequence fragments of HIV-1 subfamilies yet unknown
title_sort detection of viral sequence fragments of hiv-1 subfamilies yet unknown
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3086866/
https://www.ncbi.nlm.nih.gov/pubmed/21481263
http://dx.doi.org/10.1186/1471-2105-12-93
work_keys_str_mv AT unterthinerthomas detectionofviralsequencefragmentsofhiv1subfamiliesyetunknown
AT schultzannekathrin detectionofviralsequencefragmentsofhiv1subfamiliesyetunknown
AT bullajan detectionofviralsequencefragmentsofhiv1subfamiliesyetunknown
AT morgensternburkhard detectionofviralsequencefragmentsofhiv1subfamiliesyetunknown
AT stankemario detectionofviralsequencefragmentsofhiv1subfamiliesyetunknown
AT bullaingo detectionofviralsequencefragmentsofhiv1subfamiliesyetunknown