Cargando…

SNAPPy: A snakemake pipeline for scalable HIV-1 subtyping by phylogenetic pairing

Human immunodeficiency virus 1 (HIV-1) genome sequencing is routinely done for drug resistance monitoring in hospitals worldwide. Subtyping these extensive datasets of HIV-1 sequences is a critical first step in molecular epidemiology and evolution studies. The clinical relevance of HIV-1 subtypes i...

Descripción completa

Detalles Bibliográficos
Autores principales: Araújo, Pedro M M, Martins, Joana S, Osório, Nuno S
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6863187/
https://www.ncbi.nlm.nih.gov/pubmed/31768265
http://dx.doi.org/10.1093/ve/vez050
_version_ 1783471702756294656
author Araújo, Pedro M M
Martins, Joana S
Osório, Nuno S
author_facet Araújo, Pedro M M
Martins, Joana S
Osório, Nuno S
author_sort Araújo, Pedro M M
collection PubMed
description Human immunodeficiency virus 1 (HIV-1) genome sequencing is routinely done for drug resistance monitoring in hospitals worldwide. Subtyping these extensive datasets of HIV-1 sequences is a critical first step in molecular epidemiology and evolution studies. The clinical relevance of HIV-1 subtypes is increasingly recognized. Several studies suggest subtype-related differences in disease progression, transmission route efficiency, immune evasion, and even therapeutic outcomes. HIV-1 subtyping is mainly done using web-servers. These tools have limitations in scalability and potential noncompliance with data protection legislation. Thus, the aim of this work was to develop an efficient method for large-scale local HIV-1 subtyping. We designed SNAPPy: a snakemake pipeline for scalable HIV-1 subtyping by phylogenetic pairing. It contains several tasks of phylogenetic inference and BLAST queries, which can be executed sequentially or in parallel, taking advantage of multiple-core processing units. Although it was built for subtyping, SNAPPy is also useful to perform extensive HIV-1 alignments. This tool facilitates large-scale sequence-based HIV-1 research by providing a local, resource efficient and scalable alternative for HIV-1 subtyping. It is capable of analyzing full-length genomes or partial HIV-1 genomic regions (GAG, POL, and ENV) and recognizes more than ninety circulating recombinant forms. SNAPPy is freely available at: https://github.com/PMMAraujo/snappy/releases.
format Online
Article
Text
id pubmed-6863187
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-68631872019-11-25 SNAPPy: A snakemake pipeline for scalable HIV-1 subtyping by phylogenetic pairing Araújo, Pedro M M Martins, Joana S Osório, Nuno S Virus Evol Resources Human immunodeficiency virus 1 (HIV-1) genome sequencing is routinely done for drug resistance monitoring in hospitals worldwide. Subtyping these extensive datasets of HIV-1 sequences is a critical first step in molecular epidemiology and evolution studies. The clinical relevance of HIV-1 subtypes is increasingly recognized. Several studies suggest subtype-related differences in disease progression, transmission route efficiency, immune evasion, and even therapeutic outcomes. HIV-1 subtyping is mainly done using web-servers. These tools have limitations in scalability and potential noncompliance with data protection legislation. Thus, the aim of this work was to develop an efficient method for large-scale local HIV-1 subtyping. We designed SNAPPy: a snakemake pipeline for scalable HIV-1 subtyping by phylogenetic pairing. It contains several tasks of phylogenetic inference and BLAST queries, which can be executed sequentially or in parallel, taking advantage of multiple-core processing units. Although it was built for subtyping, SNAPPy is also useful to perform extensive HIV-1 alignments. This tool facilitates large-scale sequence-based HIV-1 research by providing a local, resource efficient and scalable alternative for HIV-1 subtyping. It is capable of analyzing full-length genomes or partial HIV-1 genomic regions (GAG, POL, and ENV) and recognizes more than ninety circulating recombinant forms. SNAPPy is freely available at: https://github.com/PMMAraujo/snappy/releases. Oxford University Press 2019-11-19 /pmc/articles/PMC6863187/ /pubmed/31768265 http://dx.doi.org/10.1093/ve/vez050 Text en © The Author(s) 2019. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Resources
Araújo, Pedro M M
Martins, Joana S
Osório, Nuno S
SNAPPy: A snakemake pipeline for scalable HIV-1 subtyping by phylogenetic pairing
title SNAPPy: A snakemake pipeline for scalable HIV-1 subtyping by phylogenetic pairing
title_full SNAPPy: A snakemake pipeline for scalable HIV-1 subtyping by phylogenetic pairing
title_fullStr SNAPPy: A snakemake pipeline for scalable HIV-1 subtyping by phylogenetic pairing
title_full_unstemmed SNAPPy: A snakemake pipeline for scalable HIV-1 subtyping by phylogenetic pairing
title_short SNAPPy: A snakemake pipeline for scalable HIV-1 subtyping by phylogenetic pairing
title_sort snappy: a snakemake pipeline for scalable hiv-1 subtyping by phylogenetic pairing
topic Resources
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6863187/
https://www.ncbi.nlm.nih.gov/pubmed/31768265
http://dx.doi.org/10.1093/ve/vez050
work_keys_str_mv AT araujopedromm snappyasnakemakepipelineforscalablehiv1subtypingbyphylogeneticpairing
AT martinsjoanas snappyasnakemakepipelineforscalablehiv1subtypingbyphylogeneticpairing
AT osorionunos snappyasnakemakepipelineforscalablehiv1subtypingbyphylogeneticpairing