Cargando…
SNAPPy: A snakemake pipeline for scalable HIV-1 subtyping by phylogenetic pairing
Human immunodeficiency virus 1 (HIV-1) genome sequencing is routinely done for drug resistance monitoring in hospitals worldwide. Subtyping these extensive datasets of HIV-1 sequences is a critical first step in molecular epidemiology and evolution studies. The clinical relevance of HIV-1 subtypes i...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6863187/ https://www.ncbi.nlm.nih.gov/pubmed/31768265 http://dx.doi.org/10.1093/ve/vez050 |
_version_ | 1783471702756294656 |
---|---|
author | Araújo, Pedro M M Martins, Joana S Osório, Nuno S |
author_facet | Araújo, Pedro M M Martins, Joana S Osório, Nuno S |
author_sort | Araújo, Pedro M M |
collection | PubMed |
description | Human immunodeficiency virus 1 (HIV-1) genome sequencing is routinely done for drug resistance monitoring in hospitals worldwide. Subtyping these extensive datasets of HIV-1 sequences is a critical first step in molecular epidemiology and evolution studies. The clinical relevance of HIV-1 subtypes is increasingly recognized. Several studies suggest subtype-related differences in disease progression, transmission route efficiency, immune evasion, and even therapeutic outcomes. HIV-1 subtyping is mainly done using web-servers. These tools have limitations in scalability and potential noncompliance with data protection legislation. Thus, the aim of this work was to develop an efficient method for large-scale local HIV-1 subtyping. We designed SNAPPy: a snakemake pipeline for scalable HIV-1 subtyping by phylogenetic pairing. It contains several tasks of phylogenetic inference and BLAST queries, which can be executed sequentially or in parallel, taking advantage of multiple-core processing units. Although it was built for subtyping, SNAPPy is also useful to perform extensive HIV-1 alignments. This tool facilitates large-scale sequence-based HIV-1 research by providing a local, resource efficient and scalable alternative for HIV-1 subtyping. It is capable of analyzing full-length genomes or partial HIV-1 genomic regions (GAG, POL, and ENV) and recognizes more than ninety circulating recombinant forms. SNAPPy is freely available at: https://github.com/PMMAraujo/snappy/releases. |
format | Online Article Text |
id | pubmed-6863187 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-68631872019-11-25 SNAPPy: A snakemake pipeline for scalable HIV-1 subtyping by phylogenetic pairing Araújo, Pedro M M Martins, Joana S Osório, Nuno S Virus Evol Resources Human immunodeficiency virus 1 (HIV-1) genome sequencing is routinely done for drug resistance monitoring in hospitals worldwide. Subtyping these extensive datasets of HIV-1 sequences is a critical first step in molecular epidemiology and evolution studies. The clinical relevance of HIV-1 subtypes is increasingly recognized. Several studies suggest subtype-related differences in disease progression, transmission route efficiency, immune evasion, and even therapeutic outcomes. HIV-1 subtyping is mainly done using web-servers. These tools have limitations in scalability and potential noncompliance with data protection legislation. Thus, the aim of this work was to develop an efficient method for large-scale local HIV-1 subtyping. We designed SNAPPy: a snakemake pipeline for scalable HIV-1 subtyping by phylogenetic pairing. It contains several tasks of phylogenetic inference and BLAST queries, which can be executed sequentially or in parallel, taking advantage of multiple-core processing units. Although it was built for subtyping, SNAPPy is also useful to perform extensive HIV-1 alignments. This tool facilitates large-scale sequence-based HIV-1 research by providing a local, resource efficient and scalable alternative for HIV-1 subtyping. It is capable of analyzing full-length genomes or partial HIV-1 genomic regions (GAG, POL, and ENV) and recognizes more than ninety circulating recombinant forms. SNAPPy is freely available at: https://github.com/PMMAraujo/snappy/releases. Oxford University Press 2019-11-19 /pmc/articles/PMC6863187/ /pubmed/31768265 http://dx.doi.org/10.1093/ve/vez050 Text en © The Author(s) 2019. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Resources Araújo, Pedro M M Martins, Joana S Osório, Nuno S SNAPPy: A snakemake pipeline for scalable HIV-1 subtyping by phylogenetic pairing |
title | SNAPPy: A snakemake pipeline for scalable HIV-1 subtyping by phylogenetic pairing |
title_full | SNAPPy: A snakemake pipeline for scalable HIV-1 subtyping by phylogenetic pairing |
title_fullStr | SNAPPy: A snakemake pipeline for scalable HIV-1 subtyping by phylogenetic pairing |
title_full_unstemmed | SNAPPy: A snakemake pipeline for scalable HIV-1 subtyping by phylogenetic pairing |
title_short | SNAPPy: A snakemake pipeline for scalable HIV-1 subtyping by phylogenetic pairing |
title_sort | snappy: a snakemake pipeline for scalable hiv-1 subtyping by phylogenetic pairing |
topic | Resources |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6863187/ https://www.ncbi.nlm.nih.gov/pubmed/31768265 http://dx.doi.org/10.1093/ve/vez050 |
work_keys_str_mv | AT araujopedromm snappyasnakemakepipelineforscalablehiv1subtypingbyphylogeneticpairing AT martinsjoanas snappyasnakemakepipelineforscalablehiv1subtypingbyphylogeneticpairing AT osorionunos snappyasnakemakepipelineforscalablehiv1subtypingbyphylogeneticpairing |