Cargando…

gofasta: command-line utilities for genomic epidemiology research

SUMMARY: gofasta comprises a set of command-line utilities for handling alignments of short assembled genomes in a genomic epidemiology context. It was developed for processing large numbers of closely related SARS-CoV-2 viral genomes and should be useful with other densely sampled pathogen genomic...

Descripción completa

Detalles Bibliográficos
Autor principal: Jackson, Ben
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9364388/
https://www.ncbi.nlm.nih.gov/pubmed/35789376
http://dx.doi.org/10.1093/bioinformatics/btac424
_version_ 1784765136350740480
author Jackson, Ben
author_facet Jackson, Ben
author_sort Jackson, Ben
collection PubMed
description SUMMARY: gofasta comprises a set of command-line utilities for handling alignments of short assembled genomes in a genomic epidemiology context. It was developed for processing large numbers of closely related SARS-CoV-2 viral genomes and should be useful with other densely sampled pathogen genomic datasets. It provides functions to convert sam-format pairwise alignments between assembled genomes to fasta format; to annotate mutations in multiple sequence alignments, and to extract sets of sequences by genetic distance measures for use in outbreak investigations. AVAILABILITY AND IMPLEMENTATION: gofasta is an open-source project distributed under the MIT license. Binaries are available at https://github.com/virus-evolution/gofasta, from Bioconda, and through the Go programming language’s package management system. Source code and further documentation, including walkthroughs for common use cases, are available on the GitHub repository.
format Online
Article
Text
id pubmed-9364388
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-93643882022-08-11 gofasta: command-line utilities for genomic epidemiology research Jackson, Ben Bioinformatics Applications Notes SUMMARY: gofasta comprises a set of command-line utilities for handling alignments of short assembled genomes in a genomic epidemiology context. It was developed for processing large numbers of closely related SARS-CoV-2 viral genomes and should be useful with other densely sampled pathogen genomic datasets. It provides functions to convert sam-format pairwise alignments between assembled genomes to fasta format; to annotate mutations in multiple sequence alignments, and to extract sets of sequences by genetic distance measures for use in outbreak investigations. AVAILABILITY AND IMPLEMENTATION: gofasta is an open-source project distributed under the MIT license. Binaries are available at https://github.com/virus-evolution/gofasta, from Bioconda, and through the Go programming language’s package management system. Source code and further documentation, including walkthroughs for common use cases, are available on the GitHub repository. Oxford University Press 2022-07-05 /pmc/articles/PMC9364388/ /pubmed/35789376 http://dx.doi.org/10.1093/bioinformatics/btac424 Text en © The Author(s) 2022. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Applications Notes
Jackson, Ben
gofasta: command-line utilities for genomic epidemiology research
title gofasta: command-line utilities for genomic epidemiology research
title_full gofasta: command-line utilities for genomic epidemiology research
title_fullStr gofasta: command-line utilities for genomic epidemiology research
title_full_unstemmed gofasta: command-line utilities for genomic epidemiology research
title_short gofasta: command-line utilities for genomic epidemiology research
title_sort gofasta: command-line utilities for genomic epidemiology research
topic Applications Notes
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9364388/
https://www.ncbi.nlm.nih.gov/pubmed/35789376
http://dx.doi.org/10.1093/bioinformatics/btac424
work_keys_str_mv AT jacksonben gofastacommandlineutilitiesforgenomicepidemiologyresearch