Cargando…
gofasta: command-line utilities for genomic epidemiology research
SUMMARY: gofasta comprises a set of command-line utilities for handling alignments of short assembled genomes in a genomic epidemiology context. It was developed for processing large numbers of closely related SARS-CoV-2 viral genomes and should be useful with other densely sampled pathogen genomic...
Autor principal: | |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9364388/ https://www.ncbi.nlm.nih.gov/pubmed/35789376 http://dx.doi.org/10.1093/bioinformatics/btac424 |
_version_ | 1784765136350740480 |
---|---|
author | Jackson, Ben |
author_facet | Jackson, Ben |
author_sort | Jackson, Ben |
collection | PubMed |
description | SUMMARY: gofasta comprises a set of command-line utilities for handling alignments of short assembled genomes in a genomic epidemiology context. It was developed for processing large numbers of closely related SARS-CoV-2 viral genomes and should be useful with other densely sampled pathogen genomic datasets. It provides functions to convert sam-format pairwise alignments between assembled genomes to fasta format; to annotate mutations in multiple sequence alignments, and to extract sets of sequences by genetic distance measures for use in outbreak investigations. AVAILABILITY AND IMPLEMENTATION: gofasta is an open-source project distributed under the MIT license. Binaries are available at https://github.com/virus-evolution/gofasta, from Bioconda, and through the Go programming language’s package management system. Source code and further documentation, including walkthroughs for common use cases, are available on the GitHub repository. |
format | Online Article Text |
id | pubmed-9364388 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-93643882022-08-11 gofasta: command-line utilities for genomic epidemiology research Jackson, Ben Bioinformatics Applications Notes SUMMARY: gofasta comprises a set of command-line utilities for handling alignments of short assembled genomes in a genomic epidemiology context. It was developed for processing large numbers of closely related SARS-CoV-2 viral genomes and should be useful with other densely sampled pathogen genomic datasets. It provides functions to convert sam-format pairwise alignments between assembled genomes to fasta format; to annotate mutations in multiple sequence alignments, and to extract sets of sequences by genetic distance measures for use in outbreak investigations. AVAILABILITY AND IMPLEMENTATION: gofasta is an open-source project distributed under the MIT license. Binaries are available at https://github.com/virus-evolution/gofasta, from Bioconda, and through the Go programming language’s package management system. Source code and further documentation, including walkthroughs for common use cases, are available on the GitHub repository. Oxford University Press 2022-07-05 /pmc/articles/PMC9364388/ /pubmed/35789376 http://dx.doi.org/10.1093/bioinformatics/btac424 Text en © The Author(s) 2022. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Applications Notes Jackson, Ben gofasta: command-line utilities for genomic epidemiology research |
title | gofasta: command-line utilities for genomic epidemiology research |
title_full | gofasta: command-line utilities for genomic epidemiology research |
title_fullStr | gofasta: command-line utilities for genomic epidemiology research |
title_full_unstemmed | gofasta: command-line utilities for genomic epidemiology research |
title_short | gofasta: command-line utilities for genomic epidemiology research |
title_sort | gofasta: command-line utilities for genomic epidemiology research |
topic | Applications Notes |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9364388/ https://www.ncbi.nlm.nih.gov/pubmed/35789376 http://dx.doi.org/10.1093/bioinformatics/btac424 |
work_keys_str_mv | AT jacksonben gofastacommandlineutilitiesforgenomicepidemiologyresearch |