Cargando…

FAST: FAST Analysis of Sequences Toolbox

FAST (FAST Analysis of Sequences Toolbox) provides simple, powerful open source command-line tools to filter, transform, annotate and analyze biological sequence data. Modeled after the GNU (GNU's Not Unix) Textutils such as grep, cut, and tr, FAST tools such as fasgrep, fascut, and fastr make...

Descripción completa

Detalles Bibliográficos
Autores principales: Lawrence, Travis J., Kauffman, Kyle T., Amrine, Katherine C. H., Carper, Dana L., Lee, Raymond S., Becich, Peter J., Canales, Claudia J., Ardell, David H.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4437040/
https://www.ncbi.nlm.nih.gov/pubmed/26042145
http://dx.doi.org/10.3389/fgene.2015.00172
_version_ 1782372168047263744
author Lawrence, Travis J.
Kauffman, Kyle T.
Amrine, Katherine C. H.
Carper, Dana L.
Lee, Raymond S.
Becich, Peter J.
Canales, Claudia J.
Ardell, David H.
author_facet Lawrence, Travis J.
Kauffman, Kyle T.
Amrine, Katherine C. H.
Carper, Dana L.
Lee, Raymond S.
Becich, Peter J.
Canales, Claudia J.
Ardell, David H.
author_sort Lawrence, Travis J.
collection PubMed
description FAST (FAST Analysis of Sequences Toolbox) provides simple, powerful open source command-line tools to filter, transform, annotate and analyze biological sequence data. Modeled after the GNU (GNU's Not Unix) Textutils such as grep, cut, and tr, FAST tools such as fasgrep, fascut, and fastr make it easy to rapidly prototype expressive bioinformatic workflows in a compact and generic command vocabulary. Compact combinatorial encoding of data workflows with FAST commands can simplify the documentation and reproducibility of bioinformatic protocols, supporting better transparency in biological data science. Interface self-consistency and conformity with conventions of GNU, Matlab, Perl, BioPerl, R, and GenBank help make FAST easy and rewarding to learn. FAST automates numerical, taxonomic, and text-based sorting, selection and transformation of sequence records and alignment sites based on content, index ranges, descriptive tags, annotated features, and in-line calculated analytics, including composition and codon usage. Automated content- and feature-based extraction of sites and support for molecular population genetic statistics make FAST useful for molecular evolutionary analysis. FAST is portable, easy to install and secure thanks to the relative maturity of its Perl and BioPerl foundations, with stable releases posted to CPAN. Development as well as a publicly accessible Cookbook and Wiki are available on the FAST GitHub repository at https://github.com/tlawrence3/FAST. The default data exchange format in FAST is Multi-FastA (specifically, a restriction of BioPerl FastA format). Sanger and Illumina 1.8+ FastQ formatted files are also supported. FAST makes it easier for non-programmer biologists to interactively investigate and control biological data at the speed of thought.
format Online
Article
Text
id pubmed-4437040
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-44370402015-06-03 FAST: FAST Analysis of Sequences Toolbox Lawrence, Travis J. Kauffman, Kyle T. Amrine, Katherine C. H. Carper, Dana L. Lee, Raymond S. Becich, Peter J. Canales, Claudia J. Ardell, David H. Front Genet Genetics FAST (FAST Analysis of Sequences Toolbox) provides simple, powerful open source command-line tools to filter, transform, annotate and analyze biological sequence data. Modeled after the GNU (GNU's Not Unix) Textutils such as grep, cut, and tr, FAST tools such as fasgrep, fascut, and fastr make it easy to rapidly prototype expressive bioinformatic workflows in a compact and generic command vocabulary. Compact combinatorial encoding of data workflows with FAST commands can simplify the documentation and reproducibility of bioinformatic protocols, supporting better transparency in biological data science. Interface self-consistency and conformity with conventions of GNU, Matlab, Perl, BioPerl, R, and GenBank help make FAST easy and rewarding to learn. FAST automates numerical, taxonomic, and text-based sorting, selection and transformation of sequence records and alignment sites based on content, index ranges, descriptive tags, annotated features, and in-line calculated analytics, including composition and codon usage. Automated content- and feature-based extraction of sites and support for molecular population genetic statistics make FAST useful for molecular evolutionary analysis. FAST is portable, easy to install and secure thanks to the relative maturity of its Perl and BioPerl foundations, with stable releases posted to CPAN. Development as well as a publicly accessible Cookbook and Wiki are available on the FAST GitHub repository at https://github.com/tlawrence3/FAST. The default data exchange format in FAST is Multi-FastA (specifically, a restriction of BioPerl FastA format). Sanger and Illumina 1.8+ FastQ formatted files are also supported. FAST makes it easier for non-programmer biologists to interactively investigate and control biological data at the speed of thought. Frontiers Media S.A. 2015-05-19 /pmc/articles/PMC4437040/ /pubmed/26042145 http://dx.doi.org/10.3389/fgene.2015.00172 Text en Copyright © 2015 Lawrence, Kauffman, Amrine, Carper, Lee, Becich, Canales and Ardell. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Lawrence, Travis J.
Kauffman, Kyle T.
Amrine, Katherine C. H.
Carper, Dana L.
Lee, Raymond S.
Becich, Peter J.
Canales, Claudia J.
Ardell, David H.
FAST: FAST Analysis of Sequences Toolbox
title FAST: FAST Analysis of Sequences Toolbox
title_full FAST: FAST Analysis of Sequences Toolbox
title_fullStr FAST: FAST Analysis of Sequences Toolbox
title_full_unstemmed FAST: FAST Analysis of Sequences Toolbox
title_short FAST: FAST Analysis of Sequences Toolbox
title_sort fast: fast analysis of sequences toolbox
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4437040/
https://www.ncbi.nlm.nih.gov/pubmed/26042145
http://dx.doi.org/10.3389/fgene.2015.00172
work_keys_str_mv AT lawrencetravisj fastfastanalysisofsequencestoolbox
AT kauffmankylet fastfastanalysisofsequencestoolbox
AT amrinekatherinech fastfastanalysisofsequencestoolbox
AT carperdanal fastfastanalysisofsequencestoolbox
AT leeraymonds fastfastanalysisofsequencestoolbox
AT becichpeterj fastfastanalysisofsequencestoolbox
AT canalesclaudiaj fastfastanalysisofsequencestoolbox
AT ardelldavidh fastfastanalysisofsequencestoolbox