Cargando…

FragGeneScanRs: faster gene prediction for short reads

BACKGROUND: FragGeneScan is currently the most accurate and popular tool for gene prediction in short and error-prone reads, but its execution speed is insufficient for use on larger data sets. The parallelization which should have addressed this is inefficient. Its alternative implementation FragGe...

Descripción completa

Detalles Bibliográficos
Autores principales: Van der Jeugt, Felix, Dawyndt, Peter, Mesuere, Bart
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9148508/
https://www.ncbi.nlm.nih.gov/pubmed/35643462
http://dx.doi.org/10.1186/s12859-022-04736-5
Descripción
Sumario:BACKGROUND: FragGeneScan is currently the most accurate and popular tool for gene prediction in short and error-prone reads, but its execution speed is insufficient for use on larger data sets. The parallelization which should have addressed this is inefficient. Its alternative implementation FragGeneScan+ is faster, but introduced a number of bugs related to memory management, race conditions and even output accuracy. RESULTS: This paper introduces FragGeneScanRs, a faster Rust implementation of the FragGeneScan gene prediction model. Its command line interface is backward compatible and adds extra features for more flexible usage. Its output is equivalent to the original FragGeneScan implementation. CONCLUSIONS: Compared to the current C implementation, shotgun metagenomic reads are processed up to 22 times faster using a single thread, with better scaling for multithreaded execution. The Rust code of FragGeneScanRs is freely available from GitHub under the GPL-3.0 license with instructions for installation, usage and other documentation (https://github.com/unipept/FragGeneScanRs). SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-022-04736-5.