Cargando…

MeFiT: merging and filtering tool for illumina paired-end reads for 16S rRNA amplicon sequencing

BACKGROUND: Recent advances in next-generation sequencing have revolutionized genomic research. 16S rRNA amplicon sequencing using paired-end sequencing on the MiSeq platform from Illumina, Inc., is being used to characterize the composition and dynamics of extremely complex/diverse microbial commun...

Descripción completa

Detalles Bibliográficos
Autores principales: Parikh, Hardik I., Koparde, Vishal N., Bradley, Steven P., Buck, Gregory A., Sheth, Nihar U.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5134250/
https://www.ncbi.nlm.nih.gov/pubmed/27905885
http://dx.doi.org/10.1186/s12859-016-1358-1
Descripción
Sumario:BACKGROUND: Recent advances in next-generation sequencing have revolutionized genomic research. 16S rRNA amplicon sequencing using paired-end sequencing on the MiSeq platform from Illumina, Inc., is being used to characterize the composition and dynamics of extremely complex/diverse microbial communities. For this analysis on the Illumina platform, merging and quality filtering of paired-end reads are essential first steps in data analysis to ensure the accuracy and reliability of downstream analysis. RESULTS: We have developed the Merging and Filtering Tool (MeFiT) to combine these pre-processing steps into one simple, intuitive pipeline. MeFiT invokes CASPER (context-aware scheme for paired-end reads) for merging paired-end reads and provides users the option to quality filter the reads using the traditional average Q-score metric or using a maximum expected error cut-off threshold. CONCLUSIONS: MeFiT provides an open-source solution that permits users to merge and filter paired end illumina reads. The tool has been implemented in python and the source-code is freely available at https://github.com/nisheth/MeFiT. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-016-1358-1) contains supplementary material, which is available to authorized users.