TransFlow: a Snakemake workflow for transmission analysis of Mycobacterium tuberculosis whole-genome sequencing data

Bibliographic Details
Main authors: Pan, Junhang; Li, Xiangchen; Zhang, Mingwu; Lu, Yewei; Zhu, Yelei; Wu, Kunyang; Wu, Yiwen; Wang, Weixin; Chen, Bin; Liu, Zhengwei; Wang, Xiaomeng; Gao, Junshun
Format: Online Article Text
Language: English
Published: Oxford University Press 2022
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9825751/
https://www.ncbi.nlm.nih.gov/pubmed/36469333
http://dx.doi.org/10.1093/bioinformatics/btac785
Description
Summary: MOTIVATION: Whole-genome sequencing (WGS) is increasingly used to aid the understanding of Mycobacterium tuberculosis (MTB) transmission. The epidemiological analysis of tuberculosis based on WGS requires a diverse collection of bioinformatics tools. Using these analysis tools effectively in a scalable and reproducible way can be challenging, especially for non-experts. RESULTS: Here, we present TransFlow (Transmission Workflow), a user-friendly, fast, efficient and comprehensive WGS-based transmission analysis pipeline. TransFlow combines state-of-the-art tools to carry transmission analysis from raw sequencing data, through quality control, sequence alignment and variant calling, to downstream transmission clustering, transmission network reconstruction and transmission risk factor inference, together with summary statistics and data visualization in a summary report. TransFlow relies on Snakemake and Conda to resolve dependencies among consecutive processing steps and can be easily adapted to any computing environment. AVAILABILITY AND IMPLEMENTATION: TransFlow is freely available at https://github.com/cvn001/transflow. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
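
Illustrative note: the summary mentions transmission clustering as one downstream step. The sketch below shows one common way such a step can be framed, namely grouping isolates whose pairwise SNP distance falls at or below a cutoff (single-linkage clustering). It is not taken from TransFlow and does not describe its implementation. The function names, the toy sequences and the threshold values are all assumptions chosen for illustration.

```python
from itertools import combinations


def snp_distance(a: str, b: str) -> int:
    """Count differing positions between two aligned sequences.

    Positions where either sequence has a non-ACGT character
    (for example N or a gap) are ignored. Hypothetical helper.
    """
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    valid = set("ACGT")
    return sum(1 for x, y in zip(a, b) if x != y and x in valid and y in valid)


def transmission_clusters(seqs: dict[str, str], threshold: int) -> list[set[str]]:
    """Single-linkage clustering: link two isolates when their SNP
    distance is <= threshold, then return the connected groups.

    The threshold is an assumption supplied by the caller.
    Singletons are returned as clusters of size one.
    """
    # Union-find structure over isolate names.
    parent = {name: name for name in seqs}

    def find(x: str) -> str:
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path compression
            x = parent[x]
        return x

    for a, b in combinations(seqs, 2):
        if snp_distance(seqs[a], seqs[b]) <= threshold:
            parent[find(a)] = find(b)

    groups: dict[str, set[str]] = {}
    for name in seqs:
        groups.setdefault(find(name), set()).add(name)
    # Sort clusters by their smallest member name for a stable order.
    return sorted(groups.values(), key=min)


if __name__ == "__main__":
    # Toy aligned sequences, invented for this example.
    toy = {
        "s1": "ACGTACGT",
        "s2": "ACGTACGA",
        "s3": "TTTTACGA",
        "s4": "TTTTACGN",
    }
    for cluster in transmission_clusters(toy, threshold=1):
        print(sorted(cluster))
```

In practice the cutoff would be chosen from the epidemiological context, and the distances would come from a variant-calling step rather than from raw string comparison.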