Cargando…

TransFlow: a Snakemake workflow for transmission analysis of Mycobacterium tuberculosis whole-genome sequencing data

MOTIVATION: Whole-genome sequencing (WGS) is increasingly used to aid the understanding of Mycobacterium tuberculosis (MTB) transmission. The epidemiological analysis of tuberculosis based on the WGS technique requires a diverse collection of bioinformatics tools. Effectively using these analysis to...

Descripción completa

Detalles Bibliográficos
Autores principales: Pan, Junhang, Li, Xiangchen, Zhang, Mingwu, Lu, Yewei, Zhu, Yelei, Wu, Kunyang, Wu, Yiwen, Wang, Weixin, Chen, Bin, Liu, Zhengwei, Wang, Xiaomeng, Gao, Junshun
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9825751/
https://www.ncbi.nlm.nih.gov/pubmed/36469333
http://dx.doi.org/10.1093/bioinformatics/btac785
_version_ 1784866690182414336
author Pan, Junhang
Li, Xiangchen
Zhang, Mingwu
Lu, Yewei
Zhu, Yelei
Wu, Kunyang
Wu, Yiwen
Wang, Weixin
Chen, Bin
Liu, Zhengwei
Wang, Xiaomeng
Gao, Junshun
author_facet Pan, Junhang
Li, Xiangchen
Zhang, Mingwu
Lu, Yewei
Zhu, Yelei
Wu, Kunyang
Wu, Yiwen
Wang, Weixin
Chen, Bin
Liu, Zhengwei
Wang, Xiaomeng
Gao, Junshun
author_sort Pan, Junhang
collection PubMed
description MOTIVATION: Whole-genome sequencing (WGS) is increasingly used to aid the understanding of Mycobacterium tuberculosis (MTB) transmission. The epidemiological analysis of tuberculosis based on the WGS technique requires a diverse collection of bioinformatics tools. Effectively using these analysis tools in a scalable and reproducible way can be challenging, especially for non-experts. RESULTS: Here, we present TransFlow (Transmission Workflow), a user-friendly, fast, efficient and comprehensive WGS-based transmission analysis pipeline. TransFlow combines some state-of-the-art tools to take transmission analysis from raw sequencing data, through quality control, sequence alignment and variant calling, into downstream transmission clustering, transmission network reconstruction and transmission risk factor inference, together with summary statistics and data visualization in a summary report. TransFlow relies on Snakemake and Conda to resolve dependencies among consecutive processing steps and can be easily adapted to any computation environment. AVAILABILITY AND IMPLEMENTATION: TransFlow is free available at https://github.com/cvn001/transflow. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-9825751
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-98257512023-01-10 TransFlow: a Snakemake workflow for transmission analysis of Mycobacterium tuberculosis whole-genome sequencing data Pan, Junhang Li, Xiangchen Zhang, Mingwu Lu, Yewei Zhu, Yelei Wu, Kunyang Wu, Yiwen Wang, Weixin Chen, Bin Liu, Zhengwei Wang, Xiaomeng Gao, Junshun Bioinformatics Applications Note MOTIVATION: Whole-genome sequencing (WGS) is increasingly used to aid the understanding of Mycobacterium tuberculosis (MTB) transmission. The epidemiological analysis of tuberculosis based on the WGS technique requires a diverse collection of bioinformatics tools. Effectively using these analysis tools in a scalable and reproducible way can be challenging, especially for non-experts. RESULTS: Here, we present TransFlow (Transmission Workflow), a user-friendly, fast, efficient and comprehensive WGS-based transmission analysis pipeline. TransFlow combines some state-of-the-art tools to take transmission analysis from raw sequencing data, through quality control, sequence alignment and variant calling, into downstream transmission clustering, transmission network reconstruction and transmission risk factor inference, together with summary statistics and data visualization in a summary report. TransFlow relies on Snakemake and Conda to resolve dependencies among consecutive processing steps and can be easily adapted to any computation environment. AVAILABILITY AND IMPLEMENTATION: TransFlow is free available at https://github.com/cvn001/transflow. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2022-12-05 /pmc/articles/PMC9825751/ /pubmed/36469333 http://dx.doi.org/10.1093/bioinformatics/btac785 Text en © The Author(s) 2022. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Applications Note
Pan, Junhang
Li, Xiangchen
Zhang, Mingwu
Lu, Yewei
Zhu, Yelei
Wu, Kunyang
Wu, Yiwen
Wang, Weixin
Chen, Bin
Liu, Zhengwei
Wang, Xiaomeng
Gao, Junshun
TransFlow: a Snakemake workflow for transmission analysis of Mycobacterium tuberculosis whole-genome sequencing data
title TransFlow: a Snakemake workflow for transmission analysis of Mycobacterium tuberculosis whole-genome sequencing data
title_full TransFlow: a Snakemake workflow for transmission analysis of Mycobacterium tuberculosis whole-genome sequencing data
title_fullStr TransFlow: a Snakemake workflow for transmission analysis of Mycobacterium tuberculosis whole-genome sequencing data
title_full_unstemmed TransFlow: a Snakemake workflow for transmission analysis of Mycobacterium tuberculosis whole-genome sequencing data
title_short TransFlow: a Snakemake workflow for transmission analysis of Mycobacterium tuberculosis whole-genome sequencing data
title_sort transflow: a snakemake workflow for transmission analysis of mycobacterium tuberculosis whole-genome sequencing data
topic Applications Note
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9825751/
https://www.ncbi.nlm.nih.gov/pubmed/36469333
http://dx.doi.org/10.1093/bioinformatics/btac785
work_keys_str_mv AT panjunhang transflowasnakemakeworkflowfortransmissionanalysisofmycobacteriumtuberculosiswholegenomesequencingdata
AT lixiangchen transflowasnakemakeworkflowfortransmissionanalysisofmycobacteriumtuberculosiswholegenomesequencingdata
AT zhangmingwu transflowasnakemakeworkflowfortransmissionanalysisofmycobacteriumtuberculosiswholegenomesequencingdata
AT luyewei transflowasnakemakeworkflowfortransmissionanalysisofmycobacteriumtuberculosiswholegenomesequencingdata
AT zhuyelei transflowasnakemakeworkflowfortransmissionanalysisofmycobacteriumtuberculosiswholegenomesequencingdata
AT wukunyang transflowasnakemakeworkflowfortransmissionanalysisofmycobacteriumtuberculosiswholegenomesequencingdata
AT wuyiwen transflowasnakemakeworkflowfortransmissionanalysisofmycobacteriumtuberculosiswholegenomesequencingdata
AT wangweixin transflowasnakemakeworkflowfortransmissionanalysisofmycobacteriumtuberculosiswholegenomesequencingdata
AT chenbin transflowasnakemakeworkflowfortransmissionanalysisofmycobacteriumtuberculosiswholegenomesequencingdata
AT liuzhengwei transflowasnakemakeworkflowfortransmissionanalysisofmycobacteriumtuberculosiswholegenomesequencingdata
AT wangxiaomeng transflowasnakemakeworkflowfortransmissionanalysisofmycobacteriumtuberculosiswholegenomesequencingdata
AT gaojunshun transflowasnakemakeworkflowfortransmissionanalysisofmycobacteriumtuberculosiswholegenomesequencingdata