TransFlow: a Snakemake workflow for transmission analysis of Mycobacterium tuberculosis whole-genome sequencing data
MOTIVATION: Whole-genome sequencing (WGS) is increasingly used to aid the understanding of Mycobacterium tuberculosis (MTB) transmission. The epidemiological analysis of tuberculosis based on the WGS technique requires a diverse collection of bioinformatics tools. Effectively using these analysis to...
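Main authors: | Pan, Junhang; Li, Xiangchen; Zhang, Mingwu; Lu, Yewei; Zhu, Yelei; Wu, Kunyang; Wu, Yiwen; Wang, Weixin; Chen, Bin; Liu, Zhengwei; Wang, Xiaomeng; Gao, Junshun |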
---|---|
Format: | Online Article Text |
Language: | English |
Published: | Oxford University Press, 2022 |
Subjects: | |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9825751/ https://www.ncbi.nlm.nih.gov/pubmed/36469333 http://dx.doi.org/10.1093/bioinformatics/btac785 |
_version_ | 1784866690182414336 |
---|---|
author | Pan, Junhang Li, Xiangchen Zhang, Mingwu Lu, Yewei Zhu, Yelei Wu, Kunyang Wu, Yiwen Wang, Weixin Chen, Bin Liu, Zhengwei Wang, Xiaomeng Gao, Junshun |
author_sort | Pan, Junhang |
collection | PubMed |
description | MOTIVATION: Whole-genome sequencing (WGS) is increasingly used to aid the understanding of Mycobacterium tuberculosis (MTB) transmission. The epidemiological analysis of tuberculosis based on the WGS technique requires a diverse collection of bioinformatics tools. Effectively using these analysis tools in a scalable and reproducible way can be challenging, especially for non-experts. RESULTS: Here, we present TransFlow (Transmission Workflow), a user-friendly, fast, efficient and comprehensive WGS-based transmission analysis pipeline. TransFlow combines some state-of-the-art tools to take transmission analysis from raw sequencing data, through quality control, sequence alignment and variant calling, into downstream transmission clustering, transmission network reconstruction and transmission risk factor inference, together with summary statistics and data visualization in a summary report. TransFlow relies on Snakemake and Conda to resolve dependencies among consecutive processing steps and can be easily adapted to any computation environment. AVAILABILITY AND IMPLEMENTATION: TransFlow is freely available at https://github.com/cvn001/transflow. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. |
format | Online Article Text |
id | pubmed-9825751 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-98257512023-01-10 TransFlow: a Snakemake workflow for transmission analysis of Mycobacterium tuberculosis whole-genome sequencing data Pan, Junhang Li, Xiangchen Zhang, Mingwu Lu, Yewei Zhu, Yelei Wu, Kunyang Wu, Yiwen Wang, Weixin Chen, Bin Liu, Zhengwei Wang, Xiaomeng Gao, Junshun Bioinformatics Applications Note MOTIVATION: Whole-genome sequencing (WGS) is increasingly used to aid the understanding of Mycobacterium tuberculosis (MTB) transmission. The epidemiological analysis of tuberculosis based on the WGS technique requires a diverse collection of bioinformatics tools. Effectively using these analysis tools in a scalable and reproducible way can be challenging, especially for non-experts. RESULTS: Here, we present TransFlow (Transmission Workflow), a user-friendly, fast, efficient and comprehensive WGS-based transmission analysis pipeline. TransFlow combines some state-of-the-art tools to take transmission analysis from raw sequencing data, through quality control, sequence alignment and variant calling, into downstream transmission clustering, transmission network reconstruction and transmission risk factor inference, together with summary statistics and data visualization in a summary report. TransFlow relies on Snakemake and Conda to resolve dependencies among consecutive processing steps and can be easily adapted to any computation environment. AVAILABILITY AND IMPLEMENTATION: TransFlow is freely available at https://github.com/cvn001/transflow. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2022-12-05 /pmc/articles/PMC9825751/ /pubmed/36469333 http://dx.doi.org/10.1093/bioinformatics/btac785 Text en © The Author(s) 2022. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
title | TransFlow: a Snakemake workflow for transmission analysis of Mycobacterium tuberculosis whole-genome sequencing data |
topic | Applications Note |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9825751/ https://www.ncbi.nlm.nih.gov/pubmed/36469333 http://dx.doi.org/10.1093/bioinformatics/btac785 |