Cargando…
MycoVarP: Mycobacterium Variant and Drug Resistance Prediction Pipeline for Whole-Genome Sequence Data Analysis
Whole-genome sequencing (WGS) provides a comprehensive tool to analyze the bacterial genomes for genotype–phenotype correlations, diversity of single-nucleotide variant (SNV), and their evolution and transmission. Several online pipelines and standalone tools are available for WGS analysis of Mycoba...
Autores principales: | , , , , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9580932/ https://www.ncbi.nlm.nih.gov/pubmed/36303799 http://dx.doi.org/10.3389/fbinf.2021.805338 |
_version_ | 1784812503660756992 |
---|---|
author | Swargam, Sandeep Kumari, Indu Kumar, Amit Pradhan, Dibyabhaba Alam, Anwar Singh, Harpreet Jain, Anuja Devi, Kangjam Rekha Trivedi, Vishal Sarma, Jogesh Hanif, Mahmud Narain, Kanwar Ehtesham, Nasreen Zafar Hasnain, Seyed Ehtesham Ahmad, Shandar |
author_facet | Swargam, Sandeep Kumari, Indu Kumar, Amit Pradhan, Dibyabhaba Alam, Anwar Singh, Harpreet Jain, Anuja Devi, Kangjam Rekha Trivedi, Vishal Sarma, Jogesh Hanif, Mahmud Narain, Kanwar Ehtesham, Nasreen Zafar Hasnain, Seyed Ehtesham Ahmad, Shandar |
author_sort | Swargam, Sandeep |
collection | PubMed |
description | Whole-genome sequencing (WGS) provides a comprehensive tool to analyze the bacterial genomes for genotype–phenotype correlations, diversity of single-nucleotide variant (SNV), and their evolution and transmission. Several online pipelines and standalone tools are available for WGS analysis of Mycobacterium tuberculosis (Mtb) complex (MTBC). While they facilitate the processing of WGS data with minimal user expertise, they are either too general, providing little insights into bacterium-specific issues such as gene variations, INDEL/synonymous/PE-PPE (IDP family), and drug resistance from sample data, or are limited to specific objectives, such as drug resistance. It is understood that drug resistance and lineage-specific issues require an elaborate prioritization of identified variants to choose the best target for subsequent therapeutic intervention. Mycobacterium variant pipeline (MycoVarP) addresses these specific issues with a flexible battery of user-defined and default filters. It provides an end-to-end solution for WGS analysis of Mtb variants from the raw reads and performs two quality checks, viz, before trimming and after alignments of reads to the reference genome. MycoVarP maps the annotated variants to the drug-susceptible (DS) database and removes the false-positive variants, provides lineage identification, and predicts potential drug resistance. We have re-analyzed the WGS data reported by Advani et al. (2019) using MycoVarP and identified some additional variants not reported so far. We conclude that MycoVarP will help in identifying nonsynonymous, true-positive, drug resistance–associated variants more effectively and comprehensively, including those within the IDP of the PE-PPE/PGRS family, than possible from the currently available pipelines. |
format | Online Article Text |
id | pubmed-9580932 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-95809322022-10-26 MycoVarP: Mycobacterium Variant and Drug Resistance Prediction Pipeline for Whole-Genome Sequence Data Analysis Swargam, Sandeep Kumari, Indu Kumar, Amit Pradhan, Dibyabhaba Alam, Anwar Singh, Harpreet Jain, Anuja Devi, Kangjam Rekha Trivedi, Vishal Sarma, Jogesh Hanif, Mahmud Narain, Kanwar Ehtesham, Nasreen Zafar Hasnain, Seyed Ehtesham Ahmad, Shandar Front Bioinform Bioinformatics Whole-genome sequencing (WGS) provides a comprehensive tool to analyze the bacterial genomes for genotype–phenotype correlations, diversity of single-nucleotide variant (SNV), and their evolution and transmission. Several online pipelines and standalone tools are available for WGS analysis of Mycobacterium tuberculosis (Mtb) complex (MTBC). While they facilitate the processing of WGS data with minimal user expertise, they are either too general, providing little insights into bacterium-specific issues such as gene variations, INDEL/synonymous/PE-PPE (IDP family), and drug resistance from sample data, or are limited to specific objectives, such as drug resistance. It is understood that drug resistance and lineage-specific issues require an elaborate prioritization of identified variants to choose the best target for subsequent therapeutic intervention. Mycobacterium variant pipeline (MycoVarP) addresses these specific issues with a flexible battery of user-defined and default filters. It provides an end-to-end solution for WGS analysis of Mtb variants from the raw reads and performs two quality checks, viz, before trimming and after alignments of reads to the reference genome. MycoVarP maps the annotated variants to the drug-susceptible (DS) database and removes the false-positive variants, provides lineage identification, and predicts potential drug resistance. We have re-analyzed the WGS data reported by Advani et al. (2019) using MycoVarP and identified some additional variants not reported so far. We conclude that MycoVarP will help in identifying nonsynonymous, true-positive, drug resistance–associated variants more effectively and comprehensively, including those within the IDP of the PE-PPE/PGRS family, than possible from the currently available pipelines. Frontiers Media S.A. 2022-06-03 /pmc/articles/PMC9580932/ /pubmed/36303799 http://dx.doi.org/10.3389/fbinf.2021.805338 Text en Copyright © 2022 Swargam, Kumari, Kumar, Pradhan, Alam, Singh, Jain, Devi, Trivedi, Sarma, Hanif, Narain, Ehtesham, Hasnain and Ahmad. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Bioinformatics Swargam, Sandeep Kumari, Indu Kumar, Amit Pradhan, Dibyabhaba Alam, Anwar Singh, Harpreet Jain, Anuja Devi, Kangjam Rekha Trivedi, Vishal Sarma, Jogesh Hanif, Mahmud Narain, Kanwar Ehtesham, Nasreen Zafar Hasnain, Seyed Ehtesham Ahmad, Shandar MycoVarP: Mycobacterium Variant and Drug Resistance Prediction Pipeline for Whole-Genome Sequence Data Analysis |
title | MycoVarP: Mycobacterium Variant and Drug Resistance Prediction Pipeline for Whole-Genome Sequence Data Analysis |
title_full | MycoVarP: Mycobacterium Variant and Drug Resistance Prediction Pipeline for Whole-Genome Sequence Data Analysis |
title_fullStr | MycoVarP: Mycobacterium Variant and Drug Resistance Prediction Pipeline for Whole-Genome Sequence Data Analysis |
title_full_unstemmed | MycoVarP: Mycobacterium Variant and Drug Resistance Prediction Pipeline for Whole-Genome Sequence Data Analysis |
title_short | MycoVarP: Mycobacterium Variant and Drug Resistance Prediction Pipeline for Whole-Genome Sequence Data Analysis |
title_sort | mycovarp: mycobacterium variant and drug resistance prediction pipeline for whole-genome sequence data analysis |
topic | Bioinformatics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9580932/ https://www.ncbi.nlm.nih.gov/pubmed/36303799 http://dx.doi.org/10.3389/fbinf.2021.805338 |
work_keys_str_mv | AT swargamsandeep mycovarpmycobacteriumvariantanddrugresistancepredictionpipelineforwholegenomesequencedataanalysis AT kumariindu mycovarpmycobacteriumvariantanddrugresistancepredictionpipelineforwholegenomesequencedataanalysis AT kumaramit mycovarpmycobacteriumvariantanddrugresistancepredictionpipelineforwholegenomesequencedataanalysis AT pradhandibyabhaba mycovarpmycobacteriumvariantanddrugresistancepredictionpipelineforwholegenomesequencedataanalysis AT alamanwar mycovarpmycobacteriumvariantanddrugresistancepredictionpipelineforwholegenomesequencedataanalysis AT singhharpreet mycovarpmycobacteriumvariantanddrugresistancepredictionpipelineforwholegenomesequencedataanalysis AT jainanuja mycovarpmycobacteriumvariantanddrugresistancepredictionpipelineforwholegenomesequencedataanalysis AT devikangjamrekha mycovarpmycobacteriumvariantanddrugresistancepredictionpipelineforwholegenomesequencedataanalysis AT trivedivishal mycovarpmycobacteriumvariantanddrugresistancepredictionpipelineforwholegenomesequencedataanalysis AT sarmajogesh mycovarpmycobacteriumvariantanddrugresistancepredictionpipelineforwholegenomesequencedataanalysis AT hanifmahmud mycovarpmycobacteriumvariantanddrugresistancepredictionpipelineforwholegenomesequencedataanalysis AT narainkanwar mycovarpmycobacteriumvariantanddrugresistancepredictionpipelineforwholegenomesequencedataanalysis AT ehteshamnasreenzafar mycovarpmycobacteriumvariantanddrugresistancepredictionpipelineforwholegenomesequencedataanalysis AT hasnainseyedehtesham mycovarpmycobacteriumvariantanddrugresistancepredictionpipelineforwholegenomesequencedataanalysis AT ahmadshandar mycovarpmycobacteriumvariantanddrugresistancepredictionpipelineforwholegenomesequencedataanalysis |