Cargando…

MycoVarP: Mycobacterium Variant and Drug Resistance Prediction Pipeline for Whole-Genome Sequence Data Analysis

Whole-genome sequencing (WGS) provides a comprehensive tool to analyze the bacterial genomes for genotype–phenotype correlations, diversity of single-nucleotide variant (SNV), and their evolution and transmission. Several online pipelines and standalone tools are available for WGS analysis of Mycoba...

Descripción completa

Detalles Bibliográficos
Autores principales: Swargam, Sandeep, Kumari, Indu, Kumar, Amit, Pradhan, Dibyabhaba, Alam, Anwar, Singh, Harpreet, Jain, Anuja, Devi, Kangjam Rekha, Trivedi, Vishal, Sarma, Jogesh, Hanif, Mahmud, Narain, Kanwar, Ehtesham, Nasreen Zafar, Hasnain, Seyed Ehtesham, Ahmad, Shandar
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9580932/
https://www.ncbi.nlm.nih.gov/pubmed/36303799
http://dx.doi.org/10.3389/fbinf.2021.805338
_version_ 1784812503660756992
author Swargam, Sandeep
Kumari, Indu
Kumar, Amit
Pradhan, Dibyabhaba
Alam, Anwar
Singh, Harpreet
Jain, Anuja
Devi, Kangjam Rekha
Trivedi, Vishal
Sarma, Jogesh
Hanif, Mahmud
Narain, Kanwar
Ehtesham, Nasreen Zafar
Hasnain, Seyed Ehtesham
Ahmad, Shandar
author_facet Swargam, Sandeep
Kumari, Indu
Kumar, Amit
Pradhan, Dibyabhaba
Alam, Anwar
Singh, Harpreet
Jain, Anuja
Devi, Kangjam Rekha
Trivedi, Vishal
Sarma, Jogesh
Hanif, Mahmud
Narain, Kanwar
Ehtesham, Nasreen Zafar
Hasnain, Seyed Ehtesham
Ahmad, Shandar
author_sort Swargam, Sandeep
collection PubMed
description Whole-genome sequencing (WGS) provides a comprehensive tool to analyze the bacterial genomes for genotype–phenotype correlations, diversity of single-nucleotide variant (SNV), and their evolution and transmission. Several online pipelines and standalone tools are available for WGS analysis of Mycobacterium tuberculosis (Mtb) complex (MTBC). While they facilitate the processing of WGS data with minimal user expertise, they are either too general, providing little insights into bacterium-specific issues such as gene variations, INDEL/synonymous/PE-PPE (IDP family), and drug resistance from sample data, or are limited to specific objectives, such as drug resistance. It is understood that drug resistance and lineage-specific issues require an elaborate prioritization of identified variants to choose the best target for subsequent therapeutic intervention. Mycobacterium variant pipeline (MycoVarP) addresses these specific issues with a flexible battery of user-defined and default filters. It provides an end-to-end solution for WGS analysis of Mtb variants from the raw reads and performs two quality checks, viz, before trimming and after alignments of reads to the reference genome. MycoVarP maps the annotated variants to the drug-susceptible (DS) database and removes the false-positive variants, provides lineage identification, and predicts potential drug resistance. We have re-analyzed the WGS data reported by Advani et al. (2019) using MycoVarP and identified some additional variants not reported so far. We conclude that MycoVarP will help in identifying nonsynonymous, true-positive, drug resistance–associated variants more effectively and comprehensively, including those within the IDP of the PE-PPE/PGRS family, than possible from the currently available pipelines.
format Online
Article
Text
id pubmed-9580932
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-95809322022-10-26 MycoVarP: Mycobacterium Variant and Drug Resistance Prediction Pipeline for Whole-Genome Sequence Data Analysis Swargam, Sandeep Kumari, Indu Kumar, Amit Pradhan, Dibyabhaba Alam, Anwar Singh, Harpreet Jain, Anuja Devi, Kangjam Rekha Trivedi, Vishal Sarma, Jogesh Hanif, Mahmud Narain, Kanwar Ehtesham, Nasreen Zafar Hasnain, Seyed Ehtesham Ahmad, Shandar Front Bioinform Bioinformatics Whole-genome sequencing (WGS) provides a comprehensive tool to analyze the bacterial genomes for genotype–phenotype correlations, diversity of single-nucleotide variant (SNV), and their evolution and transmission. Several online pipelines and standalone tools are available for WGS analysis of Mycobacterium tuberculosis (Mtb) complex (MTBC). While they facilitate the processing of WGS data with minimal user expertise, they are either too general, providing little insights into bacterium-specific issues such as gene variations, INDEL/synonymous/PE-PPE (IDP family), and drug resistance from sample data, or are limited to specific objectives, such as drug resistance. It is understood that drug resistance and lineage-specific issues require an elaborate prioritization of identified variants to choose the best target for subsequent therapeutic intervention. Mycobacterium variant pipeline (MycoVarP) addresses these specific issues with a flexible battery of user-defined and default filters. It provides an end-to-end solution for WGS analysis of Mtb variants from the raw reads and performs two quality checks, viz, before trimming and after alignments of reads to the reference genome. MycoVarP maps the annotated variants to the drug-susceptible (DS) database and removes the false-positive variants, provides lineage identification, and predicts potential drug resistance. We have re-analyzed the WGS data reported by Advani et al. (2019) using MycoVarP and identified some additional variants not reported so far. We conclude that MycoVarP will help in identifying nonsynonymous, true-positive, drug resistance–associated variants more effectively and comprehensively, including those within the IDP of the PE-PPE/PGRS family, than possible from the currently available pipelines. Frontiers Media S.A. 2022-06-03 /pmc/articles/PMC9580932/ /pubmed/36303799 http://dx.doi.org/10.3389/fbinf.2021.805338 Text en Copyright © 2022 Swargam, Kumari, Kumar, Pradhan, Alam, Singh, Jain, Devi, Trivedi, Sarma, Hanif, Narain, Ehtesham, Hasnain and Ahmad. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Bioinformatics
Swargam, Sandeep
Kumari, Indu
Kumar, Amit
Pradhan, Dibyabhaba
Alam, Anwar
Singh, Harpreet
Jain, Anuja
Devi, Kangjam Rekha
Trivedi, Vishal
Sarma, Jogesh
Hanif, Mahmud
Narain, Kanwar
Ehtesham, Nasreen Zafar
Hasnain, Seyed Ehtesham
Ahmad, Shandar
MycoVarP: Mycobacterium Variant and Drug Resistance Prediction Pipeline for Whole-Genome Sequence Data Analysis
title MycoVarP: Mycobacterium Variant and Drug Resistance Prediction Pipeline for Whole-Genome Sequence Data Analysis
title_full MycoVarP: Mycobacterium Variant and Drug Resistance Prediction Pipeline for Whole-Genome Sequence Data Analysis
title_fullStr MycoVarP: Mycobacterium Variant and Drug Resistance Prediction Pipeline for Whole-Genome Sequence Data Analysis
title_full_unstemmed MycoVarP: Mycobacterium Variant and Drug Resistance Prediction Pipeline for Whole-Genome Sequence Data Analysis
title_short MycoVarP: Mycobacterium Variant and Drug Resistance Prediction Pipeline for Whole-Genome Sequence Data Analysis
title_sort mycovarp: mycobacterium variant and drug resistance prediction pipeline for whole-genome sequence data analysis
topic Bioinformatics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9580932/
https://www.ncbi.nlm.nih.gov/pubmed/36303799
http://dx.doi.org/10.3389/fbinf.2021.805338
work_keys_str_mv AT swargamsandeep mycovarpmycobacteriumvariantanddrugresistancepredictionpipelineforwholegenomesequencedataanalysis
AT kumariindu mycovarpmycobacteriumvariantanddrugresistancepredictionpipelineforwholegenomesequencedataanalysis
AT kumaramit mycovarpmycobacteriumvariantanddrugresistancepredictionpipelineforwholegenomesequencedataanalysis
AT pradhandibyabhaba mycovarpmycobacteriumvariantanddrugresistancepredictionpipelineforwholegenomesequencedataanalysis
AT alamanwar mycovarpmycobacteriumvariantanddrugresistancepredictionpipelineforwholegenomesequencedataanalysis
AT singhharpreet mycovarpmycobacteriumvariantanddrugresistancepredictionpipelineforwholegenomesequencedataanalysis
AT jainanuja mycovarpmycobacteriumvariantanddrugresistancepredictionpipelineforwholegenomesequencedataanalysis
AT devikangjamrekha mycovarpmycobacteriumvariantanddrugresistancepredictionpipelineforwholegenomesequencedataanalysis
AT trivedivishal mycovarpmycobacteriumvariantanddrugresistancepredictionpipelineforwholegenomesequencedataanalysis
AT sarmajogesh mycovarpmycobacteriumvariantanddrugresistancepredictionpipelineforwholegenomesequencedataanalysis
AT hanifmahmud mycovarpmycobacteriumvariantanddrugresistancepredictionpipelineforwholegenomesequencedataanalysis
AT narainkanwar mycovarpmycobacteriumvariantanddrugresistancepredictionpipelineforwholegenomesequencedataanalysis
AT ehteshamnasreenzafar mycovarpmycobacteriumvariantanddrugresistancepredictionpipelineforwholegenomesequencedataanalysis
AT hasnainseyedehtesham mycovarpmycobacteriumvariantanddrugresistancepredictionpipelineforwholegenomesequencedataanalysis
AT ahmadshandar mycovarpmycobacteriumvariantanddrugresistancepredictionpipelineforwholegenomesequencedataanalysis