Cargando…

mStrain: strain-level identification of Yersinia pestis using metagenomic data

MOTIVATION: High-resolution target pathogen detection using metagenomic sequencing data represents a major challenge due to the low concentration of target pathogens in samples. We introduced mStrain, a novel Yesinia pestis strain/lineage-level identification tool that utilizes metagenomic data. mSt...

Descripción completa

Detalles Bibliográficos
Autores principales: Qian, Xiuwei, Wu, Yarong, Zuo, Xiujuan, Peng, Xin, Guo, Yan, Yang, Ruifu, Zhang, Xianglilan, Cui, Yujun
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10516513/
https://www.ncbi.nlm.nih.gov/pubmed/37745000
http://dx.doi.org/10.1093/bioadv/vbad115
Descripción
Sumario:MOTIVATION: High-resolution target pathogen detection using metagenomic sequencing data represents a major challenge due to the low concentration of target pathogens in samples. We introduced mStrain, a novel Yesinia pestis strain/lineage-level identification tool that utilizes metagenomic data. mStrain successfully identified Y. pestis at the strain/lineage level by extracting sufficient information regarding single-nucleotide polymorphisms (SNPs), which can therefore be an effective tool for identification and source tracking of Y. pestis based on metagenomic data during a plague outbreak. DEFINITION:   STRAIN-LEVEL IDENTIFICATION: Assigning the reads in the metagenomic sequencing data to an exactly known or most closely representative Y. pestis strain. LINEAGE-LEVEL IDENTIFICATION: Assigning the reads in the metagenomic sequencing data to a specific lineage on the phylogenetic tree. CANOSNPS: The unique and typical SNPs present in all representative strains. ANCESTOR/DERIVED STATE: An SNP is defined as the ancestor state when consistent with the allele of Yersinia pseudotuberculosis strain IP32953; otherwise, the SNP is defined as the derived state. AVAILABILITY AND IMPLEMENTATION: The code for running mStrain, the test dataset, and instructions for running the code can be found at the following GitHub repository: https://github.com/xwqian1123/mStrain.