Cargando…
Understanding the Genetic Diversity of Mycobacterium africanum Using Phylogenetics and Population Genomics Approaches
A total of two lineages of Mycobacterium tuberculosis var. africanum (Maf), L5 and L6, which are members of the Mycobacterium tuberculosis complex (MTBC), are responsible for causing tuberculosis in West Africa. Regions of difference (RDs) are usually used for delineation of MTBC. With increased dat...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9043288/ https://www.ncbi.nlm.nih.gov/pubmed/35495132 http://dx.doi.org/10.3389/fgene.2022.800083 |
_version_ | 1784694843166949376 |
---|---|
author | Balamurugan, Muthukumar Banerjee, Ruma Kasibhatla, Sunitha Manjari Achalere, Archana Joshi, Rajendra |
author_facet | Balamurugan, Muthukumar Banerjee, Ruma Kasibhatla, Sunitha Manjari Achalere, Archana Joshi, Rajendra |
author_sort | Balamurugan, Muthukumar |
collection | PubMed |
description | A total of two lineages of Mycobacterium tuberculosis var. africanum (Maf), L5 and L6, which are members of the Mycobacterium tuberculosis complex (MTBC), are responsible for causing tuberculosis in West Africa. Regions of difference (RDs) are usually used for delineation of MTBC. With increased data availability, single nucleotide polymorphisms (SNPs) promise to provide better resolution. Publicly available 380 Maf samples were analyzed for identification of “core-cluster-specific-SNPs,” while additional 270 samples were used for validation. RD-based methods were used for lineage-assignment, wherein 31 samples remained unidentified. The genetic diversity of Maf was estimated based on genome-wide SNPs using phylogeny and population genomics approaches. Lineage-based clustering (L5 and L6) was observed in the whole genome phylogeny with distinct sub-clusters. Population stratification using both model-based and de novo approaches supported the same observations. L6 was further delineated into three sub-lineages (L6.1–L6.3), whereas L5 was grouped as L5.1 and L5.2 based on the occurrence of RD711. L5.1 and L5.2 were further divided into two (L5.1.1 and L5.1.2) and four (L5.2.1–L5.2.4) sub-clusters, respectively. Unassigned samples could be assigned to definite lineages/sub-lineages based on clustering observed in phylogeny along with high-confidence posterior membership scores obtained during population stratification. Based on the (sub)-clusters delineated, “core-cluster-specific-SNPs” were derived. Synonymous SNPs (137 in L5 and 128 in L6) were identified as biomarkers and used for validation. Few of the cluster-specific missense variants in L5 and L6 belong to the central carbohydrate metabolism pathway which include His6Tyr (Rv0946c), Glu255Ala (Rv1131), Ala309Gly (Rv2454c), Val425Ala and Ser112Ala (Rv1127c), Gly198Ala (Rv3293) and Ile137Val (Rv0363c), Thr421Ala (Rv0896), Arg442His (Rv1248c), Thr218Ile (Rv1122), and Ser381Leu (Rv1449c), hinting at the differential growth attenuation. Genes harboring multiple (sub)-lineage-specific “core-cluster” SNPs such as Lys117Asn, Val447Met, and Ala455Val (Rv0066c; icd2) present across L6, L6.1, and L5, respectively, hinting at the association of these SNPs with selective advantage or host-adaptation. Cluster-specific SNPs serve as additional markers along with RD-regions for Maf delineation. The identified SNPs have the potential to provide insights into the genotype–phenotype correlation and clues for endemicity of Maf in the African population. |
format | Online Article Text |
id | pubmed-9043288 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-90432882022-04-28 Understanding the Genetic Diversity of Mycobacterium africanum Using Phylogenetics and Population Genomics Approaches Balamurugan, Muthukumar Banerjee, Ruma Kasibhatla, Sunitha Manjari Achalere, Archana Joshi, Rajendra Front Genet Genetics A total of two lineages of Mycobacterium tuberculosis var. africanum (Maf), L5 and L6, which are members of the Mycobacterium tuberculosis complex (MTBC), are responsible for causing tuberculosis in West Africa. Regions of difference (RDs) are usually used for delineation of MTBC. With increased data availability, single nucleotide polymorphisms (SNPs) promise to provide better resolution. Publicly available 380 Maf samples were analyzed for identification of “core-cluster-specific-SNPs,” while additional 270 samples were used for validation. RD-based methods were used for lineage-assignment, wherein 31 samples remained unidentified. The genetic diversity of Maf was estimated based on genome-wide SNPs using phylogeny and population genomics approaches. Lineage-based clustering (L5 and L6) was observed in the whole genome phylogeny with distinct sub-clusters. Population stratification using both model-based and de novo approaches supported the same observations. L6 was further delineated into three sub-lineages (L6.1–L6.3), whereas L5 was grouped as L5.1 and L5.2 based on the occurrence of RD711. L5.1 and L5.2 were further divided into two (L5.1.1 and L5.1.2) and four (L5.2.1–L5.2.4) sub-clusters, respectively. Unassigned samples could be assigned to definite lineages/sub-lineages based on clustering observed in phylogeny along with high-confidence posterior membership scores obtained during population stratification. Based on the (sub)-clusters delineated, “core-cluster-specific-SNPs” were derived. Synonymous SNPs (137 in L5 and 128 in L6) were identified as biomarkers and used for validation. Few of the cluster-specific missense variants in L5 and L6 belong to the central carbohydrate metabolism pathway which include His6Tyr (Rv0946c), Glu255Ala (Rv1131), Ala309Gly (Rv2454c), Val425Ala and Ser112Ala (Rv1127c), Gly198Ala (Rv3293) and Ile137Val (Rv0363c), Thr421Ala (Rv0896), Arg442His (Rv1248c), Thr218Ile (Rv1122), and Ser381Leu (Rv1449c), hinting at the differential growth attenuation. Genes harboring multiple (sub)-lineage-specific “core-cluster” SNPs such as Lys117Asn, Val447Met, and Ala455Val (Rv0066c; icd2) present across L6, L6.1, and L5, respectively, hinting at the association of these SNPs with selective advantage or host-adaptation. Cluster-specific SNPs serve as additional markers along with RD-regions for Maf delineation. The identified SNPs have the potential to provide insights into the genotype–phenotype correlation and clues for endemicity of Maf in the African population. Frontiers Media S.A. 2022-04-13 /pmc/articles/PMC9043288/ /pubmed/35495132 http://dx.doi.org/10.3389/fgene.2022.800083 Text en Copyright © 2022 Balamurugan, Banerjee, Kasibhatla, Achalere and Joshi. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Genetics Balamurugan, Muthukumar Banerjee, Ruma Kasibhatla, Sunitha Manjari Achalere, Archana Joshi, Rajendra Understanding the Genetic Diversity of Mycobacterium africanum Using Phylogenetics and Population Genomics Approaches |
title | Understanding the Genetic Diversity of Mycobacterium africanum Using Phylogenetics and Population Genomics Approaches |
title_full | Understanding the Genetic Diversity of Mycobacterium africanum Using Phylogenetics and Population Genomics Approaches |
title_fullStr | Understanding the Genetic Diversity of Mycobacterium africanum Using Phylogenetics and Population Genomics Approaches |
title_full_unstemmed | Understanding the Genetic Diversity of Mycobacterium africanum Using Phylogenetics and Population Genomics Approaches |
title_short | Understanding the Genetic Diversity of Mycobacterium africanum Using Phylogenetics and Population Genomics Approaches |
title_sort | understanding the genetic diversity of mycobacterium africanum using phylogenetics and population genomics approaches |
topic | Genetics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9043288/ https://www.ncbi.nlm.nih.gov/pubmed/35495132 http://dx.doi.org/10.3389/fgene.2022.800083 |
work_keys_str_mv | AT balamuruganmuthukumar understandingthegeneticdiversityofmycobacteriumafricanumusingphylogeneticsandpopulationgenomicsapproaches AT banerjeeruma understandingthegeneticdiversityofmycobacteriumafricanumusingphylogeneticsandpopulationgenomicsapproaches AT kasibhatlasunithamanjari understandingthegeneticdiversityofmycobacteriumafricanumusingphylogeneticsandpopulationgenomicsapproaches AT achalerearchana understandingthegeneticdiversityofmycobacteriumafricanumusingphylogeneticsandpopulationgenomicsapproaches AT joshirajendra understandingthegeneticdiversityofmycobacteriumafricanumusingphylogeneticsandpopulationgenomicsapproaches |