Cargando…
Molecular diversity of Mycobacterium tuberculosis complex in Sikkim, India and prediction of dominant spoligotypes using artificial intelligence
In India, tuberculosis is an enormous public health problem. This study provides the first description of molecular diversity of the Mycobacterium tuberculosis complex (MTBC) from Sikkim, India. A total of 399 Acid Fast Bacilli sputum positive samples were cultured on Lőwenstein–Jensen media and gen...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8016865/ https://www.ncbi.nlm.nih.gov/pubmed/33795751 http://dx.doi.org/10.1038/s41598-021-86626-z |
_version_ | 1783673942566764544 |
---|---|
author | Devi, Kangjam Rekha Pradhan, Jagat Bhutia, Rinchenla Dadul, Peggy Sarkar, Atanu Gohain, Nitumoni Narain, Kanwar |
author_facet | Devi, Kangjam Rekha Pradhan, Jagat Bhutia, Rinchenla Dadul, Peggy Sarkar, Atanu Gohain, Nitumoni Narain, Kanwar |
author_sort | Devi, Kangjam Rekha |
collection | PubMed |
description | In India, tuberculosis is an enormous public health problem. This study provides the first description of molecular diversity of the Mycobacterium tuberculosis complex (MTBC) from Sikkim, India. A total of 399 Acid Fast Bacilli sputum positive samples were cultured on Lőwenstein–Jensen media and genetic characterisation was done by spoligotyping and 24-loci MIRU-VNTR typing. Spoligotyping revealed the occurrence of 58 different spoligotypes. Beijing spoligotype was the most dominant type constituting 62.41% of the total isolates and was associated with Multiple Drug Resistance. Minimum Spanning tree analysis of 249 Beijing strains based on 24-loci MIRU-VNTR analysis identified 12 clonal complexes (Single Locus Variants). The principal component analysis was used to visualise possible grouping of MTBC isolates from Sikkim belonging to major spoligotypes using 24-MIRU VNTR profiles. Artificial intelligence-based machine learning (ML) methods such as Random Forests (RF), Support Vector Machines (SVM) and Artificial Neural Networks (ANN) were used to predict dominant spoligotypes of MTBC using MIRU-VNTR data. K-fold cross-validation and validation using unseen testing data set revealed high accuracy of ANN, RF, and SVM for predicting Beijing, CAS1_Delhi, and T1 Spoligotypes (93–99%). However, prediction using the external new validation data set revealed that the RF model was more accurate than SVM and ANN. |
format | Online Article Text |
id | pubmed-8016865 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-80168652021-04-05 Molecular diversity of Mycobacterium tuberculosis complex in Sikkim, India and prediction of dominant spoligotypes using artificial intelligence Devi, Kangjam Rekha Pradhan, Jagat Bhutia, Rinchenla Dadul, Peggy Sarkar, Atanu Gohain, Nitumoni Narain, Kanwar Sci Rep Article In India, tuberculosis is an enormous public health problem. This study provides the first description of molecular diversity of the Mycobacterium tuberculosis complex (MTBC) from Sikkim, India. A total of 399 Acid Fast Bacilli sputum positive samples were cultured on Lőwenstein–Jensen media and genetic characterisation was done by spoligotyping and 24-loci MIRU-VNTR typing. Spoligotyping revealed the occurrence of 58 different spoligotypes. Beijing spoligotype was the most dominant type constituting 62.41% of the total isolates and was associated with Multiple Drug Resistance. Minimum Spanning tree analysis of 249 Beijing strains based on 24-loci MIRU-VNTR analysis identified 12 clonal complexes (Single Locus Variants). The principal component analysis was used to visualise possible grouping of MTBC isolates from Sikkim belonging to major spoligotypes using 24-MIRU VNTR profiles. Artificial intelligence-based machine learning (ML) methods such as Random Forests (RF), Support Vector Machines (SVM) and Artificial Neural Networks (ANN) were used to predict dominant spoligotypes of MTBC using MIRU-VNTR data. K-fold cross-validation and validation using unseen testing data set revealed high accuracy of ANN, RF, and SVM for predicting Beijing, CAS1_Delhi, and T1 Spoligotypes (93–99%). However, prediction using the external new validation data set revealed that the RF model was more accurate than SVM and ANN. Nature Publishing Group UK 2021-04-01 /pmc/articles/PMC8016865/ /pubmed/33795751 http://dx.doi.org/10.1038/s41598-021-86626-z Text en © The Author(s) 2021 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. |
spellingShingle | Article Devi, Kangjam Rekha Pradhan, Jagat Bhutia, Rinchenla Dadul, Peggy Sarkar, Atanu Gohain, Nitumoni Narain, Kanwar Molecular diversity of Mycobacterium tuberculosis complex in Sikkim, India and prediction of dominant spoligotypes using artificial intelligence |
title | Molecular diversity of Mycobacterium tuberculosis complex in Sikkim, India and prediction of dominant spoligotypes using artificial intelligence |
title_full | Molecular diversity of Mycobacterium tuberculosis complex in Sikkim, India and prediction of dominant spoligotypes using artificial intelligence |
title_fullStr | Molecular diversity of Mycobacterium tuberculosis complex in Sikkim, India and prediction of dominant spoligotypes using artificial intelligence |
title_full_unstemmed | Molecular diversity of Mycobacterium tuberculosis complex in Sikkim, India and prediction of dominant spoligotypes using artificial intelligence |
title_short | Molecular diversity of Mycobacterium tuberculosis complex in Sikkim, India and prediction of dominant spoligotypes using artificial intelligence |
title_sort | molecular diversity of mycobacterium tuberculosis complex in sikkim, india and prediction of dominant spoligotypes using artificial intelligence |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8016865/ https://www.ncbi.nlm.nih.gov/pubmed/33795751 http://dx.doi.org/10.1038/s41598-021-86626-z |
work_keys_str_mv | AT devikangjamrekha moleculardiversityofmycobacteriumtuberculosiscomplexinsikkimindiaandpredictionofdominantspoligotypesusingartificialintelligence AT pradhanjagat moleculardiversityofmycobacteriumtuberculosiscomplexinsikkimindiaandpredictionofdominantspoligotypesusingartificialintelligence AT bhutiarinchenla moleculardiversityofmycobacteriumtuberculosiscomplexinsikkimindiaandpredictionofdominantspoligotypesusingartificialintelligence AT dadulpeggy moleculardiversityofmycobacteriumtuberculosiscomplexinsikkimindiaandpredictionofdominantspoligotypesusingartificialintelligence AT sarkaratanu moleculardiversityofmycobacteriumtuberculosiscomplexinsikkimindiaandpredictionofdominantspoligotypesusingartificialintelligence AT gohainnitumoni moleculardiversityofmycobacteriumtuberculosiscomplexinsikkimindiaandpredictionofdominantspoligotypesusingartificialintelligence AT narainkanwar moleculardiversityofmycobacteriumtuberculosiscomplexinsikkimindiaandpredictionofdominantspoligotypesusingartificialintelligence |