Cargando…

Molecular diversity of Mycobacterium tuberculosis complex in Sikkim, India and prediction of dominant spoligotypes using artificial intelligence

In India, tuberculosis is an enormous public health problem. This study provides the first description of molecular diversity of the Mycobacterium tuberculosis complex (MTBC) from Sikkim, India. A total of 399 Acid Fast Bacilli sputum positive samples were cultured on Lőwenstein–Jensen media and gen...

Descripción completa

Detalles Bibliográficos
Autores principales: Devi, Kangjam Rekha, Pradhan, Jagat, Bhutia, Rinchenla, Dadul, Peggy, Sarkar, Atanu, Gohain, Nitumoni, Narain, Kanwar
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8016865/
https://www.ncbi.nlm.nih.gov/pubmed/33795751
http://dx.doi.org/10.1038/s41598-021-86626-z
_version_ 1783673942566764544
author Devi, Kangjam Rekha
Pradhan, Jagat
Bhutia, Rinchenla
Dadul, Peggy
Sarkar, Atanu
Gohain, Nitumoni
Narain, Kanwar
author_facet Devi, Kangjam Rekha
Pradhan, Jagat
Bhutia, Rinchenla
Dadul, Peggy
Sarkar, Atanu
Gohain, Nitumoni
Narain, Kanwar
author_sort Devi, Kangjam Rekha
collection PubMed
description In India, tuberculosis is an enormous public health problem. This study provides the first description of molecular diversity of the Mycobacterium tuberculosis complex (MTBC) from Sikkim, India. A total of 399 Acid Fast Bacilli sputum positive samples were cultured on Lőwenstein–Jensen media and genetic characterisation was done by spoligotyping and 24-loci MIRU-VNTR typing. Spoligotyping revealed the occurrence of 58 different spoligotypes. Beijing spoligotype was the most dominant type constituting 62.41% of the total isolates and was associated with Multiple Drug Resistance. Minimum Spanning tree analysis of 249 Beijing strains based on 24-loci MIRU-VNTR analysis identified 12 clonal complexes (Single Locus Variants). The principal component analysis was used to visualise possible grouping of MTBC isolates from Sikkim belonging to major spoligotypes using 24-MIRU VNTR profiles. Artificial intelligence-based machine learning (ML) methods such as Random Forests (RF), Support Vector Machines (SVM) and Artificial Neural Networks (ANN) were used to predict dominant spoligotypes of MTBC using MIRU-VNTR data. K-fold cross-validation and validation using unseen testing data set revealed high accuracy of ANN, RF, and SVM for predicting Beijing, CAS1_Delhi, and T1 Spoligotypes (93–99%). However, prediction using the external new validation data set revealed that the RF model was more accurate than SVM and ANN.
format Online
Article
Text
id pubmed-8016865
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-80168652021-04-05 Molecular diversity of Mycobacterium tuberculosis complex in Sikkim, India and prediction of dominant spoligotypes using artificial intelligence Devi, Kangjam Rekha Pradhan, Jagat Bhutia, Rinchenla Dadul, Peggy Sarkar, Atanu Gohain, Nitumoni Narain, Kanwar Sci Rep Article In India, tuberculosis is an enormous public health problem. This study provides the first description of molecular diversity of the Mycobacterium tuberculosis complex (MTBC) from Sikkim, India. A total of 399 Acid Fast Bacilli sputum positive samples were cultured on Lőwenstein–Jensen media and genetic characterisation was done by spoligotyping and 24-loci MIRU-VNTR typing. Spoligotyping revealed the occurrence of 58 different spoligotypes. Beijing spoligotype was the most dominant type constituting 62.41% of the total isolates and was associated with Multiple Drug Resistance. Minimum Spanning tree analysis of 249 Beijing strains based on 24-loci MIRU-VNTR analysis identified 12 clonal complexes (Single Locus Variants). The principal component analysis was used to visualise possible grouping of MTBC isolates from Sikkim belonging to major spoligotypes using 24-MIRU VNTR profiles. Artificial intelligence-based machine learning (ML) methods such as Random Forests (RF), Support Vector Machines (SVM) and Artificial Neural Networks (ANN) were used to predict dominant spoligotypes of MTBC using MIRU-VNTR data. K-fold cross-validation and validation using unseen testing data set revealed high accuracy of ANN, RF, and SVM for predicting Beijing, CAS1_Delhi, and T1 Spoligotypes (93–99%). However, prediction using the external new validation data set revealed that the RF model was more accurate than SVM and ANN. Nature Publishing Group UK 2021-04-01 /pmc/articles/PMC8016865/ /pubmed/33795751 http://dx.doi.org/10.1038/s41598-021-86626-z Text en © The Author(s) 2021 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Devi, Kangjam Rekha
Pradhan, Jagat
Bhutia, Rinchenla
Dadul, Peggy
Sarkar, Atanu
Gohain, Nitumoni
Narain, Kanwar
Molecular diversity of Mycobacterium tuberculosis complex in Sikkim, India and prediction of dominant spoligotypes using artificial intelligence
title Molecular diversity of Mycobacterium tuberculosis complex in Sikkim, India and prediction of dominant spoligotypes using artificial intelligence
title_full Molecular diversity of Mycobacterium tuberculosis complex in Sikkim, India and prediction of dominant spoligotypes using artificial intelligence
title_fullStr Molecular diversity of Mycobacterium tuberculosis complex in Sikkim, India and prediction of dominant spoligotypes using artificial intelligence
title_full_unstemmed Molecular diversity of Mycobacterium tuberculosis complex in Sikkim, India and prediction of dominant spoligotypes using artificial intelligence
title_short Molecular diversity of Mycobacterium tuberculosis complex in Sikkim, India and prediction of dominant spoligotypes using artificial intelligence
title_sort molecular diversity of mycobacterium tuberculosis complex in sikkim, india and prediction of dominant spoligotypes using artificial intelligence
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8016865/
https://www.ncbi.nlm.nih.gov/pubmed/33795751
http://dx.doi.org/10.1038/s41598-021-86626-z
work_keys_str_mv AT devikangjamrekha moleculardiversityofmycobacteriumtuberculosiscomplexinsikkimindiaandpredictionofdominantspoligotypesusingartificialintelligence
AT pradhanjagat moleculardiversityofmycobacteriumtuberculosiscomplexinsikkimindiaandpredictionofdominantspoligotypesusingartificialintelligence
AT bhutiarinchenla moleculardiversityofmycobacteriumtuberculosiscomplexinsikkimindiaandpredictionofdominantspoligotypesusingartificialintelligence
AT dadulpeggy moleculardiversityofmycobacteriumtuberculosiscomplexinsikkimindiaandpredictionofdominantspoligotypesusingartificialintelligence
AT sarkaratanu moleculardiversityofmycobacteriumtuberculosiscomplexinsikkimindiaandpredictionofdominantspoligotypesusingartificialintelligence
AT gohainnitumoni moleculardiversityofmycobacteriumtuberculosiscomplexinsikkimindiaandpredictionofdominantspoligotypesusingartificialintelligence
AT narainkanwar moleculardiversityofmycobacteriumtuberculosiscomplexinsikkimindiaandpredictionofdominantspoligotypesusingartificialintelligence