Cargando…

Using affinity propagation for identifying subspecies among clonal organisms: lessons from M. tuberculosis

BACKGROUND: Classification and naming is a key step in the analysis, understanding and adequate management of living organisms. However, where to set limits between groups can be puzzling especially in clonal organisms. Within the Mycobacterium tuberculosis complex (MTC), the etiological agent of tu...

Descripción completa

Detalles Bibliográficos
Autores principales: Borile, Claudio, Labarre, Mathieu, Franz, Silvio, Sola, Christophe, Refrégier, Guislaine
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3126747/
https://www.ncbi.nlm.nih.gov/pubmed/21635750
http://dx.doi.org/10.1186/1471-2105-12-224
_version_ 1782207282394693632
author Borile, Claudio
Labarre, Mathieu
Franz, Silvio
Sola, Christophe
Refrégier, Guislaine
author_facet Borile, Claudio
Labarre, Mathieu
Franz, Silvio
Sola, Christophe
Refrégier, Guislaine
author_sort Borile, Claudio
collection PubMed
description BACKGROUND: Classification and naming is a key step in the analysis, understanding and adequate management of living organisms. However, where to set limits between groups can be puzzling especially in clonal organisms. Within the Mycobacterium tuberculosis complex (MTC), the etiological agent of tuberculosis (TB), experts have first identified several groups according to their pattern at repetitive sequences, especially at the CRISPR locus (spoligotyping), and to their epidemiological relevance. Most groups such as "Beijing" found good support when tested with other loci. However, other groups such as T family and T1 subfamily (belonging to the "Euro-American" lineage) correspond to non-monophyletic groups and still need to be refined. Here, we propose to use a method called Affinity Propagation that has been successfully used in image categorization to identify relevant patterns at the CRISPR locus in MTC. RESULTS: To adequately infer the relative divergence time between strains, we used a distance method inspired by the recent evolutionary model by Reyes et al. We first confirm that this method performs better than the Jaccard index commonly used to compare spoligotype patterns. Second, we document the support of each spoligotype family among the previous classification using affinity propagation on the international spoligotyping database SpolDB4. This allowed us to propose a consensus assignation for all SpolDB4 spoligotypes. Third, we propose new signatures to subclassify the T family. CONCLUSION: Altogether, this study shows how the new clustering algorithm Affinity Propagation can help building or refining clonal organims classifications. It also describes well-supported families and subfamilies among M. tuberculosis complex, especially inside the modern "Euro-American" lineage.
format Online
Article
Text
id pubmed-3126747
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-31267472011-06-30 Using affinity propagation for identifying subspecies among clonal organisms: lessons from M. tuberculosis Borile, Claudio Labarre, Mathieu Franz, Silvio Sola, Christophe Refrégier, Guislaine BMC Bioinformatics Research Article BACKGROUND: Classification and naming is a key step in the analysis, understanding and adequate management of living organisms. However, where to set limits between groups can be puzzling especially in clonal organisms. Within the Mycobacterium tuberculosis complex (MTC), the etiological agent of tuberculosis (TB), experts have first identified several groups according to their pattern at repetitive sequences, especially at the CRISPR locus (spoligotyping), and to their epidemiological relevance. Most groups such as "Beijing" found good support when tested with other loci. However, other groups such as T family and T1 subfamily (belonging to the "Euro-American" lineage) correspond to non-monophyletic groups and still need to be refined. Here, we propose to use a method called Affinity Propagation that has been successfully used in image categorization to identify relevant patterns at the CRISPR locus in MTC. RESULTS: To adequately infer the relative divergence time between strains, we used a distance method inspired by the recent evolutionary model by Reyes et al. We first confirm that this method performs better than the Jaccard index commonly used to compare spoligotype patterns. Second, we document the support of each spoligotype family among the previous classification using affinity propagation on the international spoligotyping database SpolDB4. This allowed us to propose a consensus assignation for all SpolDB4 spoligotypes. Third, we propose new signatures to subclassify the T family. CONCLUSION: Altogether, this study shows how the new clustering algorithm Affinity Propagation can help building or refining clonal organims classifications. It also describes well-supported families and subfamilies among M. tuberculosis complex, especially inside the modern "Euro-American" lineage. BioMed Central 2011-06-02 /pmc/articles/PMC3126747/ /pubmed/21635750 http://dx.doi.org/10.1186/1471-2105-12-224 Text en Copyright ©2011 Borile et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Borile, Claudio
Labarre, Mathieu
Franz, Silvio
Sola, Christophe
Refrégier, Guislaine
Using affinity propagation for identifying subspecies among clonal organisms: lessons from M. tuberculosis
title Using affinity propagation for identifying subspecies among clonal organisms: lessons from M. tuberculosis
title_full Using affinity propagation for identifying subspecies among clonal organisms: lessons from M. tuberculosis
title_fullStr Using affinity propagation for identifying subspecies among clonal organisms: lessons from M. tuberculosis
title_full_unstemmed Using affinity propagation for identifying subspecies among clonal organisms: lessons from M. tuberculosis
title_short Using affinity propagation for identifying subspecies among clonal organisms: lessons from M. tuberculosis
title_sort using affinity propagation for identifying subspecies among clonal organisms: lessons from m. tuberculosis
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3126747/
https://www.ncbi.nlm.nih.gov/pubmed/21635750
http://dx.doi.org/10.1186/1471-2105-12-224
work_keys_str_mv AT borileclaudio usingaffinitypropagationforidentifyingsubspeciesamongclonalorganismslessonsfrommtuberculosis
AT labarremathieu usingaffinitypropagationforidentifyingsubspeciesamongclonalorganismslessonsfrommtuberculosis
AT franzsilvio usingaffinitypropagationforidentifyingsubspeciesamongclonalorganismslessonsfrommtuberculosis
AT solachristophe usingaffinitypropagationforidentifyingsubspeciesamongclonalorganismslessonsfrommtuberculosis
AT refregierguislaine usingaffinitypropagationforidentifyingsubspeciesamongclonalorganismslessonsfrommtuberculosis