Cargando…
Using affinity propagation for identifying subspecies among clonal organisms: lessons from M. tuberculosis
BACKGROUND: Classification and naming is a key step in the analysis, understanding and adequate management of living organisms. However, where to set limits between groups can be puzzling especially in clonal organisms. Within the Mycobacterium tuberculosis complex (MTC), the etiological agent of tu...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2011
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3126747/ https://www.ncbi.nlm.nih.gov/pubmed/21635750 http://dx.doi.org/10.1186/1471-2105-12-224 |
_version_ | 1782207282394693632 |
---|---|
author | Borile, Claudio Labarre, Mathieu Franz, Silvio Sola, Christophe Refrégier, Guislaine |
author_facet | Borile, Claudio Labarre, Mathieu Franz, Silvio Sola, Christophe Refrégier, Guislaine |
author_sort | Borile, Claudio |
collection | PubMed |
description | BACKGROUND: Classification and naming is a key step in the analysis, understanding and adequate management of living organisms. However, where to set limits between groups can be puzzling especially in clonal organisms. Within the Mycobacterium tuberculosis complex (MTC), the etiological agent of tuberculosis (TB), experts have first identified several groups according to their pattern at repetitive sequences, especially at the CRISPR locus (spoligotyping), and to their epidemiological relevance. Most groups such as "Beijing" found good support when tested with other loci. However, other groups such as T family and T1 subfamily (belonging to the "Euro-American" lineage) correspond to non-monophyletic groups and still need to be refined. Here, we propose to use a method called Affinity Propagation that has been successfully used in image categorization to identify relevant patterns at the CRISPR locus in MTC. RESULTS: To adequately infer the relative divergence time between strains, we used a distance method inspired by the recent evolutionary model by Reyes et al. We first confirm that this method performs better than the Jaccard index commonly used to compare spoligotype patterns. Second, we document the support of each spoligotype family among the previous classification using affinity propagation on the international spoligotyping database SpolDB4. This allowed us to propose a consensus assignation for all SpolDB4 spoligotypes. Third, we propose new signatures to subclassify the T family. CONCLUSION: Altogether, this study shows how the new clustering algorithm Affinity Propagation can help building or refining clonal organims classifications. It also describes well-supported families and subfamilies among M. tuberculosis complex, especially inside the modern "Euro-American" lineage. |
format | Online Article Text |
id | pubmed-3126747 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2011 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-31267472011-06-30 Using affinity propagation for identifying subspecies among clonal organisms: lessons from M. tuberculosis Borile, Claudio Labarre, Mathieu Franz, Silvio Sola, Christophe Refrégier, Guislaine BMC Bioinformatics Research Article BACKGROUND: Classification and naming is a key step in the analysis, understanding and adequate management of living organisms. However, where to set limits between groups can be puzzling especially in clonal organisms. Within the Mycobacterium tuberculosis complex (MTC), the etiological agent of tuberculosis (TB), experts have first identified several groups according to their pattern at repetitive sequences, especially at the CRISPR locus (spoligotyping), and to their epidemiological relevance. Most groups such as "Beijing" found good support when tested with other loci. However, other groups such as T family and T1 subfamily (belonging to the "Euro-American" lineage) correspond to non-monophyletic groups and still need to be refined. Here, we propose to use a method called Affinity Propagation that has been successfully used in image categorization to identify relevant patterns at the CRISPR locus in MTC. RESULTS: To adequately infer the relative divergence time between strains, we used a distance method inspired by the recent evolutionary model by Reyes et al. We first confirm that this method performs better than the Jaccard index commonly used to compare spoligotype patterns. Second, we document the support of each spoligotype family among the previous classification using affinity propagation on the international spoligotyping database SpolDB4. This allowed us to propose a consensus assignation for all SpolDB4 spoligotypes. Third, we propose new signatures to subclassify the T family. CONCLUSION: Altogether, this study shows how the new clustering algorithm Affinity Propagation can help building or refining clonal organims classifications. It also describes well-supported families and subfamilies among M. tuberculosis complex, especially inside the modern "Euro-American" lineage. BioMed Central 2011-06-02 /pmc/articles/PMC3126747/ /pubmed/21635750 http://dx.doi.org/10.1186/1471-2105-12-224 Text en Copyright ©2011 Borile et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Borile, Claudio Labarre, Mathieu Franz, Silvio Sola, Christophe Refrégier, Guislaine Using affinity propagation for identifying subspecies among clonal organisms: lessons from M. tuberculosis |
title | Using affinity propagation for identifying subspecies among clonal organisms: lessons from M. tuberculosis |
title_full | Using affinity propagation for identifying subspecies among clonal organisms: lessons from M. tuberculosis |
title_fullStr | Using affinity propagation for identifying subspecies among clonal organisms: lessons from M. tuberculosis |
title_full_unstemmed | Using affinity propagation for identifying subspecies among clonal organisms: lessons from M. tuberculosis |
title_short | Using affinity propagation for identifying subspecies among clonal organisms: lessons from M. tuberculosis |
title_sort | using affinity propagation for identifying subspecies among clonal organisms: lessons from m. tuberculosis |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3126747/ https://www.ncbi.nlm.nih.gov/pubmed/21635750 http://dx.doi.org/10.1186/1471-2105-12-224 |
work_keys_str_mv | AT borileclaudio usingaffinitypropagationforidentifyingsubspeciesamongclonalorganismslessonsfrommtuberculosis AT labarremathieu usingaffinitypropagationforidentifyingsubspeciesamongclonalorganismslessonsfrommtuberculosis AT franzsilvio usingaffinitypropagationforidentifyingsubspeciesamongclonalorganismslessonsfrommtuberculosis AT solachristophe usingaffinitypropagationforidentifyingsubspeciesamongclonalorganismslessonsfrommtuberculosis AT refregierguislaine usingaffinitypropagationforidentifyingsubspeciesamongclonalorganismslessonsfrommtuberculosis |