Cargando…

The Cano-eMLST Program: An Approach for the Calculation of Canonical Extended Multi-Locus Sequence Typing, Making Comparison of Genetic Differences Among Bunches of Bacterial Strains

Extended multi-locus sequence typing (eMLST) methods have become popular in the field of genomic epidemiology. Before eMLST methods can be applied in epidemiological investigations, the selection of a suitable scheme is critical. The core genome scheme (cgMLST) has become the most popular eMLST appr...

Descripción completa

Detalles Bibliográficos
Autores principales: Liu, Yen-Yi, Lin, Ji-Wei, Chen, Chih-Chieh
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6517909/
https://www.ncbi.nlm.nih.gov/pubmed/30987190
http://dx.doi.org/10.3390/microorganisms7040098
_version_ 1783418350212218880
author Liu, Yen-Yi
Lin, Ji-Wei
Chen, Chih-Chieh
author_facet Liu, Yen-Yi
Lin, Ji-Wei
Chen, Chih-Chieh
author_sort Liu, Yen-Yi
collection PubMed
description Extended multi-locus sequence typing (eMLST) methods have become popular in the field of genomic epidemiology. Before eMLST methods can be applied in epidemiological investigations, the selection of a suitable scheme is critical. The core genome scheme (cgMLST) has become the most popular eMLST approach for strain typing in the epidemiological domain. In addition to strain typing, many public health researchers and clinical microbiologists wish to investigate which genes cause genetic differences between compared strains. Therefore, a tool that can be used to extract canonical genes with an eMLST scheme would be particularly useful. In this study, we present cano-eMLST, a well-designed program that applies a feature-selection methodology to create a canonical locus combination with discriminatory power by traversing a genetic relatedness tree based on a user-selected scheme. The cano-eMLST program is provided mainly to help infectious disease laboratory researchers identify potential factors related to bacterial pathogenesis. The core program (tree-traversing approach) of cano-eMLST is implemented in Perl and Python. All the necessary dependencies and environmental settings are provided in the encapsulated version (VirtualBox or VMware) and self-installation version (all use source code and libraries).
format Online
Article
Text
id pubmed-6517909
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-65179092019-05-31 The Cano-eMLST Program: An Approach for the Calculation of Canonical Extended Multi-Locus Sequence Typing, Making Comparison of Genetic Differences Among Bunches of Bacterial Strains Liu, Yen-Yi Lin, Ji-Wei Chen, Chih-Chieh Microorganisms Article Extended multi-locus sequence typing (eMLST) methods have become popular in the field of genomic epidemiology. Before eMLST methods can be applied in epidemiological investigations, the selection of a suitable scheme is critical. The core genome scheme (cgMLST) has become the most popular eMLST approach for strain typing in the epidemiological domain. In addition to strain typing, many public health researchers and clinical microbiologists wish to investigate which genes cause genetic differences between compared strains. Therefore, a tool that can be used to extract canonical genes with an eMLST scheme would be particularly useful. In this study, we present cano-eMLST, a well-designed program that applies a feature-selection methodology to create a canonical locus combination with discriminatory power by traversing a genetic relatedness tree based on a user-selected scheme. The cano-eMLST program is provided mainly to help infectious disease laboratory researchers identify potential factors related to bacterial pathogenesis. The core program (tree-traversing approach) of cano-eMLST is implemented in Perl and Python. All the necessary dependencies and environmental settings are provided in the encapsulated version (VirtualBox or VMware) and self-installation version (all use source code and libraries). MDPI 2019-04-03 /pmc/articles/PMC6517909/ /pubmed/30987190 http://dx.doi.org/10.3390/microorganisms7040098 Text en © 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Liu, Yen-Yi
Lin, Ji-Wei
Chen, Chih-Chieh
The Cano-eMLST Program: An Approach for the Calculation of Canonical Extended Multi-Locus Sequence Typing, Making Comparison of Genetic Differences Among Bunches of Bacterial Strains
title The Cano-eMLST Program: An Approach for the Calculation of Canonical Extended Multi-Locus Sequence Typing, Making Comparison of Genetic Differences Among Bunches of Bacterial Strains
title_full The Cano-eMLST Program: An Approach for the Calculation of Canonical Extended Multi-Locus Sequence Typing, Making Comparison of Genetic Differences Among Bunches of Bacterial Strains
title_fullStr The Cano-eMLST Program: An Approach for the Calculation of Canonical Extended Multi-Locus Sequence Typing, Making Comparison of Genetic Differences Among Bunches of Bacterial Strains
title_full_unstemmed The Cano-eMLST Program: An Approach for the Calculation of Canonical Extended Multi-Locus Sequence Typing, Making Comparison of Genetic Differences Among Bunches of Bacterial Strains
title_short The Cano-eMLST Program: An Approach for the Calculation of Canonical Extended Multi-Locus Sequence Typing, Making Comparison of Genetic Differences Among Bunches of Bacterial Strains
title_sort cano-emlst program: an approach for the calculation of canonical extended multi-locus sequence typing, making comparison of genetic differences among bunches of bacterial strains
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6517909/
https://www.ncbi.nlm.nih.gov/pubmed/30987190
http://dx.doi.org/10.3390/microorganisms7040098
work_keys_str_mv AT liuyenyi thecanoemlstprogramanapproachforthecalculationofcanonicalextendedmultilocussequencetypingmakingcomparisonofgeneticdifferencesamongbunchesofbacterialstrains
AT linjiwei thecanoemlstprogramanapproachforthecalculationofcanonicalextendedmultilocussequencetypingmakingcomparisonofgeneticdifferencesamongbunchesofbacterialstrains
AT chenchihchieh thecanoemlstprogramanapproachforthecalculationofcanonicalextendedmultilocussequencetypingmakingcomparisonofgeneticdifferencesamongbunchesofbacterialstrains
AT liuyenyi canoemlstprogramanapproachforthecalculationofcanonicalextendedmultilocussequencetypingmakingcomparisonofgeneticdifferencesamongbunchesofbacterialstrains
AT linjiwei canoemlstprogramanapproachforthecalculationofcanonicalextendedmultilocussequencetypingmakingcomparisonofgeneticdifferencesamongbunchesofbacterialstrains
AT chenchihchieh canoemlstprogramanapproachforthecalculationofcanonicalextendedmultilocussequencetypingmakingcomparisonofgeneticdifferencesamongbunchesofbacterialstrains