Cargando…

Software for selecting the most informative sets of genomic loci for multi-target microbial typing

BACKGROUND: High-throughput sequencing can identify numerous potential genomic targets for microbial strain typing, but identification of the most informative combinations requires the use of computational screening tools. This paper describes novel software – Automated Selection of Typing Target Su...

Descripción completa

Detalles Bibliográficos
Autores principales: O’Sullivan, Matthew VN, Sintchenko, Vitali, Gilbert, Gwendolyn L
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3660239/
https://www.ncbi.nlm.nih.gov/pubmed/23635100
http://dx.doi.org/10.1186/1471-2105-14-148
_version_ 1782270524255109120
author O’Sullivan, Matthew VN
Sintchenko, Vitali
Gilbert, Gwendolyn L
author_facet O’Sullivan, Matthew VN
Sintchenko, Vitali
Gilbert, Gwendolyn L
author_sort O’Sullivan, Matthew VN
collection PubMed
description BACKGROUND: High-throughput sequencing can identify numerous potential genomic targets for microbial strain typing, but identification of the most informative combinations requires the use of computational screening tools. This paper describes novel software – Automated Selection of Typing Target Subsets (AuSeTTS) - that allows intelligent selection of optimal targets for pathogen strain typing. The objective of this software is to maximise both discriminatory power, using Simpson’s index of diversity (D), and concordance with existing typing methods, using the adjusted Wallace coefficient (AW). The program interrogates molecular typing results for panels of isolates, based on large target sets, and iteratively examines each target, one-by-one, to determine the most informative subset. RESULTS: AuSeTTS was evaluated using three target sets: 51 binary targets (13 toxin genes, 16 phage-related loci and 22 SCCmec elements), used for multilocus typing of 153 methicillin-resistant Staphylococcus aureus (MRSA) isolates; 17 MLVA loci in 502 Streptococcus pneumoniae isolates from the MLVA database (http://www.mlva.eu) and 12 MLST loci for 98 Cryptococcus spp. isolates. The maximum D for MRSA, 0.984, was achieved with a subset of 20 targets and a D value of 0.954 with 7 targets. Twelve targets predicted MLST with a maximum AW of 0.9994. All 17 S. pneumoniae MLVA targets were required to achieve maximum D of 0.997, but 4 targets reached D of 0.990. Twelve targets predicted pneumococcal serotype with a maximum AW of 0.899 and 9 predicted MLST with maximum AW of 0.963. Eight of the 12 MLST loci were sufficient to achieve the maximum D of 0.963 for Cryptococcus spp. CONCLUSIONS: Computerised analysis with AuSeTTS allows rapid selection of the most discriminatory targets for incorporation into typing schemes. Output of the program is presented in both tabular and graphical formats and the software is available for free download from http://www.cidmpublichealth.org/pages/ausetts.html.
format Online
Article
Text
id pubmed-3660239
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-36602392013-05-23 Software for selecting the most informative sets of genomic loci for multi-target microbial typing O’Sullivan, Matthew VN Sintchenko, Vitali Gilbert, Gwendolyn L BMC Bioinformatics Software BACKGROUND: High-throughput sequencing can identify numerous potential genomic targets for microbial strain typing, but identification of the most informative combinations requires the use of computational screening tools. This paper describes novel software – Automated Selection of Typing Target Subsets (AuSeTTS) - that allows intelligent selection of optimal targets for pathogen strain typing. The objective of this software is to maximise both discriminatory power, using Simpson’s index of diversity (D), and concordance with existing typing methods, using the adjusted Wallace coefficient (AW). The program interrogates molecular typing results for panels of isolates, based on large target sets, and iteratively examines each target, one-by-one, to determine the most informative subset. RESULTS: AuSeTTS was evaluated using three target sets: 51 binary targets (13 toxin genes, 16 phage-related loci and 22 SCCmec elements), used for multilocus typing of 153 methicillin-resistant Staphylococcus aureus (MRSA) isolates; 17 MLVA loci in 502 Streptococcus pneumoniae isolates from the MLVA database (http://www.mlva.eu) and 12 MLST loci for 98 Cryptococcus spp. isolates. The maximum D for MRSA, 0.984, was achieved with a subset of 20 targets and a D value of 0.954 with 7 targets. Twelve targets predicted MLST with a maximum AW of 0.9994. All 17 S. pneumoniae MLVA targets were required to achieve maximum D of 0.997, but 4 targets reached D of 0.990. Twelve targets predicted pneumococcal serotype with a maximum AW of 0.899 and 9 predicted MLST with maximum AW of 0.963. Eight of the 12 MLST loci were sufficient to achieve the maximum D of 0.963 for Cryptococcus spp. CONCLUSIONS: Computerised analysis with AuSeTTS allows rapid selection of the most discriminatory targets for incorporation into typing schemes. Output of the program is presented in both tabular and graphical formats and the software is available for free download from http://www.cidmpublichealth.org/pages/ausetts.html. BioMed Central 2013-05-01 /pmc/articles/PMC3660239/ /pubmed/23635100 http://dx.doi.org/10.1186/1471-2105-14-148 Text en Copyright © 2013 O’Sullivan et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Software
O’Sullivan, Matthew VN
Sintchenko, Vitali
Gilbert, Gwendolyn L
Software for selecting the most informative sets of genomic loci for multi-target microbial typing
title Software for selecting the most informative sets of genomic loci for multi-target microbial typing
title_full Software for selecting the most informative sets of genomic loci for multi-target microbial typing
title_fullStr Software for selecting the most informative sets of genomic loci for multi-target microbial typing
title_full_unstemmed Software for selecting the most informative sets of genomic loci for multi-target microbial typing
title_short Software for selecting the most informative sets of genomic loci for multi-target microbial typing
title_sort software for selecting the most informative sets of genomic loci for multi-target microbial typing
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3660239/
https://www.ncbi.nlm.nih.gov/pubmed/23635100
http://dx.doi.org/10.1186/1471-2105-14-148
work_keys_str_mv AT osullivanmatthewvn softwareforselectingthemostinformativesetsofgenomiclociformultitargetmicrobialtyping
AT sintchenkovitali softwareforselectingthemostinformativesetsofgenomiclociformultitargetmicrobialtyping
AT gilbertgwendolynl softwareforselectingthemostinformativesetsofgenomiclociformultitargetmicrobialtyping