Cargando…

Computer-aided identification of polymorphism sets diagnostic for groups of bacterial and viral genetic variants

BACKGROUND: Single nucleotide polymorphisms (SNPs) and genes that exhibit presence/absence variation have provided informative marker sets for bacterial and viral genotyping. Identification of marker sets optimised for these purposes has been based on maximal generalized discriminatory power as meas...

Descripción completa

Detalles Bibliográficos
Autores principales: Price, Erin P, Inman-Bamber, John, Thiruvenkataswamy, Venugopal, Huygens, Flavia, Giffard, Philip M
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2007
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1973086/
https://www.ncbi.nlm.nih.gov/pubmed/17672919
http://dx.doi.org/10.1186/1471-2105-8-278
_version_ 1782135036452012032
author Price, Erin P
Inman-Bamber, John
Thiruvenkataswamy, Venugopal
Huygens, Flavia
Giffard, Philip M
author_facet Price, Erin P
Inman-Bamber, John
Thiruvenkataswamy, Venugopal
Huygens, Flavia
Giffard, Philip M
author_sort Price, Erin P
collection PubMed
description BACKGROUND: Single nucleotide polymorphisms (SNPs) and genes that exhibit presence/absence variation have provided informative marker sets for bacterial and viral genotyping. Identification of marker sets optimised for these purposes has been based on maximal generalized discriminatory power as measured by Simpson's Index of Diversity, or on the ability to identify specific variants. Here we describe the Not-N algorithm, which is designed to identify small sets of genetic markers diagnostic for user-specified subsets of known genetic variants. The algorithm does not treat the user-specified subset and the remaining genetic variants equally. Rather Not-N analysis is designed to underpin assays that provide 0% false negatives, which is very important for e.g. diagnostic procedures for clinically significant subgroups within microbial species. RESULTS: The Not-N algorithm has been incorporated into the "Minimum SNPs" computer program and used to derive genetic markers diagnostic for multilocus sequence typing-defined clonal complexes, hepatitis C virus (HCV) subtypes, and phylogenetic clades defined by comparative genome hybridization (CGH) data for Campylobacter jejuni, Yersinia enterocolitica and Clostridium difficile. CONCLUSION: Not-N analysis is effective for identifying small sets of genetic markers diagnostic for microbial sub-groups. The best results to date have been obtained with CGH data from several bacterial species, and HCV sequence data.
format Text
id pubmed-1973086
institution National Center for Biotechnology Information
language English
publishDate 2007
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-19730862007-09-08 Computer-aided identification of polymorphism sets diagnostic for groups of bacterial and viral genetic variants Price, Erin P Inman-Bamber, John Thiruvenkataswamy, Venugopal Huygens, Flavia Giffard, Philip M BMC Bioinformatics Methodology Article BACKGROUND: Single nucleotide polymorphisms (SNPs) and genes that exhibit presence/absence variation have provided informative marker sets for bacterial and viral genotyping. Identification of marker sets optimised for these purposes has been based on maximal generalized discriminatory power as measured by Simpson's Index of Diversity, or on the ability to identify specific variants. Here we describe the Not-N algorithm, which is designed to identify small sets of genetic markers diagnostic for user-specified subsets of known genetic variants. The algorithm does not treat the user-specified subset and the remaining genetic variants equally. Rather Not-N analysis is designed to underpin assays that provide 0% false negatives, which is very important for e.g. diagnostic procedures for clinically significant subgroups within microbial species. RESULTS: The Not-N algorithm has been incorporated into the "Minimum SNPs" computer program and used to derive genetic markers diagnostic for multilocus sequence typing-defined clonal complexes, hepatitis C virus (HCV) subtypes, and phylogenetic clades defined by comparative genome hybridization (CGH) data for Campylobacter jejuni, Yersinia enterocolitica and Clostridium difficile. CONCLUSION: Not-N analysis is effective for identifying small sets of genetic markers diagnostic for microbial sub-groups. The best results to date have been obtained with CGH data from several bacterial species, and HCV sequence data. BioMed Central 2007-08-01 /pmc/articles/PMC1973086/ /pubmed/17672919 http://dx.doi.org/10.1186/1471-2105-8-278 Text en Copyright © 2007 Price et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Price, Erin P
Inman-Bamber, John
Thiruvenkataswamy, Venugopal
Huygens, Flavia
Giffard, Philip M
Computer-aided identification of polymorphism sets diagnostic for groups of bacterial and viral genetic variants
title Computer-aided identification of polymorphism sets diagnostic for groups of bacterial and viral genetic variants
title_full Computer-aided identification of polymorphism sets diagnostic for groups of bacterial and viral genetic variants
title_fullStr Computer-aided identification of polymorphism sets diagnostic for groups of bacterial and viral genetic variants
title_full_unstemmed Computer-aided identification of polymorphism sets diagnostic for groups of bacterial and viral genetic variants
title_short Computer-aided identification of polymorphism sets diagnostic for groups of bacterial and viral genetic variants
title_sort computer-aided identification of polymorphism sets diagnostic for groups of bacterial and viral genetic variants
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1973086/
https://www.ncbi.nlm.nih.gov/pubmed/17672919
http://dx.doi.org/10.1186/1471-2105-8-278
work_keys_str_mv AT priceerinp computeraidedidentificationofpolymorphismsetsdiagnosticforgroupsofbacterialandviralgeneticvariants
AT inmanbamberjohn computeraidedidentificationofpolymorphismsetsdiagnosticforgroupsofbacterialandviralgeneticvariants
AT thiruvenkataswamyvenugopal computeraidedidentificationofpolymorphismsetsdiagnosticforgroupsofbacterialandviralgeneticvariants
AT huygensflavia computeraidedidentificationofpolymorphismsetsdiagnosticforgroupsofbacterialandviralgeneticvariants
AT giffardphilipm computeraidedidentificationofpolymorphismsetsdiagnosticforgroupsofbacterialandviralgeneticvariants