Cargando…
Computer-aided identification of polymorphism sets diagnostic for groups of bacterial and viral genetic variants
BACKGROUND: Single nucleotide polymorphisms (SNPs) and genes that exhibit presence/absence variation have provided informative marker sets for bacterial and viral genotyping. Identification of marker sets optimised for these purposes has been based on maximal generalized discriminatory power as meas...
Autores principales: | , , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2007
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1973086/ https://www.ncbi.nlm.nih.gov/pubmed/17672919 http://dx.doi.org/10.1186/1471-2105-8-278 |
_version_ | 1782135036452012032 |
---|---|
author | Price, Erin P Inman-Bamber, John Thiruvenkataswamy, Venugopal Huygens, Flavia Giffard, Philip M |
author_facet | Price, Erin P Inman-Bamber, John Thiruvenkataswamy, Venugopal Huygens, Flavia Giffard, Philip M |
author_sort | Price, Erin P |
collection | PubMed |
description | BACKGROUND: Single nucleotide polymorphisms (SNPs) and genes that exhibit presence/absence variation have provided informative marker sets for bacterial and viral genotyping. Identification of marker sets optimised for these purposes has been based on maximal generalized discriminatory power as measured by Simpson's Index of Diversity, or on the ability to identify specific variants. Here we describe the Not-N algorithm, which is designed to identify small sets of genetic markers diagnostic for user-specified subsets of known genetic variants. The algorithm does not treat the user-specified subset and the remaining genetic variants equally. Rather Not-N analysis is designed to underpin assays that provide 0% false negatives, which is very important for e.g. diagnostic procedures for clinically significant subgroups within microbial species. RESULTS: The Not-N algorithm has been incorporated into the "Minimum SNPs" computer program and used to derive genetic markers diagnostic for multilocus sequence typing-defined clonal complexes, hepatitis C virus (HCV) subtypes, and phylogenetic clades defined by comparative genome hybridization (CGH) data for Campylobacter jejuni, Yersinia enterocolitica and Clostridium difficile. CONCLUSION: Not-N analysis is effective for identifying small sets of genetic markers diagnostic for microbial sub-groups. The best results to date have been obtained with CGH data from several bacterial species, and HCV sequence data. |
format | Text |
id | pubmed-1973086 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2007 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-19730862007-09-08 Computer-aided identification of polymorphism sets diagnostic for groups of bacterial and viral genetic variants Price, Erin P Inman-Bamber, John Thiruvenkataswamy, Venugopal Huygens, Flavia Giffard, Philip M BMC Bioinformatics Methodology Article BACKGROUND: Single nucleotide polymorphisms (SNPs) and genes that exhibit presence/absence variation have provided informative marker sets for bacterial and viral genotyping. Identification of marker sets optimised for these purposes has been based on maximal generalized discriminatory power as measured by Simpson's Index of Diversity, or on the ability to identify specific variants. Here we describe the Not-N algorithm, which is designed to identify small sets of genetic markers diagnostic for user-specified subsets of known genetic variants. The algorithm does not treat the user-specified subset and the remaining genetic variants equally. Rather Not-N analysis is designed to underpin assays that provide 0% false negatives, which is very important for e.g. diagnostic procedures for clinically significant subgroups within microbial species. RESULTS: The Not-N algorithm has been incorporated into the "Minimum SNPs" computer program and used to derive genetic markers diagnostic for multilocus sequence typing-defined clonal complexes, hepatitis C virus (HCV) subtypes, and phylogenetic clades defined by comparative genome hybridization (CGH) data for Campylobacter jejuni, Yersinia enterocolitica and Clostridium difficile. CONCLUSION: Not-N analysis is effective for identifying small sets of genetic markers diagnostic for microbial sub-groups. The best results to date have been obtained with CGH data from several bacterial species, and HCV sequence data. BioMed Central 2007-08-01 /pmc/articles/PMC1973086/ /pubmed/17672919 http://dx.doi.org/10.1186/1471-2105-8-278 Text en Copyright © 2007 Price et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Methodology Article Price, Erin P Inman-Bamber, John Thiruvenkataswamy, Venugopal Huygens, Flavia Giffard, Philip M Computer-aided identification of polymorphism sets diagnostic for groups of bacterial and viral genetic variants |
title | Computer-aided identification of polymorphism sets diagnostic for groups of bacterial and viral genetic variants |
title_full | Computer-aided identification of polymorphism sets diagnostic for groups of bacterial and viral genetic variants |
title_fullStr | Computer-aided identification of polymorphism sets diagnostic for groups of bacterial and viral genetic variants |
title_full_unstemmed | Computer-aided identification of polymorphism sets diagnostic for groups of bacterial and viral genetic variants |
title_short | Computer-aided identification of polymorphism sets diagnostic for groups of bacterial and viral genetic variants |
title_sort | computer-aided identification of polymorphism sets diagnostic for groups of bacterial and viral genetic variants |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1973086/ https://www.ncbi.nlm.nih.gov/pubmed/17672919 http://dx.doi.org/10.1186/1471-2105-8-278 |
work_keys_str_mv | AT priceerinp computeraidedidentificationofpolymorphismsetsdiagnosticforgroupsofbacterialandviralgeneticvariants AT inmanbamberjohn computeraidedidentificationofpolymorphismsetsdiagnosticforgroupsofbacterialandviralgeneticvariants AT thiruvenkataswamyvenugopal computeraidedidentificationofpolymorphismsetsdiagnosticforgroupsofbacterialandviralgeneticvariants AT huygensflavia computeraidedidentificationofpolymorphismsetsdiagnosticforgroupsofbacterialandviralgeneticvariants AT giffardphilipm computeraidedidentificationofpolymorphismsetsdiagnosticforgroupsofbacterialandviralgeneticvariants |