Cargando…
Improved analytical methods for microarray-based genome-composition analysis
BACKGROUND: Whereas genome sequencing has given us high-resolution pictures of many different species of bacteria, microarrays provide a means of obtaining information on genome composition for many strains of a given species. Genome-composition analysis using microarrays, or 'genomotyping'...
Autores principales: | , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2002
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC133449/ https://www.ncbi.nlm.nih.gov/pubmed/12429064 |
_version_ | 1782120395504091136 |
---|---|
author | Kim, Charles C Joyce, Elizabeth A Chan, Kaman Falkow, Stanley |
author_facet | Kim, Charles C Joyce, Elizabeth A Chan, Kaman Falkow, Stanley |
author_sort | Kim, Charles C |
collection | PubMed |
description | BACKGROUND: Whereas genome sequencing has given us high-resolution pictures of many different species of bacteria, microarrays provide a means of obtaining information on genome composition for many strains of a given species. Genome-composition analysis using microarrays, or 'genomotyping', can be used to categorize genes into 'present' and 'divergent' categories based on the level of hybridization signal. This typically involves selecting a signal value that is used as a cutoff to discriminate present (high signal) and divergent (low signal) genes. Current methodology uses empirical determination of cutoffs for classification into these categories, but this methodology is subject to several problems that can result in the misclassification of many genes. RESULTS: We describe a method that depends on the shape of the signal-ratio distribution and does not require empirical determination of a cutoff. Moreover, the cutoff is determined on an array-to-array basis, accounting for variation in strain composition and hybridization quality. The algorithm also provides an estimate of the probability that any given gene is present, which provides a measure of confidence in the categorical assignments. CONCLUSIONS: Many genes previously classified as present using static methods are in fact divergent on the basis of microarray signal; this is corrected by our algorithm. We have reassigned hundreds of genes from previous genomotyping studies of Helicobacter pylori and Campylobacter jejuni strains, and expect that the algorithm should be widely applicable to genomotyping data. |
format | Text |
id | pubmed-133449 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2002 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-1334492003-01-07 Improved analytical methods for microarray-based genome-composition analysis Kim, Charles C Joyce, Elizabeth A Chan, Kaman Falkow, Stanley Genome Biol Research BACKGROUND: Whereas genome sequencing has given us high-resolution pictures of many different species of bacteria, microarrays provide a means of obtaining information on genome composition for many strains of a given species. Genome-composition analysis using microarrays, or 'genomotyping', can be used to categorize genes into 'present' and 'divergent' categories based on the level of hybridization signal. This typically involves selecting a signal value that is used as a cutoff to discriminate present (high signal) and divergent (low signal) genes. Current methodology uses empirical determination of cutoffs for classification into these categories, but this methodology is subject to several problems that can result in the misclassification of many genes. RESULTS: We describe a method that depends on the shape of the signal-ratio distribution and does not require empirical determination of a cutoff. Moreover, the cutoff is determined on an array-to-array basis, accounting for variation in strain composition and hybridization quality. The algorithm also provides an estimate of the probability that any given gene is present, which provides a measure of confidence in the categorical assignments. CONCLUSIONS: Many genes previously classified as present using static methods are in fact divergent on the basis of microarray signal; this is corrected by our algorithm. We have reassigned hundreds of genes from previous genomotyping studies of Helicobacter pylori and Campylobacter jejuni strains, and expect that the algorithm should be widely applicable to genomotyping data. BioMed Central 2002 2002-10-29 /pmc/articles/PMC133449/ /pubmed/12429064 Text en Copyright © 2002 Kim et al., licensee BioMed Central Ltd |
spellingShingle | Research Kim, Charles C Joyce, Elizabeth A Chan, Kaman Falkow, Stanley Improved analytical methods for microarray-based genome-composition analysis |
title | Improved analytical methods for microarray-based genome-composition analysis |
title_full | Improved analytical methods for microarray-based genome-composition analysis |
title_fullStr | Improved analytical methods for microarray-based genome-composition analysis |
title_full_unstemmed | Improved analytical methods for microarray-based genome-composition analysis |
title_short | Improved analytical methods for microarray-based genome-composition analysis |
title_sort | improved analytical methods for microarray-based genome-composition analysis |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC133449/ https://www.ncbi.nlm.nih.gov/pubmed/12429064 |
work_keys_str_mv | AT kimcharlesc improvedanalyticalmethodsformicroarraybasedgenomecompositionanalysis AT joyceelizabetha improvedanalyticalmethodsformicroarraybasedgenomecompositionanalysis AT chankaman improvedanalyticalmethodsformicroarraybasedgenomecompositionanalysis AT falkowstanley improvedanalyticalmethodsformicroarraybasedgenomecompositionanalysis |