Cargando…

ArrayOme: a program for estimating the sizes of microarray-visualized bacterial genomes

ArrayOme is a new program that calculates the size of genomes represented by microarray-based probes and facilitates recognition of key bacterial strains carrying large numbers of novel genes. Protein-coding sequences (CDS) that are contiguous on annotated reference templates and classified as ‘Pres...

Descripción completa

Detalles Bibliográficos
Autores principales: Ou, Hong-Yu, Smith, Rebecca, Lucchini, Sacha, Hinton, Jay, Chaudhuri, Roy R., Pallen, Mark, Barer, Michael R., Rajakumar, Kumar
Formato: Texto
Lenguaje:English
Publicado: Oxford University Press 2005
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC546176/
https://www.ncbi.nlm.nih.gov/pubmed/15640440
http://dx.doi.org/10.1093/nar/gni005
_version_ 1782122245820252160
author Ou, Hong-Yu
Smith, Rebecca
Lucchini, Sacha
Hinton, Jay
Chaudhuri, Roy R.
Pallen, Mark
Barer, Michael R.
Rajakumar, Kumar
author_facet Ou, Hong-Yu
Smith, Rebecca
Lucchini, Sacha
Hinton, Jay
Chaudhuri, Roy R.
Pallen, Mark
Barer, Michael R.
Rajakumar, Kumar
author_sort Ou, Hong-Yu
collection PubMed
description ArrayOme is a new program that calculates the size of genomes represented by microarray-based probes and facilitates recognition of key bacterial strains carrying large numbers of novel genes. Protein-coding sequences (CDS) that are contiguous on annotated reference templates and classified as ‘Present’ in the test strain by hybridization to microarrays are merged into ICs (ICs). These ICs are then extended to account for flanking intergenic sequences. Finally, the lengths of all extended ICs are summated to yield the ‘microarray-visualized genome (MVG)’ size. We tested and validated ArrayOme using both experimental and in silico-generated genomic hybridization data. MVG sizing of five sequenced Escherichia coli and Shigella strains resulted in an accuracy of 97–99%, as compared to true genome sizes, when the comprehensive ShE.coli meta-array gene sequences (6239 CDS) were used for in silico hybridization analysis. However, the E.coli CFT073 genome size was underestimated by 14% as this meta-array lacked probes for many CFT073 CDS. ArrayOme permits rapid recognition of discordances between PFGE-measured genome and MVG sizes, thereby enabling high-throughput identification of strains rich in novel genes. Gene discovery studies focused on these strains will greatly facilitate characterization of the global gene pool accessible to individual bacterial species.
format Text
id pubmed-546176
institution National Center for Biotechnology Information
language English
publishDate 2005
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-5461762005-02-07 ArrayOme: a program for estimating the sizes of microarray-visualized bacterial genomes Ou, Hong-Yu Smith, Rebecca Lucchini, Sacha Hinton, Jay Chaudhuri, Roy R. Pallen, Mark Barer, Michael R. Rajakumar, Kumar Nucleic Acids Res Methods Online ArrayOme is a new program that calculates the size of genomes represented by microarray-based probes and facilitates recognition of key bacterial strains carrying large numbers of novel genes. Protein-coding sequences (CDS) that are contiguous on annotated reference templates and classified as ‘Present’ in the test strain by hybridization to microarrays are merged into ICs (ICs). These ICs are then extended to account for flanking intergenic sequences. Finally, the lengths of all extended ICs are summated to yield the ‘microarray-visualized genome (MVG)’ size. We tested and validated ArrayOme using both experimental and in silico-generated genomic hybridization data. MVG sizing of five sequenced Escherichia coli and Shigella strains resulted in an accuracy of 97–99%, as compared to true genome sizes, when the comprehensive ShE.coli meta-array gene sequences (6239 CDS) were used for in silico hybridization analysis. However, the E.coli CFT073 genome size was underestimated by 14% as this meta-array lacked probes for many CFT073 CDS. ArrayOme permits rapid recognition of discordances between PFGE-measured genome and MVG sizes, thereby enabling high-throughput identification of strains rich in novel genes. Gene discovery studies focused on these strains will greatly facilitate characterization of the global gene pool accessible to individual bacterial species. Oxford University Press 2005 2005-01-07 /pmc/articles/PMC546176/ /pubmed/15640440 http://dx.doi.org/10.1093/nar/gni005 Text en © 2005, the authors Nucleic Acids Research, Vol. 33 No. 1 © Oxford University Press 2005; all rights reserved
spellingShingle Methods Online
Ou, Hong-Yu
Smith, Rebecca
Lucchini, Sacha
Hinton, Jay
Chaudhuri, Roy R.
Pallen, Mark
Barer, Michael R.
Rajakumar, Kumar
ArrayOme: a program for estimating the sizes of microarray-visualized bacterial genomes
title ArrayOme: a program for estimating the sizes of microarray-visualized bacterial genomes
title_full ArrayOme: a program for estimating the sizes of microarray-visualized bacterial genomes
title_fullStr ArrayOme: a program for estimating the sizes of microarray-visualized bacterial genomes
title_full_unstemmed ArrayOme: a program for estimating the sizes of microarray-visualized bacterial genomes
title_short ArrayOme: a program for estimating the sizes of microarray-visualized bacterial genomes
title_sort arrayome: a program for estimating the sizes of microarray-visualized bacterial genomes
topic Methods Online
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC546176/
https://www.ncbi.nlm.nih.gov/pubmed/15640440
http://dx.doi.org/10.1093/nar/gni005
work_keys_str_mv AT ouhongyu arrayomeaprogramforestimatingthesizesofmicroarrayvisualizedbacterialgenomes
AT smithrebecca arrayomeaprogramforestimatingthesizesofmicroarrayvisualizedbacterialgenomes
AT lucchinisacha arrayomeaprogramforestimatingthesizesofmicroarrayvisualizedbacterialgenomes
AT hintonjay arrayomeaprogramforestimatingthesizesofmicroarrayvisualizedbacterialgenomes
AT chaudhuriroyr arrayomeaprogramforestimatingthesizesofmicroarrayvisualizedbacterialgenomes
AT pallenmark arrayomeaprogramforestimatingthesizesofmicroarrayvisualizedbacterialgenomes
AT barermichaelr arrayomeaprogramforestimatingthesizesofmicroarrayvisualizedbacterialgenomes
AT rajakumarkumar arrayomeaprogramforestimatingthesizesofmicroarrayvisualizedbacterialgenomes