Cargando…

ICDS database: interrupted CoDing sequences in prokaryotic genomes

Unrecognized frameshifts, in-frame stop codons and sequencing errors lead to Interrupted CoDing Sequence (ICDS) that can seriously affect all subsequent steps of functional characterization, from in silico analysis to high-throughput proteomic projects. Here, we describe the Interrupted CoDing Seque...

Descripción completa

Detalles Bibliográficos
Autores principales: Perrodou, Emmanuel, Deshayes, Caroline, Muller, Jean, Schaeffer, Christine, Van Dorsselaer, Alain, Ripp, Raymond, Poch, Olivier, Reyrat, Jean-Marc, Lecompte, Odile
Formato: Texto
Lenguaje:English
Publicado: Oxford University Press 2006
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1347423/
https://www.ncbi.nlm.nih.gov/pubmed/16381882
http://dx.doi.org/10.1093/nar/gkj060
_version_ 1782126614581084160
author Perrodou, Emmanuel
Deshayes, Caroline
Muller, Jean
Schaeffer, Christine
Van Dorsselaer, Alain
Ripp, Raymond
Poch, Olivier
Reyrat, Jean-Marc
Lecompte, Odile
author_facet Perrodou, Emmanuel
Deshayes, Caroline
Muller, Jean
Schaeffer, Christine
Van Dorsselaer, Alain
Ripp, Raymond
Poch, Olivier
Reyrat, Jean-Marc
Lecompte, Odile
author_sort Perrodou, Emmanuel
collection PubMed
description Unrecognized frameshifts, in-frame stop codons and sequencing errors lead to Interrupted CoDing Sequence (ICDS) that can seriously affect all subsequent steps of functional characterization, from in silico analysis to high-throughput proteomic projects. Here, we describe the Interrupted CoDing Sequence database containing ICDS detected by a similarity-based approach in 80 complete prokaryotic genomes. ICDS can be retrieved by species browsing or similarity searches via a web interface (). The definition of each interrupted gene is provided as well as the ICDS genomic localization with the surrounding sequence. Furthermore, to facilitate the experimental characterization of ICDS, we propose optimized primers for re-sequencing purposes. The database will be regularly updated with additional data from ongoing sequenced genomes. Our strategy has been validated by three independent tests: (i) ICDS prediction on a benchmark of artificially created frameshifts, (ii) comparison of predicted ICDS and results obtained from the comparison of the two genomic sequences of Bacillus licheniformis strain ATCC 14580 and (iii) re-sequencing of 25 predicted ICDS of the recently sequenced genome of Mycobacterium smegmatis. This allows us to estimate the specificity and sensitivity (95 and 82%, respectively) of our program and the efficiency of primer determination.
format Text
id pubmed-1347423
institution National Center for Biotechnology Information
language English
publishDate 2006
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-13474232006-01-25 ICDS database: interrupted CoDing sequences in prokaryotic genomes Perrodou, Emmanuel Deshayes, Caroline Muller, Jean Schaeffer, Christine Van Dorsselaer, Alain Ripp, Raymond Poch, Olivier Reyrat, Jean-Marc Lecompte, Odile Nucleic Acids Res Article Unrecognized frameshifts, in-frame stop codons and sequencing errors lead to Interrupted CoDing Sequence (ICDS) that can seriously affect all subsequent steps of functional characterization, from in silico analysis to high-throughput proteomic projects. Here, we describe the Interrupted CoDing Sequence database containing ICDS detected by a similarity-based approach in 80 complete prokaryotic genomes. ICDS can be retrieved by species browsing or similarity searches via a web interface (). The definition of each interrupted gene is provided as well as the ICDS genomic localization with the surrounding sequence. Furthermore, to facilitate the experimental characterization of ICDS, we propose optimized primers for re-sequencing purposes. The database will be regularly updated with additional data from ongoing sequenced genomes. Our strategy has been validated by three independent tests: (i) ICDS prediction on a benchmark of artificially created frameshifts, (ii) comparison of predicted ICDS and results obtained from the comparison of the two genomic sequences of Bacillus licheniformis strain ATCC 14580 and (iii) re-sequencing of 25 predicted ICDS of the recently sequenced genome of Mycobacterium smegmatis. This allows us to estimate the specificity and sensitivity (95 and 82%, respectively) of our program and the efficiency of primer determination. Oxford University Press 2006-01-01 2005-12-28 /pmc/articles/PMC1347423/ /pubmed/16381882 http://dx.doi.org/10.1093/nar/gkj060 Text en © The Author 2006. Published by Oxford University Press. All rights reserved
spellingShingle Article
Perrodou, Emmanuel
Deshayes, Caroline
Muller, Jean
Schaeffer, Christine
Van Dorsselaer, Alain
Ripp, Raymond
Poch, Olivier
Reyrat, Jean-Marc
Lecompte, Odile
ICDS database: interrupted CoDing sequences in prokaryotic genomes
title ICDS database: interrupted CoDing sequences in prokaryotic genomes
title_full ICDS database: interrupted CoDing sequences in prokaryotic genomes
title_fullStr ICDS database: interrupted CoDing sequences in prokaryotic genomes
title_full_unstemmed ICDS database: interrupted CoDing sequences in prokaryotic genomes
title_short ICDS database: interrupted CoDing sequences in prokaryotic genomes
title_sort icds database: interrupted coding sequences in prokaryotic genomes
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1347423/
https://www.ncbi.nlm.nih.gov/pubmed/16381882
http://dx.doi.org/10.1093/nar/gkj060
work_keys_str_mv AT perrodouemmanuel icdsdatabaseinterruptedcodingsequencesinprokaryoticgenomes
AT deshayescaroline icdsdatabaseinterruptedcodingsequencesinprokaryoticgenomes
AT mullerjean icdsdatabaseinterruptedcodingsequencesinprokaryoticgenomes
AT schaefferchristine icdsdatabaseinterruptedcodingsequencesinprokaryoticgenomes
AT vandorsselaeralain icdsdatabaseinterruptedcodingsequencesinprokaryoticgenomes
AT rippraymond icdsdatabaseinterruptedcodingsequencesinprokaryoticgenomes
AT pocholivier icdsdatabaseinterruptedcodingsequencesinprokaryoticgenomes
AT reyratjeanmarc icdsdatabaseinterruptedcodingsequencesinprokaryoticgenomes
AT lecompteodile icdsdatabaseinterruptedcodingsequencesinprokaryoticgenomes