Cargando…

Filtering "genic" open reading frames from genomic DNA samples for advanced annotation

BACKGROUND: In order to carry out experimental gene annotation, DNA encoding open reading frames (ORFs) derived from real genes (termed "genic") in the correct frame is required. When genes are correctly assigned, isolation of genic DNA for functional annotation can be carried out by PCR....

Descripción completa

Detalles Bibliográficos
Autores principales: D'Angelo, Sara, Velappan, Nileena, Mignone, Flavio, Santoro, Claudio, Sblattero, Daniele, Kiss, Csaba, Bradbury, Andrew RM
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3223728/
https://www.ncbi.nlm.nih.gov/pubmed/21810207
http://dx.doi.org/10.1186/1471-2164-12-S1-S5
_version_ 1782217315775938560
author D'Angelo, Sara
Velappan, Nileena
Mignone, Flavio
Santoro, Claudio
Sblattero, Daniele
Kiss, Csaba
Bradbury, Andrew RM
author_facet D'Angelo, Sara
Velappan, Nileena
Mignone, Flavio
Santoro, Claudio
Sblattero, Daniele
Kiss, Csaba
Bradbury, Andrew RM
author_sort D'Angelo, Sara
collection PubMed
description BACKGROUND: In order to carry out experimental gene annotation, DNA encoding open reading frames (ORFs) derived from real genes (termed "genic") in the correct frame is required. When genes are correctly assigned, isolation of genic DNA for functional annotation can be carried out by PCR. However, not all genes are correctly assigned, and even when correctly assigned, gene products are often incorrectly folded when expressed in heterologous hosts. This is a problem that can sometimes be overcome by the expression of protein fragments encoding domains, rather than full-length proteins. One possible method to isolate DNA encoding such domains would to "filter" complex DNA (cDNA libraries, genomic and metagenomic DNA) for gene fragments that confer a selectable phenotype relying on correct folding, with all such domains present in a complex DNA sample, termed the “domainome”. RESULTS: In this paper we discuss the preparation of diverse genic ORF libraries from randomly fragmented genomic DNA using ß-lactamase to filter out the open reading frames. By cloning DNA fragments between leader sequences and the mature ß-lactamase gene, colonies can be selected for resistance to ampicillin, conferred by correct folding of the lactamase gene. Our experiments demonstrate that the majority of surviving colonies contain genic open reading frames, suggesting that ß-lactamase is acting as a selectable folding reporter. Furthermore, different leaders (Sec, TAT and SRP), normally translocating different protein classes, filter different genic fragment subsets, indicating that their use increases the fraction of the “domainone” that is accessible. CONCLUSIONS: The availability of ORF libraries, obtained with the filtering method described here, combined with screening methods such as phage display and protein-protein interaction studies, or with protein structure determination projects, can lead to the identification and structural determination of functional genic ORFs. ORF libraries represent, moreover, a useful tool to proceed towards high-throughput functional annotation of newly sequenced genomes.
format Online
Article
Text
id pubmed-3223728
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-32237282011-11-26 Filtering "genic" open reading frames from genomic DNA samples for advanced annotation D'Angelo, Sara Velappan, Nileena Mignone, Flavio Santoro, Claudio Sblattero, Daniele Kiss, Csaba Bradbury, Andrew RM BMC Genomics Research BACKGROUND: In order to carry out experimental gene annotation, DNA encoding open reading frames (ORFs) derived from real genes (termed "genic") in the correct frame is required. When genes are correctly assigned, isolation of genic DNA for functional annotation can be carried out by PCR. However, not all genes are correctly assigned, and even when correctly assigned, gene products are often incorrectly folded when expressed in heterologous hosts. This is a problem that can sometimes be overcome by the expression of protein fragments encoding domains, rather than full-length proteins. One possible method to isolate DNA encoding such domains would to "filter" complex DNA (cDNA libraries, genomic and metagenomic DNA) for gene fragments that confer a selectable phenotype relying on correct folding, with all such domains present in a complex DNA sample, termed the “domainome”. RESULTS: In this paper we discuss the preparation of diverse genic ORF libraries from randomly fragmented genomic DNA using ß-lactamase to filter out the open reading frames. By cloning DNA fragments between leader sequences and the mature ß-lactamase gene, colonies can be selected for resistance to ampicillin, conferred by correct folding of the lactamase gene. Our experiments demonstrate that the majority of surviving colonies contain genic open reading frames, suggesting that ß-lactamase is acting as a selectable folding reporter. Furthermore, different leaders (Sec, TAT and SRP), normally translocating different protein classes, filter different genic fragment subsets, indicating that their use increases the fraction of the “domainone” that is accessible. CONCLUSIONS: The availability of ORF libraries, obtained with the filtering method described here, combined with screening methods such as phage display and protein-protein interaction studies, or with protein structure determination projects, can lead to the identification and structural determination of functional genic ORFs. ORF libraries represent, moreover, a useful tool to proceed towards high-throughput functional annotation of newly sequenced genomes. BioMed Central 2011-06-15 /pmc/articles/PMC3223728/ /pubmed/21810207 http://dx.doi.org/10.1186/1471-2164-12-S1-S5 Text en Copyright ©2011 D'Angelo et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research
D'Angelo, Sara
Velappan, Nileena
Mignone, Flavio
Santoro, Claudio
Sblattero, Daniele
Kiss, Csaba
Bradbury, Andrew RM
Filtering "genic" open reading frames from genomic DNA samples for advanced annotation
title Filtering "genic" open reading frames from genomic DNA samples for advanced annotation
title_full Filtering "genic" open reading frames from genomic DNA samples for advanced annotation
title_fullStr Filtering "genic" open reading frames from genomic DNA samples for advanced annotation
title_full_unstemmed Filtering "genic" open reading frames from genomic DNA samples for advanced annotation
title_short Filtering "genic" open reading frames from genomic DNA samples for advanced annotation
title_sort filtering "genic" open reading frames from genomic dna samples for advanced annotation
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3223728/
https://www.ncbi.nlm.nih.gov/pubmed/21810207
http://dx.doi.org/10.1186/1471-2164-12-S1-S5
work_keys_str_mv AT dangelosara filteringgenicopenreadingframesfromgenomicdnasamplesforadvancedannotation
AT velappannileena filteringgenicopenreadingframesfromgenomicdnasamplesforadvancedannotation
AT mignoneflavio filteringgenicopenreadingframesfromgenomicdnasamplesforadvancedannotation
AT santoroclaudio filteringgenicopenreadingframesfromgenomicdnasamplesforadvancedannotation
AT sblatterodaniele filteringgenicopenreadingframesfromgenomicdnasamplesforadvancedannotation
AT kisscsaba filteringgenicopenreadingframesfromgenomicdnasamplesforadvancedannotation
AT bradburyandrewrm filteringgenicopenreadingframesfromgenomicdnasamplesforadvancedannotation