Cargando…

Strong spurious transcription likely contributes to DNA insert bias in typical metagenomic clone libraries

BACKGROUND: Clone libraries provide researchers with a powerful resource to study nucleic acid from diverse sources. Metagenomic clone libraries in particular have aided in studies of microbial biodiversity and function, and allowed the mining of novel enzymes. Libraries are often constructed by clo...

Descripción completa

Detalles Bibliográficos
Autores principales: Lam, Kathy N., Charles, Trevor C.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4459075/
https://www.ncbi.nlm.nih.gov/pubmed/26056565
http://dx.doi.org/10.1186/s40168-015-0086-5
_version_ 1782375165516054528
author Lam, Kathy N.
Charles, Trevor C.
author_facet Lam, Kathy N.
Charles, Trevor C.
author_sort Lam, Kathy N.
collection PubMed
description BACKGROUND: Clone libraries provide researchers with a powerful resource to study nucleic acid from diverse sources. Metagenomic clone libraries in particular have aided in studies of microbial biodiversity and function, and allowed the mining of novel enzymes. Libraries are often constructed by cloning large inserts into cosmid or fosmid vectors. Recently, there have been reports of GC bias in fosmid metagenomic libraries, and it was speculated to be a result of fragmentation and loss of AT-rich sequences during cloning. However, evidence in the literature suggests that transcriptional activity or gene product toxicity may play a role. RESULTS: To explore possible mechanisms responsible for sequence bias in clone libraries, we constructed a cosmid library from a human microbiome sample and sequenced DNA from different steps during library construction: crude extract DNA, size-selected DNA, and cosmid library DNA. We confirmed a GC bias in the final cosmid library, and we provide evidence that the bias is not due to fragmentation and loss of AT-rich sequences but is likely occurring after DNA is introduced into Escherichia coli. To investigate the influence of strong constitutive transcription, we searched the sequence data for promoters and found that rpoD/σ(70) promoter sequences were underrepresented in the cosmid library. Furthermore, when we examined the genomes of taxa that were differentially abundant in the cosmid library relative to the original sample, we found the bias to be more correlated with the number of rpoD/σ(70) consensus sequences in the genome than with simple GC content. CONCLUSIONS: The GC bias of metagenomic libraries does not appear to be due to DNA fragmentation. Rather, analysis of promoter sequences provides support for the hypothesis that strong constitutive transcription from sequences recognized as rpoD/σ(70) consensus-like in E. coli may lead to instability, causing loss of the plasmid or loss of the insert DNA that gives rise to the transcription. Despite widespread use of E. coli to propagate foreign DNA in metagenomic libraries, the effects of in vivo transcriptional activity on clone stability are not well understood. Further work is required to tease apart the effects of transcription from those of gene product toxicity. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s40168-015-0086-5) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4459075
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-44590752015-06-09 Strong spurious transcription likely contributes to DNA insert bias in typical metagenomic clone libraries Lam, Kathy N. Charles, Trevor C. Microbiome Research BACKGROUND: Clone libraries provide researchers with a powerful resource to study nucleic acid from diverse sources. Metagenomic clone libraries in particular have aided in studies of microbial biodiversity and function, and allowed the mining of novel enzymes. Libraries are often constructed by cloning large inserts into cosmid or fosmid vectors. Recently, there have been reports of GC bias in fosmid metagenomic libraries, and it was speculated to be a result of fragmentation and loss of AT-rich sequences during cloning. However, evidence in the literature suggests that transcriptional activity or gene product toxicity may play a role. RESULTS: To explore possible mechanisms responsible for sequence bias in clone libraries, we constructed a cosmid library from a human microbiome sample and sequenced DNA from different steps during library construction: crude extract DNA, size-selected DNA, and cosmid library DNA. We confirmed a GC bias in the final cosmid library, and we provide evidence that the bias is not due to fragmentation and loss of AT-rich sequences but is likely occurring after DNA is introduced into Escherichia coli. To investigate the influence of strong constitutive transcription, we searched the sequence data for promoters and found that rpoD/σ(70) promoter sequences were underrepresented in the cosmid library. Furthermore, when we examined the genomes of taxa that were differentially abundant in the cosmid library relative to the original sample, we found the bias to be more correlated with the number of rpoD/σ(70) consensus sequences in the genome than with simple GC content. CONCLUSIONS: The GC bias of metagenomic libraries does not appear to be due to DNA fragmentation. Rather, analysis of promoter sequences provides support for the hypothesis that strong constitutive transcription from sequences recognized as rpoD/σ(70) consensus-like in E. coli may lead to instability, causing loss of the plasmid or loss of the insert DNA that gives rise to the transcription. Despite widespread use of E. coli to propagate foreign DNA in metagenomic libraries, the effects of in vivo transcriptional activity on clone stability are not well understood. Further work is required to tease apart the effects of transcription from those of gene product toxicity. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s40168-015-0086-5) contains supplementary material, which is available to authorized users. BioMed Central 2015-05-20 /pmc/articles/PMC4459075/ /pubmed/26056565 http://dx.doi.org/10.1186/s40168-015-0086-5 Text en © Lam and Charles. 2015 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Lam, Kathy N.
Charles, Trevor C.
Strong spurious transcription likely contributes to DNA insert bias in typical metagenomic clone libraries
title Strong spurious transcription likely contributes to DNA insert bias in typical metagenomic clone libraries
title_full Strong spurious transcription likely contributes to DNA insert bias in typical metagenomic clone libraries
title_fullStr Strong spurious transcription likely contributes to DNA insert bias in typical metagenomic clone libraries
title_full_unstemmed Strong spurious transcription likely contributes to DNA insert bias in typical metagenomic clone libraries
title_short Strong spurious transcription likely contributes to DNA insert bias in typical metagenomic clone libraries
title_sort strong spurious transcription likely contributes to dna insert bias in typical metagenomic clone libraries
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4459075/
https://www.ncbi.nlm.nih.gov/pubmed/26056565
http://dx.doi.org/10.1186/s40168-015-0086-5
work_keys_str_mv AT lamkathyn strongspurioustranscriptionlikelycontributestodnainsertbiasintypicalmetagenomicclonelibraries
AT charlestrevorc strongspurioustranscriptionlikelycontributestodnainsertbiasintypicalmetagenomicclonelibraries