Cargando…

Using Genomic Sequencing for Classical Genetics in E. coli K12

We here develop computational methods to facilitate use of 454 whole genome shotgun sequencing to identify mutations in Escherichia coli K12. We had Roche sequence eight related strains derived as spontaneous mutants in a background without a whole genome sequence. They provided difference tables ba...

Descripción completa

Detalles Bibliográficos
Autores principales: Lyons, Eric, Freeling, Michael, Kustu, Sydney, Inwood, William
Formato: Texto
Lenguaje:English
Publicado: Public Library of Science 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3045373/
https://www.ncbi.nlm.nih.gov/pubmed/21364914
http://dx.doi.org/10.1371/journal.pone.0016717
_version_ 1782198823247937536
author Lyons, Eric
Freeling, Michael
Kustu, Sydney
Inwood, William
author_facet Lyons, Eric
Freeling, Michael
Kustu, Sydney
Inwood, William
author_sort Lyons, Eric
collection PubMed
description We here develop computational methods to facilitate use of 454 whole genome shotgun sequencing to identify mutations in Escherichia coli K12. We had Roche sequence eight related strains derived as spontaneous mutants in a background without a whole genome sequence. They provided difference tables based on assembling each genome to reference strain E. coli MG1655 (NC_000913). Due to the evolutionary distance to MG1655, these contained a large number of both false negatives and positives. By manual analysis of the dataset, we detected all the known mutations (24 at nine locations) and identified and genetically confirmed new mutations necessary and sufficient for the phenotypes we had selected in four strains. We then had Roche assemble contigs de novo, which we further assembled to full-length pseudomolecules based on synteny with MG1655. This hybrid method facilitated detection of insertion mutations and allowed annotation from MG1655. After removing one genome with less than the optimal 20- to 30-fold sequence coverage, we identified 544 putative polymorphisms that included all of the known and selected mutations apart from insertions. Finally, we detected seven new mutations in a total of only 41 candidates by comparing single genomes to composite data for the remaining six and using a ranking system to penalize homopolymer sequencing and misassembly errors. An additional benefit of the analysis is a table of differences between MG1655 and a physiologically robust E. coli wild-type strain NCM3722. Both projects were greatly facilitated by use of comparative genomics tools in the CoGe software package (http://genomevolution.org/).
format Text
id pubmed-3045373
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-30453732011-03-01 Using Genomic Sequencing for Classical Genetics in E. coli K12 Lyons, Eric Freeling, Michael Kustu, Sydney Inwood, William PLoS One Research Article We here develop computational methods to facilitate use of 454 whole genome shotgun sequencing to identify mutations in Escherichia coli K12. We had Roche sequence eight related strains derived as spontaneous mutants in a background without a whole genome sequence. They provided difference tables based on assembling each genome to reference strain E. coli MG1655 (NC_000913). Due to the evolutionary distance to MG1655, these contained a large number of both false negatives and positives. By manual analysis of the dataset, we detected all the known mutations (24 at nine locations) and identified and genetically confirmed new mutations necessary and sufficient for the phenotypes we had selected in four strains. We then had Roche assemble contigs de novo, which we further assembled to full-length pseudomolecules based on synteny with MG1655. This hybrid method facilitated detection of insertion mutations and allowed annotation from MG1655. After removing one genome with less than the optimal 20- to 30-fold sequence coverage, we identified 544 putative polymorphisms that included all of the known and selected mutations apart from insertions. Finally, we detected seven new mutations in a total of only 41 candidates by comparing single genomes to composite data for the remaining six and using a ranking system to penalize homopolymer sequencing and misassembly errors. An additional benefit of the analysis is a table of differences between MG1655 and a physiologically robust E. coli wild-type strain NCM3722. Both projects were greatly facilitated by use of comparative genomics tools in the CoGe software package (http://genomevolution.org/). Public Library of Science 2011-02-25 /pmc/articles/PMC3045373/ /pubmed/21364914 http://dx.doi.org/10.1371/journal.pone.0016717 Text en Lyons et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Lyons, Eric
Freeling, Michael
Kustu, Sydney
Inwood, William
Using Genomic Sequencing for Classical Genetics in E. coli K12
title Using Genomic Sequencing for Classical Genetics in E. coli K12
title_full Using Genomic Sequencing for Classical Genetics in E. coli K12
title_fullStr Using Genomic Sequencing for Classical Genetics in E. coli K12
title_full_unstemmed Using Genomic Sequencing for Classical Genetics in E. coli K12
title_short Using Genomic Sequencing for Classical Genetics in E. coli K12
title_sort using genomic sequencing for classical genetics in e. coli k12
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3045373/
https://www.ncbi.nlm.nih.gov/pubmed/21364914
http://dx.doi.org/10.1371/journal.pone.0016717
work_keys_str_mv AT lyonseric usinggenomicsequencingforclassicalgeneticsinecolik12
AT freelingmichael usinggenomicsequencingforclassicalgeneticsinecolik12
AT kustusydney usinggenomicsequencingforclassicalgeneticsinecolik12
AT inwoodwilliam usinggenomicsequencingforclassicalgeneticsinecolik12