Cargando…

Phenotype Sequencing: Identifying the Genes That Cause a Phenotype Directly from Pooled Sequencing of Independent Mutants

Random mutagenesis and phenotype screening provide a powerful method for dissecting microbial functions, but their results can be laborious to analyze experimentally. Each mutant strain may contain 50–100 random mutations, necessitating extensive functional experiments to determine which one causes...

Descripción completa

Detalles Bibliográficos
Autores principales: Harper, Marc A., Chen, Zugen, Toy, Traci, Machado, Iara M. P., Nelson, Stanley F., Liao, James C., Lee, Christopher J.
Formato: Texto
Lenguaje:English
Publicado: Public Library of Science 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3041756/
https://www.ncbi.nlm.nih.gov/pubmed/21364744
http://dx.doi.org/10.1371/journal.pone.0016517
_version_ 1782198474205298688
author Harper, Marc A.
Chen, Zugen
Toy, Traci
Machado, Iara M. P.
Nelson, Stanley F.
Liao, James C.
Lee, Christopher J.
author_facet Harper, Marc A.
Chen, Zugen
Toy, Traci
Machado, Iara M. P.
Nelson, Stanley F.
Liao, James C.
Lee, Christopher J.
author_sort Harper, Marc A.
collection PubMed
description Random mutagenesis and phenotype screening provide a powerful method for dissecting microbial functions, but their results can be laborious to analyze experimentally. Each mutant strain may contain 50–100 random mutations, necessitating extensive functional experiments to determine which one causes the selected phenotype. To solve this problem, we propose a “Phenotype Sequencing” approach in which genes causing the phenotype can be identified directly from sequencing of multiple independent mutants. We developed a new computational analysis method showing that 1. causal genes can be identified with high probability from even a modest number of mutant genomes; 2. costs can be cut many-fold compared with a conventional genome sequencing approach via an optimized strategy of library-pooling (multiple strains per library) and tag-pooling (multiple tagged libraries per sequencing lane). We have performed extensive validation experiments on a set of E. coli mutants with increased isobutanol biofuel tolerance. We generated a range of sequencing experiments varying from 3 to 32 mutant strains, with pooling on 1 to 3 sequencing lanes. Our statistical analysis of these data (4099 mutations from 32 mutant genomes) successfully identified 3 genes (acrB, marC, acrA) that have been independently validated as causing this experimental phenotype. It must be emphasized that our approach reduces mutant sequencing costs enormously. Whereas a conventional genome sequencing experiment would have cost $7,200 in reagents alone, our Phenotype Sequencing design yielded the same information value for only $1200. In fact, our smallest experiments reliably identified acrB and marC at a cost of only $110–$340.
format Text
id pubmed-3041756
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-30417562011-03-01 Phenotype Sequencing: Identifying the Genes That Cause a Phenotype Directly from Pooled Sequencing of Independent Mutants Harper, Marc A. Chen, Zugen Toy, Traci Machado, Iara M. P. Nelson, Stanley F. Liao, James C. Lee, Christopher J. PLoS One Research Article Random mutagenesis and phenotype screening provide a powerful method for dissecting microbial functions, but their results can be laborious to analyze experimentally. Each mutant strain may contain 50–100 random mutations, necessitating extensive functional experiments to determine which one causes the selected phenotype. To solve this problem, we propose a “Phenotype Sequencing” approach in which genes causing the phenotype can be identified directly from sequencing of multiple independent mutants. We developed a new computational analysis method showing that 1. causal genes can be identified with high probability from even a modest number of mutant genomes; 2. costs can be cut many-fold compared with a conventional genome sequencing approach via an optimized strategy of library-pooling (multiple strains per library) and tag-pooling (multiple tagged libraries per sequencing lane). We have performed extensive validation experiments on a set of E. coli mutants with increased isobutanol biofuel tolerance. We generated a range of sequencing experiments varying from 3 to 32 mutant strains, with pooling on 1 to 3 sequencing lanes. Our statistical analysis of these data (4099 mutations from 32 mutant genomes) successfully identified 3 genes (acrB, marC, acrA) that have been independently validated as causing this experimental phenotype. It must be emphasized that our approach reduces mutant sequencing costs enormously. Whereas a conventional genome sequencing experiment would have cost $7,200 in reagents alone, our Phenotype Sequencing design yielded the same information value for only $1200. In fact, our smallest experiments reliably identified acrB and marC at a cost of only $110–$340. Public Library of Science 2011-02-18 /pmc/articles/PMC3041756/ /pubmed/21364744 http://dx.doi.org/10.1371/journal.pone.0016517 Text en Harper et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Harper, Marc A.
Chen, Zugen
Toy, Traci
Machado, Iara M. P.
Nelson, Stanley F.
Liao, James C.
Lee, Christopher J.
Phenotype Sequencing: Identifying the Genes That Cause a Phenotype Directly from Pooled Sequencing of Independent Mutants
title Phenotype Sequencing: Identifying the Genes That Cause a Phenotype Directly from Pooled Sequencing of Independent Mutants
title_full Phenotype Sequencing: Identifying the Genes That Cause a Phenotype Directly from Pooled Sequencing of Independent Mutants
title_fullStr Phenotype Sequencing: Identifying the Genes That Cause a Phenotype Directly from Pooled Sequencing of Independent Mutants
title_full_unstemmed Phenotype Sequencing: Identifying the Genes That Cause a Phenotype Directly from Pooled Sequencing of Independent Mutants
title_short Phenotype Sequencing: Identifying the Genes That Cause a Phenotype Directly from Pooled Sequencing of Independent Mutants
title_sort phenotype sequencing: identifying the genes that cause a phenotype directly from pooled sequencing of independent mutants
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3041756/
https://www.ncbi.nlm.nih.gov/pubmed/21364744
http://dx.doi.org/10.1371/journal.pone.0016517
work_keys_str_mv AT harpermarca phenotypesequencingidentifyingthegenesthatcauseaphenotypedirectlyfrompooledsequencingofindependentmutants
AT chenzugen phenotypesequencingidentifyingthegenesthatcauseaphenotypedirectlyfrompooledsequencingofindependentmutants
AT toytraci phenotypesequencingidentifyingthegenesthatcauseaphenotypedirectlyfrompooledsequencingofindependentmutants
AT machadoiaramp phenotypesequencingidentifyingthegenesthatcauseaphenotypedirectlyfrompooledsequencingofindependentmutants
AT nelsonstanleyf phenotypesequencingidentifyingthegenesthatcauseaphenotypedirectlyfrompooledsequencingofindependentmutants
AT liaojamesc phenotypesequencingidentifyingthegenesthatcauseaphenotypedirectlyfrompooledsequencingofindependentmutants
AT leechristopherj phenotypesequencingidentifyingthegenesthatcauseaphenotypedirectlyfrompooledsequencingofindependentmutants