Cargando…

De novo assembly and next-generation sequencing to analyse full-length gene variants from codon-barcoded libraries

Interpreting epistatic interactions is crucial for understanding evolutionary dynamics of complex genetic systems and unveiling structure and function of genetic pathways. Although high resolution mapping of en masse variant libraries renders molecular biologists to address genotype-phenotype relati...

Descripción completa

Detalles Bibliográficos
Autores principales: Cho, Namjin, Hwang, Byungjin, Yoon, Jung-ki, Park, Sangun, Lee, Joongoo, Seo, Han Na, Lee, Jeewon, Huh, Sunghoon, Chung, Jinsoo, Bang, Duhee
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Pub. Group 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4595759/
https://www.ncbi.nlm.nih.gov/pubmed/26387459
http://dx.doi.org/10.1038/ncomms9351
_version_ 1782393663134892032
author Cho, Namjin
Hwang, Byungjin
Yoon, Jung-ki
Park, Sangun
Lee, Joongoo
Seo, Han Na
Lee, Jeewon
Huh, Sunghoon
Chung, Jinsoo
Bang, Duhee
author_facet Cho, Namjin
Hwang, Byungjin
Yoon, Jung-ki
Park, Sangun
Lee, Joongoo
Seo, Han Na
Lee, Jeewon
Huh, Sunghoon
Chung, Jinsoo
Bang, Duhee
author_sort Cho, Namjin
collection PubMed
description Interpreting epistatic interactions is crucial for understanding evolutionary dynamics of complex genetic systems and unveiling structure and function of genetic pathways. Although high resolution mapping of en masse variant libraries renders molecular biologists to address genotype-phenotype relationships, long-read sequencing technology remains indispensable to assess functional relationship between mutations that lie far apart. Here, we introduce JigsawSeq for multiplexed sequence identification of pooled gene variant libraries by combining a codon-based molecular barcoding strategy and de novo assembly of short-read data. We first validate JigsawSeq on small sub-pools and observed high precision and recall at various experimental settings. With extensive simulations, we then apply JigsawSeq to large-scale gene variant libraries to show that our method can be reliably scaled using next-generation sequencing. JigsawSeq may serve as a rapid screening tool for functional genomics and offer the opportunity to explore evolutionary trajectories of protein variants.
format Online
Article
Text
id pubmed-4595759
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Nature Pub. Group
record_format MEDLINE/PubMed
spelling pubmed-45957592015-10-21 De novo assembly and next-generation sequencing to analyse full-length gene variants from codon-barcoded libraries Cho, Namjin Hwang, Byungjin Yoon, Jung-ki Park, Sangun Lee, Joongoo Seo, Han Na Lee, Jeewon Huh, Sunghoon Chung, Jinsoo Bang, Duhee Nat Commun Article Interpreting epistatic interactions is crucial for understanding evolutionary dynamics of complex genetic systems and unveiling structure and function of genetic pathways. Although high resolution mapping of en masse variant libraries renders molecular biologists to address genotype-phenotype relationships, long-read sequencing technology remains indispensable to assess functional relationship between mutations that lie far apart. Here, we introduce JigsawSeq for multiplexed sequence identification of pooled gene variant libraries by combining a codon-based molecular barcoding strategy and de novo assembly of short-read data. We first validate JigsawSeq on small sub-pools and observed high precision and recall at various experimental settings. With extensive simulations, we then apply JigsawSeq to large-scale gene variant libraries to show that our method can be reliably scaled using next-generation sequencing. JigsawSeq may serve as a rapid screening tool for functional genomics and offer the opportunity to explore evolutionary trajectories of protein variants. Nature Pub. Group 2015-09-21 /pmc/articles/PMC4595759/ /pubmed/26387459 http://dx.doi.org/10.1038/ncomms9351 Text en Copyright © 2015, Nature Publishing Group, a division of Macmillan Publishers Limited. All Rights Reserved. http://creativecommons.org/licenses/by/4.0/ This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article's Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/
spellingShingle Article
Cho, Namjin
Hwang, Byungjin
Yoon, Jung-ki
Park, Sangun
Lee, Joongoo
Seo, Han Na
Lee, Jeewon
Huh, Sunghoon
Chung, Jinsoo
Bang, Duhee
De novo assembly and next-generation sequencing to analyse full-length gene variants from codon-barcoded libraries
title De novo assembly and next-generation sequencing to analyse full-length gene variants from codon-barcoded libraries
title_full De novo assembly and next-generation sequencing to analyse full-length gene variants from codon-barcoded libraries
title_fullStr De novo assembly and next-generation sequencing to analyse full-length gene variants from codon-barcoded libraries
title_full_unstemmed De novo assembly and next-generation sequencing to analyse full-length gene variants from codon-barcoded libraries
title_short De novo assembly and next-generation sequencing to analyse full-length gene variants from codon-barcoded libraries
title_sort de novo assembly and next-generation sequencing to analyse full-length gene variants from codon-barcoded libraries
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4595759/
https://www.ncbi.nlm.nih.gov/pubmed/26387459
http://dx.doi.org/10.1038/ncomms9351
work_keys_str_mv AT chonamjin denovoassemblyandnextgenerationsequencingtoanalysefulllengthgenevariantsfromcodonbarcodedlibraries
AT hwangbyungjin denovoassemblyandnextgenerationsequencingtoanalysefulllengthgenevariantsfromcodonbarcodedlibraries
AT yoonjungki denovoassemblyandnextgenerationsequencingtoanalysefulllengthgenevariantsfromcodonbarcodedlibraries
AT parksangun denovoassemblyandnextgenerationsequencingtoanalysefulllengthgenevariantsfromcodonbarcodedlibraries
AT leejoongoo denovoassemblyandnextgenerationsequencingtoanalysefulllengthgenevariantsfromcodonbarcodedlibraries
AT seohanna denovoassemblyandnextgenerationsequencingtoanalysefulllengthgenevariantsfromcodonbarcodedlibraries
AT leejeewon denovoassemblyandnextgenerationsequencingtoanalysefulllengthgenevariantsfromcodonbarcodedlibraries
AT huhsunghoon denovoassemblyandnextgenerationsequencingtoanalysefulllengthgenevariantsfromcodonbarcodedlibraries
AT chungjinsoo denovoassemblyandnextgenerationsequencingtoanalysefulllengthgenevariantsfromcodonbarcodedlibraries
AT bangduhee denovoassemblyandnextgenerationsequencingtoanalysefulllengthgenevariantsfromcodonbarcodedlibraries