Cargando…

A novel framework for engineering protein loops exploring length and compositional variation

Insertions and deletions (indels) are known to affect function, biophysical properties and substrate specificity of enzymes, and they play a central role in evolution. Despite such clear significance, this class of mutation remains an underexploited tool in protein engineering with few available pla...

Descripción completa

Detalles Bibliográficos
Autores principales: Tizei, Pedro A. G., Harris, Emma, Withanage, Shamal, Renders, Marleen, Pinheiro, Vitor B.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8080606/
https://www.ncbi.nlm.nih.gov/pubmed/33911147
http://dx.doi.org/10.1038/s41598-021-88708-4
Descripción
Sumario:Insertions and deletions (indels) are known to affect function, biophysical properties and substrate specificity of enzymes, and they play a central role in evolution. Despite such clear significance, this class of mutation remains an underexploited tool in protein engineering with few available platforms capable of systematically generating and analysing libraries of varying sequence composition and length. We present a novel DNA assembly platform (InDel assembly), based on cycles of endonuclease restriction digestion and ligation of standardised dsDNA building blocks, that can generate libraries exploring both composition and sequence length variation. In addition, we developed a framework to analyse the output of selection from InDel-generated libraries, combining next generation sequencing and alignment-free strategies for sequence analysis. We demonstrate the approach by engineering the well-characterized TEM-1 β-lactamase Ω-loop, involved in substrate specificity, identifying multiple novel extended spectrum β-lactamases with loops of modified length and composition—areas of the sequence space not previously explored. Together, the InDel assembly and analysis platforms provide an efficient route to engineer protein loops or linkers where sequence length and composition are both essential functional parameters.