Cargando…

Automated identification of sequence-tailored Cas9 proteins using massive metagenomic data

The identification of the protospacer adjacent motif (PAM) sequences of Cas9 nucleases is crucial for their exploitation in genome editing. Here we develop a computational pipeline that was used to interrogate a massively expanded dataset of metagenome and virome assemblies for accurate and comprehe...

Descripción completa

Detalles Bibliográficos
Autores principales: Ciciani, Matteo, Demozzi, Michele, Pedrazzoli, Eleonora, Visentin, Elisabetta, Pezzè, Laura, Signorini, Lorenzo Federico, Blanco-Miguez, Aitor, Zolfo, Moreno, Asnicar, Francesco, Casini, Antonio, Cereseto, Anna, Segata, Nicola
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9617884/
https://www.ncbi.nlm.nih.gov/pubmed/36309502
http://dx.doi.org/10.1038/s41467-022-34213-9
Descripción
Sumario:The identification of the protospacer adjacent motif (PAM) sequences of Cas9 nucleases is crucial for their exploitation in genome editing. Here we develop a computational pipeline that was used to interrogate a massively expanded dataset of metagenome and virome assemblies for accurate and comprehensive PAM predictions. This procedure allows the identification and isolation of sequence-tailored Cas9 nucleases by using the target sequence as bait. As proof of concept, starting from the disease-causing mutation P23H in the RHO gene, we find, isolate and experimentally validate a Cas9 which uses the mutated sequence as PAM. Our PAM prediction pipeline will be instrumental to generate a Cas9 nuclease repertoire responding to any PAM requirement.