Cargando…

Machine-learning approach expands the repertoire of anti-CRISPR protein families

The CRISPR-Cas are adaptive bacterial and archaeal immunity systems that have been harnessed for the development of powerful genome editing and engineering tools. In the incessant host-parasite arms race, viruses evolved multiple anti-defense mechanisms including diverse anti-CRISPR proteins (Acrs)...

Descripción completa

Detalles Bibliográficos
Autores principales: Gussow, Ayal B., Park, Allyson E., Borges, Adair L., Shmakov, Sergey A., Makarova, Kira S., Wolf, Yuri I., Bondy-Denomy, Joseph, Koonin, Eugene V.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7391736/
https://www.ncbi.nlm.nih.gov/pubmed/32728052
http://dx.doi.org/10.1038/s41467-020-17652-0
_version_ 1783564712123826176
author Gussow, Ayal B.
Park, Allyson E.
Borges, Adair L.
Shmakov, Sergey A.
Makarova, Kira S.
Wolf, Yuri I.
Bondy-Denomy, Joseph
Koonin, Eugene V.
author_facet Gussow, Ayal B.
Park, Allyson E.
Borges, Adair L.
Shmakov, Sergey A.
Makarova, Kira S.
Wolf, Yuri I.
Bondy-Denomy, Joseph
Koonin, Eugene V.
author_sort Gussow, Ayal B.
collection PubMed
description The CRISPR-Cas are adaptive bacterial and archaeal immunity systems that have been harnessed for the development of powerful genome editing and engineering tools. In the incessant host-parasite arms race, viruses evolved multiple anti-defense mechanisms including diverse anti-CRISPR proteins (Acrs) that specifically inhibit CRISPR-Cas and therefore have enormous potential for application as modulators of genome editing tools. Most Acrs are small and highly variable proteins which makes their bioinformatic prediction a formidable task. We present a machine-learning approach for comprehensive Acr prediction. The model shows high predictive power when tested against an unseen test set and was employed to predict 2,500 candidate Acr families. Experimental validation of top candidates revealed two unknown Acrs (AcrIC9, IC10) and three other top candidates were coincidentally identified and found to possess anti-CRISPR activity. These results substantially expand the repertoire of predicted Acrs and provide a resource for experimental Acr discovery.
format Online
Article
Text
id pubmed-7391736
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-73917362020-08-12 Machine-learning approach expands the repertoire of anti-CRISPR protein families Gussow, Ayal B. Park, Allyson E. Borges, Adair L. Shmakov, Sergey A. Makarova, Kira S. Wolf, Yuri I. Bondy-Denomy, Joseph Koonin, Eugene V. Nat Commun Article The CRISPR-Cas are adaptive bacterial and archaeal immunity systems that have been harnessed for the development of powerful genome editing and engineering tools. In the incessant host-parasite arms race, viruses evolved multiple anti-defense mechanisms including diverse anti-CRISPR proteins (Acrs) that specifically inhibit CRISPR-Cas and therefore have enormous potential for application as modulators of genome editing tools. Most Acrs are small and highly variable proteins which makes their bioinformatic prediction a formidable task. We present a machine-learning approach for comprehensive Acr prediction. The model shows high predictive power when tested against an unseen test set and was employed to predict 2,500 candidate Acr families. Experimental validation of top candidates revealed two unknown Acrs (AcrIC9, IC10) and three other top candidates were coincidentally identified and found to possess anti-CRISPR activity. These results substantially expand the repertoire of predicted Acrs and provide a resource for experimental Acr discovery. Nature Publishing Group UK 2020-07-29 /pmc/articles/PMC7391736/ /pubmed/32728052 http://dx.doi.org/10.1038/s41467-020-17652-0 Text en © This is a U.S. government work and not under copyright protection in the U.S.; foreign copyright protection may apply 2020 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Gussow, Ayal B.
Park, Allyson E.
Borges, Adair L.
Shmakov, Sergey A.
Makarova, Kira S.
Wolf, Yuri I.
Bondy-Denomy, Joseph
Koonin, Eugene V.
Machine-learning approach expands the repertoire of anti-CRISPR protein families
title Machine-learning approach expands the repertoire of anti-CRISPR protein families
title_full Machine-learning approach expands the repertoire of anti-CRISPR protein families
title_fullStr Machine-learning approach expands the repertoire of anti-CRISPR protein families
title_full_unstemmed Machine-learning approach expands the repertoire of anti-CRISPR protein families
title_short Machine-learning approach expands the repertoire of anti-CRISPR protein families
title_sort machine-learning approach expands the repertoire of anti-crispr protein families
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7391736/
https://www.ncbi.nlm.nih.gov/pubmed/32728052
http://dx.doi.org/10.1038/s41467-020-17652-0
work_keys_str_mv AT gussowayalb machinelearningapproachexpandstherepertoireofanticrisprproteinfamilies
AT parkallysone machinelearningapproachexpandstherepertoireofanticrisprproteinfamilies
AT borgesadairl machinelearningapproachexpandstherepertoireofanticrisprproteinfamilies
AT shmakovsergeya machinelearningapproachexpandstherepertoireofanticrisprproteinfamilies
AT makarovakiras machinelearningapproachexpandstherepertoireofanticrisprproteinfamilies
AT wolfyurii machinelearningapproachexpandstherepertoireofanticrisprproteinfamilies
AT bondydenomyjoseph machinelearningapproachexpandstherepertoireofanticrisprproteinfamilies
AT koonineugenev machinelearningapproachexpandstherepertoireofanticrisprproteinfamilies