Cargando…

PaCRISPR: a server for predicting and visualizing anti-CRISPR proteins

Anti-CRISPRs are widespread amongst bacteriophage and promote bacteriophage infection by inactivating the bacterial host's CRISPR–Cas defence system. Identifying and characterizing anti-CRISPR proteins opens an avenue to explore and control CRISPR–Cas machineries for the development of new CRIS...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Jiawei, Dai, Wei, Li, Jiahui, Xie, Ruopeng, Dunstan, Rhys A, Stubenrauch, Christopher, Zhang, Yanju, Lithgow, Trevor
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7319593/
https://www.ncbi.nlm.nih.gov/pubmed/32459325
http://dx.doi.org/10.1093/nar/gkaa432
_version_ 1783551085879754752
author Wang, Jiawei
Dai, Wei
Li, Jiahui
Xie, Ruopeng
Dunstan, Rhys A
Stubenrauch, Christopher
Zhang, Yanju
Lithgow, Trevor
author_facet Wang, Jiawei
Dai, Wei
Li, Jiahui
Xie, Ruopeng
Dunstan, Rhys A
Stubenrauch, Christopher
Zhang, Yanju
Lithgow, Trevor
author_sort Wang, Jiawei
collection PubMed
description Anti-CRISPRs are widespread amongst bacteriophage and promote bacteriophage infection by inactivating the bacterial host's CRISPR–Cas defence system. Identifying and characterizing anti-CRISPR proteins opens an avenue to explore and control CRISPR–Cas machineries for the development of new CRISPR–Cas based biotechnological and therapeutic tools. Past studies have identified anti-CRISPRs in several model phage genomes, but a challenge exists to comprehensively screen for anti-CRISPRs accurately and efficiently from genome and metagenome sequence data. Here, we have developed an ensemble learning based predictor, PaCRISPR, to accurately identify anti-CRISPRs from protein datasets derived from genome and metagenome sequencing projects. PaCRISPR employs different types of feature recognition united within an ensemble framework. Extensive cross-validation and independent tests show that PaCRISPR achieves a significantly more accurate performance compared with homology-based baseline predictors and an existing toolkit. The performance of PaCRISPR was further validated in discovering anti-CRISPRs that were not part of the training for PaCRISPR, but which were recently demonstrated to function as anti-CRISPRs for phage infections. Data visualization on anti-CRISPR relationships, highlighting sequence similarity and phylogenetic considerations, is part of the output from the PaCRISPR toolkit, which is freely available at http://pacrispr.erc.monash.edu/.
format Online
Article
Text
id pubmed-7319593
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-73195932020-07-01 PaCRISPR: a server for predicting and visualizing anti-CRISPR proteins Wang, Jiawei Dai, Wei Li, Jiahui Xie, Ruopeng Dunstan, Rhys A Stubenrauch, Christopher Zhang, Yanju Lithgow, Trevor Nucleic Acids Res Web Server Issue Anti-CRISPRs are widespread amongst bacteriophage and promote bacteriophage infection by inactivating the bacterial host's CRISPR–Cas defence system. Identifying and characterizing anti-CRISPR proteins opens an avenue to explore and control CRISPR–Cas machineries for the development of new CRISPR–Cas based biotechnological and therapeutic tools. Past studies have identified anti-CRISPRs in several model phage genomes, but a challenge exists to comprehensively screen for anti-CRISPRs accurately and efficiently from genome and metagenome sequence data. Here, we have developed an ensemble learning based predictor, PaCRISPR, to accurately identify anti-CRISPRs from protein datasets derived from genome and metagenome sequencing projects. PaCRISPR employs different types of feature recognition united within an ensemble framework. Extensive cross-validation and independent tests show that PaCRISPR achieves a significantly more accurate performance compared with homology-based baseline predictors and an existing toolkit. The performance of PaCRISPR was further validated in discovering anti-CRISPRs that were not part of the training for PaCRISPR, but which were recently demonstrated to function as anti-CRISPRs for phage infections. Data visualization on anti-CRISPR relationships, highlighting sequence similarity and phylogenetic considerations, is part of the output from the PaCRISPR toolkit, which is freely available at http://pacrispr.erc.monash.edu/. Oxford University Press 2020-07-02 2020-05-27 /pmc/articles/PMC7319593/ /pubmed/32459325 http://dx.doi.org/10.1093/nar/gkaa432 Text en © The Author(s) 2020. Published by Oxford University Press on behalf of Nucleic Acids Research. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Web Server Issue
Wang, Jiawei
Dai, Wei
Li, Jiahui
Xie, Ruopeng
Dunstan, Rhys A
Stubenrauch, Christopher
Zhang, Yanju
Lithgow, Trevor
PaCRISPR: a server for predicting and visualizing anti-CRISPR proteins
title PaCRISPR: a server for predicting and visualizing anti-CRISPR proteins
title_full PaCRISPR: a server for predicting and visualizing anti-CRISPR proteins
title_fullStr PaCRISPR: a server for predicting and visualizing anti-CRISPR proteins
title_full_unstemmed PaCRISPR: a server for predicting and visualizing anti-CRISPR proteins
title_short PaCRISPR: a server for predicting and visualizing anti-CRISPR proteins
title_sort pacrispr: a server for predicting and visualizing anti-crispr proteins
topic Web Server Issue
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7319593/
https://www.ncbi.nlm.nih.gov/pubmed/32459325
http://dx.doi.org/10.1093/nar/gkaa432
work_keys_str_mv AT wangjiawei pacrispraserverforpredictingandvisualizinganticrisprproteins
AT daiwei pacrispraserverforpredictingandvisualizinganticrisprproteins
AT lijiahui pacrispraserverforpredictingandvisualizinganticrisprproteins
AT xieruopeng pacrispraserverforpredictingandvisualizinganticrisprproteins
AT dunstanrhysa pacrispraserverforpredictingandvisualizinganticrisprproteins
AT stubenrauchchristopher pacrispraserverforpredictingandvisualizinganticrisprproteins
AT zhangyanju pacrispraserverforpredictingandvisualizinganticrisprproteins
AT lithgowtrevor pacrispraserverforpredictingandvisualizinganticrisprproteins