Cargando…

Bio-informatic analysis of CRISPR protospacer adjacent motifs (PAMs) in T4 genome

BACKGROUND: The existence of protospacer adjacent motifs (PAMs) sequences in bacteriophage genome is critical for the recognition and function of the clustered regularly interspaced short palindromic repeats-Cas (CRISPR-Cas) machinery system. We further elucidate the significance of PAMs and their f...

Descripción completa

Detalles Bibliográficos
Autores principales: Rawashdeh, Omar, Rawashdeh, Rabeah Y., Kebede, Temesgen, Kapp, David, Ralescu, Anca
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9161530/
https://www.ncbi.nlm.nih.gov/pubmed/35655130
http://dx.doi.org/10.1186/s12863-022-01056-8
_version_ 1784719504599678976
author Rawashdeh, Omar
Rawashdeh, Rabeah Y.
Kebede, Temesgen
Kapp, David
Ralescu, Anca
author_facet Rawashdeh, Omar
Rawashdeh, Rabeah Y.
Kebede, Temesgen
Kapp, David
Ralescu, Anca
author_sort Rawashdeh, Omar
collection PubMed
description BACKGROUND: The existence of protospacer adjacent motifs (PAMs) sequences in bacteriophage genome is critical for the recognition and function of the clustered regularly interspaced short palindromic repeats-Cas (CRISPR-Cas) machinery system. We further elucidate the significance of PAMs and their function, particularly as a part of transcriptional regulatory regions in T4 bacteriophages. METHODS: A scripting language was used to analyze a sequence of T4 phage genome, and a list of few selected PAMs. Mann-Whitney Wilcoxon (MWW) test was used to compare the sequence hits for the PAMs versus the hits of all the possible sequences of equal lengths. RESULTS: The results of MWW test show that certain PAMs such as: ‘NGG’ and ‘TATA’ are preferably located at the core of phage promoters: around -10 position, whereas the position around -35 appears to have no detectable count variation of any of the tested PAMs. Among all tested PAMs, the following three sequences: 5’-GCTV-3’, 5’-TTGAAT-3’ and 5’-TTGGGT-3’ have higher prevalence in essential genes. By analyzing all the possible ways of reading PAM sequences as codons for the corresponding amino acids, it was found that deduced amino acids of some PAMs have a significant tendency to prefer the surface of proteins. CONCLUSION: These results provide novel insights into the location and the subsequent identification of the role of PAMs as transcriptional regulatory elements. Also, CRISPR targeting certain PAM sequences is somehow likely to be connected to the hydrophilicity (water solubility) of amino acids translated from PAM’s triplets. Therefore, these amino acids are found at the interacting unit at protein-protein interfaces. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12863-022-01056-8.
format Online
Article
Text
id pubmed-9161530
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-91615302022-06-03 Bio-informatic analysis of CRISPR protospacer adjacent motifs (PAMs) in T4 genome Rawashdeh, Omar Rawashdeh, Rabeah Y. Kebede, Temesgen Kapp, David Ralescu, Anca BMC Genom Data Research BACKGROUND: The existence of protospacer adjacent motifs (PAMs) sequences in bacteriophage genome is critical for the recognition and function of the clustered regularly interspaced short palindromic repeats-Cas (CRISPR-Cas) machinery system. We further elucidate the significance of PAMs and their function, particularly as a part of transcriptional regulatory regions in T4 bacteriophages. METHODS: A scripting language was used to analyze a sequence of T4 phage genome, and a list of few selected PAMs. Mann-Whitney Wilcoxon (MWW) test was used to compare the sequence hits for the PAMs versus the hits of all the possible sequences of equal lengths. RESULTS: The results of MWW test show that certain PAMs such as: ‘NGG’ and ‘TATA’ are preferably located at the core of phage promoters: around -10 position, whereas the position around -35 appears to have no detectable count variation of any of the tested PAMs. Among all tested PAMs, the following three sequences: 5’-GCTV-3’, 5’-TTGAAT-3’ and 5’-TTGGGT-3’ have higher prevalence in essential genes. By analyzing all the possible ways of reading PAM sequences as codons for the corresponding amino acids, it was found that deduced amino acids of some PAMs have a significant tendency to prefer the surface of proteins. CONCLUSION: These results provide novel insights into the location and the subsequent identification of the role of PAMs as transcriptional regulatory elements. Also, CRISPR targeting certain PAM sequences is somehow likely to be connected to the hydrophilicity (water solubility) of amino acids translated from PAM’s triplets. Therefore, these amino acids are found at the interacting unit at protein-protein interfaces. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12863-022-01056-8. BioMed Central 2022-06-02 /pmc/articles/PMC9161530/ /pubmed/35655130 http://dx.doi.org/10.1186/s12863-022-01056-8 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Rawashdeh, Omar
Rawashdeh, Rabeah Y.
Kebede, Temesgen
Kapp, David
Ralescu, Anca
Bio-informatic analysis of CRISPR protospacer adjacent motifs (PAMs) in T4 genome
title Bio-informatic analysis of CRISPR protospacer adjacent motifs (PAMs) in T4 genome
title_full Bio-informatic analysis of CRISPR protospacer adjacent motifs (PAMs) in T4 genome
title_fullStr Bio-informatic analysis of CRISPR protospacer adjacent motifs (PAMs) in T4 genome
title_full_unstemmed Bio-informatic analysis of CRISPR protospacer adjacent motifs (PAMs) in T4 genome
title_short Bio-informatic analysis of CRISPR protospacer adjacent motifs (PAMs) in T4 genome
title_sort bio-informatic analysis of crispr protospacer adjacent motifs (pams) in t4 genome
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9161530/
https://www.ncbi.nlm.nih.gov/pubmed/35655130
http://dx.doi.org/10.1186/s12863-022-01056-8
work_keys_str_mv AT rawashdehomar bioinformaticanalysisofcrisprprotospaceradjacentmotifspamsint4genome
AT rawashdehrabeahy bioinformaticanalysisofcrisprprotospaceradjacentmotifspamsint4genome
AT kebedetemesgen bioinformaticanalysisofcrisprprotospaceradjacentmotifspamsint4genome
AT kappdavid bioinformaticanalysisofcrisprprotospaceradjacentmotifspamsint4genome
AT ralescuanca bioinformaticanalysisofcrisprprotospaceradjacentmotifspamsint4genome