Cargando…
Bio-informatic analysis of CRISPR protospacer adjacent motifs (PAMs) in T4 genome
BACKGROUND: The existence of protospacer adjacent motifs (PAMs) sequences in bacteriophage genome is critical for the recognition and function of the clustered regularly interspaced short palindromic repeats-Cas (CRISPR-Cas) machinery system. We further elucidate the significance of PAMs and their f...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9161530/ https://www.ncbi.nlm.nih.gov/pubmed/35655130 http://dx.doi.org/10.1186/s12863-022-01056-8 |
_version_ | 1784719504599678976 |
---|---|
author | Rawashdeh, Omar Rawashdeh, Rabeah Y. Kebede, Temesgen Kapp, David Ralescu, Anca |
author_facet | Rawashdeh, Omar Rawashdeh, Rabeah Y. Kebede, Temesgen Kapp, David Ralescu, Anca |
author_sort | Rawashdeh, Omar |
collection | PubMed |
description | BACKGROUND: The existence of protospacer adjacent motifs (PAMs) sequences in bacteriophage genome is critical for the recognition and function of the clustered regularly interspaced short palindromic repeats-Cas (CRISPR-Cas) machinery system. We further elucidate the significance of PAMs and their function, particularly as a part of transcriptional regulatory regions in T4 bacteriophages. METHODS: A scripting language was used to analyze a sequence of T4 phage genome, and a list of few selected PAMs. Mann-Whitney Wilcoxon (MWW) test was used to compare the sequence hits for the PAMs versus the hits of all the possible sequences of equal lengths. RESULTS: The results of MWW test show that certain PAMs such as: ‘NGG’ and ‘TATA’ are preferably located at the core of phage promoters: around -10 position, whereas the position around -35 appears to have no detectable count variation of any of the tested PAMs. Among all tested PAMs, the following three sequences: 5’-GCTV-3’, 5’-TTGAAT-3’ and 5’-TTGGGT-3’ have higher prevalence in essential genes. By analyzing all the possible ways of reading PAM sequences as codons for the corresponding amino acids, it was found that deduced amino acids of some PAMs have a significant tendency to prefer the surface of proteins. CONCLUSION: These results provide novel insights into the location and the subsequent identification of the role of PAMs as transcriptional regulatory elements. Also, CRISPR targeting certain PAM sequences is somehow likely to be connected to the hydrophilicity (water solubility) of amino acids translated from PAM’s triplets. Therefore, these amino acids are found at the interacting unit at protein-protein interfaces. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12863-022-01056-8. |
format | Online Article Text |
id | pubmed-9161530 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-91615302022-06-03 Bio-informatic analysis of CRISPR protospacer adjacent motifs (PAMs) in T4 genome Rawashdeh, Omar Rawashdeh, Rabeah Y. Kebede, Temesgen Kapp, David Ralescu, Anca BMC Genom Data Research BACKGROUND: The existence of protospacer adjacent motifs (PAMs) sequences in bacteriophage genome is critical for the recognition and function of the clustered regularly interspaced short palindromic repeats-Cas (CRISPR-Cas) machinery system. We further elucidate the significance of PAMs and their function, particularly as a part of transcriptional regulatory regions in T4 bacteriophages. METHODS: A scripting language was used to analyze a sequence of T4 phage genome, and a list of few selected PAMs. Mann-Whitney Wilcoxon (MWW) test was used to compare the sequence hits for the PAMs versus the hits of all the possible sequences of equal lengths. RESULTS: The results of MWW test show that certain PAMs such as: ‘NGG’ and ‘TATA’ are preferably located at the core of phage promoters: around -10 position, whereas the position around -35 appears to have no detectable count variation of any of the tested PAMs. Among all tested PAMs, the following three sequences: 5’-GCTV-3’, 5’-TTGAAT-3’ and 5’-TTGGGT-3’ have higher prevalence in essential genes. By analyzing all the possible ways of reading PAM sequences as codons for the corresponding amino acids, it was found that deduced amino acids of some PAMs have a significant tendency to prefer the surface of proteins. CONCLUSION: These results provide novel insights into the location and the subsequent identification of the role of PAMs as transcriptional regulatory elements. Also, CRISPR targeting certain PAM sequences is somehow likely to be connected to the hydrophilicity (water solubility) of amino acids translated from PAM’s triplets. Therefore, these amino acids are found at the interacting unit at protein-protein interfaces. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12863-022-01056-8. BioMed Central 2022-06-02 /pmc/articles/PMC9161530/ /pubmed/35655130 http://dx.doi.org/10.1186/s12863-022-01056-8 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Rawashdeh, Omar Rawashdeh, Rabeah Y. Kebede, Temesgen Kapp, David Ralescu, Anca Bio-informatic analysis of CRISPR protospacer adjacent motifs (PAMs) in T4 genome |
title | Bio-informatic analysis of CRISPR protospacer adjacent motifs (PAMs) in T4 genome |
title_full | Bio-informatic analysis of CRISPR protospacer adjacent motifs (PAMs) in T4 genome |
title_fullStr | Bio-informatic analysis of CRISPR protospacer adjacent motifs (PAMs) in T4 genome |
title_full_unstemmed | Bio-informatic analysis of CRISPR protospacer adjacent motifs (PAMs) in T4 genome |
title_short | Bio-informatic analysis of CRISPR protospacer adjacent motifs (PAMs) in T4 genome |
title_sort | bio-informatic analysis of crispr protospacer adjacent motifs (pams) in t4 genome |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9161530/ https://www.ncbi.nlm.nih.gov/pubmed/35655130 http://dx.doi.org/10.1186/s12863-022-01056-8 |
work_keys_str_mv | AT rawashdehomar bioinformaticanalysisofcrisprprotospaceradjacentmotifspamsint4genome AT rawashdehrabeahy bioinformaticanalysisofcrisprprotospaceradjacentmotifspamsint4genome AT kebedetemesgen bioinformaticanalysisofcrisprprotospaceradjacentmotifspamsint4genome AT kappdavid bioinformaticanalysisofcrisprprotospaceradjacentmotifspamsint4genome AT ralescuanca bioinformaticanalysisofcrisprprotospaceradjacentmotifspamsint4genome |