Cargando…

QUFIND: tool for comparative prediction and mining of G4 quadruplexes overlapping with CpG islands

G-quadruplexes (G4s) are secondary structures in DNA that have been shown to be involved in gene regulation. They play a vital role in the cellular processes and several pathogens including bacteria, fungi, and viruses have also been shown to possess G4s that help them in their pathogenesis. Additio...

Descripción completa

Detalles Bibliográficos
Autores principales: Kaur, Baljeet, Sharma, Priya, Arora, Pooja, Sood, Vikas
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10634401/
https://www.ncbi.nlm.nih.gov/pubmed/37953924
http://dx.doi.org/10.3389/fgene.2023.1265808
_version_ 1785132820800208896
author Kaur, Baljeet
Sharma, Priya
Arora, Pooja
Sood, Vikas
author_facet Kaur, Baljeet
Sharma, Priya
Arora, Pooja
Sood, Vikas
author_sort Kaur, Baljeet
collection PubMed
description G-quadruplexes (G4s) are secondary structures in DNA that have been shown to be involved in gene regulation. They play a vital role in the cellular processes and several pathogens including bacteria, fungi, and viruses have also been shown to possess G4s that help them in their pathogenesis. Additionally, cross-talk among the CpG islands and G4s has been shown to influence biological processes. The virus-encoded G4s are affected by the mutational landscape leading to the formation/deletion of these G4s. Therefore, understanding and predicting these multivariate effects on traditional and non-traditional quadruplexes forms an important area of research, that is, yet to be investigated. We have designed a user-friendly webserver QUFIND (http://soodlab.com/qufinder/) that can predict traditional as well as non-traditional quadruplexes in a given sequence. QUFIND is connected with ENSEMBL and NCBI so that the sequences can be fetched in a real-time manner. The algorithm is designed in such a way that the user is provided with multiple options to customize the base (A, T, G, or C), size of the stem (2–5), loop length (1–30), number of bulges (1–5) as well as the number of mismatches (0–2) enabling the identification of any of the secondary structure as per their interest. QUFIND is designed to predict both CpG islands as well as G4s in a given sequence. Since G4s are very short as compared to the CpG islands, hence, QUFIND can also predict the overlapping G4s within CpG islands. Therefore, the user has the flexibility to identify either overlapping or non-overlapping G4s along with the CpG islands. Additionally, one section of QUFIND is dedicated to comparing the G4s in two viral sequences. The visualization is designed in such a manner that the user is able to see the unique quadruplexes in both the input sequences. The efficiency of QUFIND is calculated on G4s obtained from G4 high throughput sequencing data (n = 1000) or experimentally validated G4s (n = 329). Our results revealed that QUFIND is able to predict G4-quadruplexes obtained from G4-sequencing data with 90.06% prediction accuracy whereas experimentally validated quadruplexes were predicted with 97.26% prediction accuracy.
format Online
Article
Text
id pubmed-10634401
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-106344012023-11-10 QUFIND: tool for comparative prediction and mining of G4 quadruplexes overlapping with CpG islands Kaur, Baljeet Sharma, Priya Arora, Pooja Sood, Vikas Front Genet Genetics G-quadruplexes (G4s) are secondary structures in DNA that have been shown to be involved in gene regulation. They play a vital role in the cellular processes and several pathogens including bacteria, fungi, and viruses have also been shown to possess G4s that help them in their pathogenesis. Additionally, cross-talk among the CpG islands and G4s has been shown to influence biological processes. The virus-encoded G4s are affected by the mutational landscape leading to the formation/deletion of these G4s. Therefore, understanding and predicting these multivariate effects on traditional and non-traditional quadruplexes forms an important area of research, that is, yet to be investigated. We have designed a user-friendly webserver QUFIND (http://soodlab.com/qufinder/) that can predict traditional as well as non-traditional quadruplexes in a given sequence. QUFIND is connected with ENSEMBL and NCBI so that the sequences can be fetched in a real-time manner. The algorithm is designed in such a way that the user is provided with multiple options to customize the base (A, T, G, or C), size of the stem (2–5), loop length (1–30), number of bulges (1–5) as well as the number of mismatches (0–2) enabling the identification of any of the secondary structure as per their interest. QUFIND is designed to predict both CpG islands as well as G4s in a given sequence. Since G4s are very short as compared to the CpG islands, hence, QUFIND can also predict the overlapping G4s within CpG islands. Therefore, the user has the flexibility to identify either overlapping or non-overlapping G4s along with the CpG islands. Additionally, one section of QUFIND is dedicated to comparing the G4s in two viral sequences. The visualization is designed in such a manner that the user is able to see the unique quadruplexes in both the input sequences. The efficiency of QUFIND is calculated on G4s obtained from G4 high throughput sequencing data (n = 1000) or experimentally validated G4s (n = 329). Our results revealed that QUFIND is able to predict G4-quadruplexes obtained from G4-sequencing data with 90.06% prediction accuracy whereas experimentally validated quadruplexes were predicted with 97.26% prediction accuracy. Frontiers Media S.A. 2023-10-25 /pmc/articles/PMC10634401/ /pubmed/37953924 http://dx.doi.org/10.3389/fgene.2023.1265808 Text en Copyright © 2023 Kaur, Sharma, Arora and Sood. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Kaur, Baljeet
Sharma, Priya
Arora, Pooja
Sood, Vikas
QUFIND: tool for comparative prediction and mining of G4 quadruplexes overlapping with CpG islands
title QUFIND: tool for comparative prediction and mining of G4 quadruplexes overlapping with CpG islands
title_full QUFIND: tool for comparative prediction and mining of G4 quadruplexes overlapping with CpG islands
title_fullStr QUFIND: tool for comparative prediction and mining of G4 quadruplexes overlapping with CpG islands
title_full_unstemmed QUFIND: tool for comparative prediction and mining of G4 quadruplexes overlapping with CpG islands
title_short QUFIND: tool for comparative prediction and mining of G4 quadruplexes overlapping with CpG islands
title_sort qufind: tool for comparative prediction and mining of g4 quadruplexes overlapping with cpg islands
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10634401/
https://www.ncbi.nlm.nih.gov/pubmed/37953924
http://dx.doi.org/10.3389/fgene.2023.1265808
work_keys_str_mv AT kaurbaljeet qufindtoolforcomparativepredictionandminingofg4quadruplexesoverlappingwithcpgislands
AT sharmapriya qufindtoolforcomparativepredictionandminingofg4quadruplexesoverlappingwithcpgislands
AT arorapooja qufindtoolforcomparativepredictionandminingofg4quadruplexesoverlappingwithcpgislands
AT soodvikas qufindtoolforcomparativepredictionandminingofg4quadruplexesoverlappingwithcpgislands