Cargando…

PENGUINN: Precise Exploration of Nuclear G-Quadruplexes Using Interpretable Neural Networks

G-quadruplexes (G4s) are a class of stable structural nucleic acid secondary structures that are known to play a role in a wide spectrum of genomic functions, such as DNA replication and transcription. The classical understanding of G4 structure points to four variable length guanine strands joined...

Descripción completa

Detalles Bibliográficos
Autores principales: Klimentova, Eva, Polacek, Jakub, Simecek, Petr, Alexiou, Panagiotis
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7653191/
https://www.ncbi.nlm.nih.gov/pubmed/33193663
http://dx.doi.org/10.3389/fgene.2020.568546
_version_ 1783607853004619776
author Klimentova, Eva
Polacek, Jakub
Simecek, Petr
Alexiou, Panagiotis
author_facet Klimentova, Eva
Polacek, Jakub
Simecek, Petr
Alexiou, Panagiotis
author_sort Klimentova, Eva
collection PubMed
description G-quadruplexes (G4s) are a class of stable structural nucleic acid secondary structures that are known to play a role in a wide spectrum of genomic functions, such as DNA replication and transcription. The classical understanding of G4 structure points to four variable length guanine strands joined by variable length nucleotide stretches. Experiments using G4 immunoprecipitation and sequencing experiments have produced a high number of highly probable G4 forming genomic sequences. The expense and technical difficulty of experimental techniques highlights the need for computational approaches of G4 identification. Here, we present PENGUINN, a machine learning method based on Convolutional neural networks, that learns the characteristics of G4 sequences and accurately predicts G4s outperforming state-of-the-art methods. We provide both a standalone implementation of the trained model, and a web application that can be used to evaluate sequences for their G4 potential.
format Online
Article
Text
id pubmed-7653191
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-76531912020-11-13 PENGUINN: Precise Exploration of Nuclear G-Quadruplexes Using Interpretable Neural Networks Klimentova, Eva Polacek, Jakub Simecek, Petr Alexiou, Panagiotis Front Genet Genetics G-quadruplexes (G4s) are a class of stable structural nucleic acid secondary structures that are known to play a role in a wide spectrum of genomic functions, such as DNA replication and transcription. The classical understanding of G4 structure points to four variable length guanine strands joined by variable length nucleotide stretches. Experiments using G4 immunoprecipitation and sequencing experiments have produced a high number of highly probable G4 forming genomic sequences. The expense and technical difficulty of experimental techniques highlights the need for computational approaches of G4 identification. Here, we present PENGUINN, a machine learning method based on Convolutional neural networks, that learns the characteristics of G4 sequences and accurately predicts G4s outperforming state-of-the-art methods. We provide both a standalone implementation of the trained model, and a web application that can be used to evaluate sequences for their G4 potential. Frontiers Media S.A. 2020-10-27 /pmc/articles/PMC7653191/ /pubmed/33193663 http://dx.doi.org/10.3389/fgene.2020.568546 Text en Copyright © 2020 Klimentova, Polacek, Simecek and Alexiou. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Klimentova, Eva
Polacek, Jakub
Simecek, Petr
Alexiou, Panagiotis
PENGUINN: Precise Exploration of Nuclear G-Quadruplexes Using Interpretable Neural Networks
title PENGUINN: Precise Exploration of Nuclear G-Quadruplexes Using Interpretable Neural Networks
title_full PENGUINN: Precise Exploration of Nuclear G-Quadruplexes Using Interpretable Neural Networks
title_fullStr PENGUINN: Precise Exploration of Nuclear G-Quadruplexes Using Interpretable Neural Networks
title_full_unstemmed PENGUINN: Precise Exploration of Nuclear G-Quadruplexes Using Interpretable Neural Networks
title_short PENGUINN: Precise Exploration of Nuclear G-Quadruplexes Using Interpretable Neural Networks
title_sort penguinn: precise exploration of nuclear g-quadruplexes using interpretable neural networks
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7653191/
https://www.ncbi.nlm.nih.gov/pubmed/33193663
http://dx.doi.org/10.3389/fgene.2020.568546
work_keys_str_mv AT klimentovaeva penguinnpreciseexplorationofnucleargquadruplexesusinginterpretableneuralnetworks
AT polacekjakub penguinnpreciseexplorationofnucleargquadruplexesusinginterpretableneuralnetworks
AT simecekpetr penguinnpreciseexplorationofnucleargquadruplexesusinginterpretableneuralnetworks
AT alexioupanagiotis penguinnpreciseexplorationofnucleargquadruplexesusinginterpretableneuralnetworks