LSTM4piRNA: Efficient piRNA Detection in Large-Scale Genome Databases Using a Deep Learning-Based LSTM Network

Bibliographic Details
Main authors: Chen, Chun-Chi; Chan, Yi-Ming; Jeong, Hyundoo
Format: Online Article Text
Language: English
Published: MDPI 2023
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10649320/
https://www.ncbi.nlm.nih.gov/pubmed/37958663
http://dx.doi.org/10.3390/ijms242115681
author Chen, Chun-Chi
Chan, Yi-Ming
Jeong, Hyundoo
collection PubMed
description Piwi-interacting RNAs (piRNAs) are a new class of small, non-coding RNAs that are crucial in the regulation of gene expression. Recent research has revealed links between piRNAs, viral defense mechanisms, and certain human cancers. Because of their clinical potential, there is great interest in identifying piRNAs from large genome databases through efficient computational methods. However, piRNAs lack conserved structure and sequence homology across species, which makes piRNA detection challenging. Current detection algorithms rely heavily on manually crafted features, so informative features may be overlooked or used improperly. Furthermore, there is a lack of suitable computational tools for analyzing large-scale databases and accurately identifying piRNAs. To address these issues, we propose LSTM4piRNA, a highly efficient deep learning-based method for predicting piRNAs in large-scale genome databases. LSTM4piRNA uses a compact LSTM network that can effectively analyze RNA sequences from extensive datasets to detect piRNAs. It automatically learns the dependencies among RNA sequences, and regularization is integrated to reduce the generalization error. Comprehensive performance evaluations based on piRNAs from the piRBase database demonstrate that LSTM4piRNA outperforms current advanced methods and is well suited for analysis of large-scale databases.
format Online
Article
Text
id pubmed-10649320
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-10649320 2023-10-27 Int J Mol Sci Article MDPI 2023-10-27 /pmc/articles/PMC10649320/ /pubmed/37958663 http://dx.doi.org/10.3390/ijms242115681 Text en © 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
title LSTM4piRNA: Efficient piRNA Detection in Large-Scale Genome Databases Using a Deep Learning-Based LSTM Network
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10649320/
https://www.ncbi.nlm.nih.gov/pubmed/37958663
http://dx.doi.org/10.3390/ijms242115681