Cargando…

KDeep: a new memory-efficient data extraction method for accurately predicting DNA/RNA transcription factor binding sites

This paper addresses the crucial task of identifying DNA/RNA binding sites, which has implications in drug/vaccine design, protein engineering, and cancer research. Existing methods utilize complex neural network structures, diverse input types, and machine learning techniques for feature extraction...

Descripción completa

Detalles Bibliográficos
Autores principales: Akbari Rokn Abadi, Saeedeh, Tabatabaei, SeyedehFatemeh, Koohi, Somayyeh
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10580661/
https://www.ncbi.nlm.nih.gov/pubmed/37845681
http://dx.doi.org/10.1186/s12967-023-04593-7
Descripción
Sumario:This paper addresses the crucial task of identifying DNA/RNA binding sites, which has implications in drug/vaccine design, protein engineering, and cancer research. Existing methods utilize complex neural network structures, diverse input types, and machine learning techniques for feature extraction. However, the growing volume of sequences poses processing challenges. This study introduces KDeep, employing a CNN-LSTM architecture with a novel encoding method called 2Lk. 2Lk enhances prediction accuracy, reduces memory consumption by up to 84%, reduces trainable parameters, and improves interpretability by approximately 79% compared to state-of-the-art approaches. KDeep offers a promising solution for accurate and efficient binding site prediction. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12967-023-04593-7.