Cargando…
ChromDL: A Next-Generation Regulatory DNA Classifier
MOTIVATION: Predicting the regulatory function of non-coding DNA using only the DNA sequence continues to be a major challenge in genomics. With the advent of improved optimization algorithms, faster GPU speeds, and more intricate machine learning libraries, hybrid convolutional and recurrent neural...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Cold Spring Harbor Laboratory
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9928050/ https://www.ncbi.nlm.nih.gov/pubmed/36789431 http://dx.doi.org/10.1101/2023.01.27.525971 |
_version_ | 1784888573156130816 |
---|---|
author | Hill, Christopher Hudaiberdiev, Sanjarbek Ovcharenko, Ivan |
author_facet | Hill, Christopher Hudaiberdiev, Sanjarbek Ovcharenko, Ivan |
author_sort | Hill, Christopher |
collection | PubMed |
description | MOTIVATION: Predicting the regulatory function of non-coding DNA using only the DNA sequence continues to be a major challenge in genomics. With the advent of improved optimization algorithms, faster GPU speeds, and more intricate machine learning libraries, hybrid convolutional and recurrent neural network architectures can be constructed and applied to extract crucial information from non-coding DNA. RESULTS: Using a comparative analysis of the performance of thousands of Deep Learning (DL) architectures, we developed ChromDL, a neural network architecture combining bidirectional gated recurrent units (BiGRU), convolutional neural networks (CNNs), and bidirectional long short-term memory units (BiLSTM), which significantly improves upon a range of prediction metrics compared to its predecessors in transcription factor binding site (TFBS), histone modification (HM), and DNase-I hypersensitive site (DHS) detection. Combined with a secondary model, it can be utilized for accurate classification of gene regulatory elements. The model can also detect weak transcription factor (TF) binding with higher accuracy as compared to previously developed methods and has the potential to accurately delineate TF binding motif specificities. AVAILABILITY: The ChromDL source code can be found at https://github.com/chrishil1/ChromDL. |
format | Online Article Text |
id | pubmed-9928050 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Cold Spring Harbor Laboratory |
record_format | MEDLINE/PubMed |
spelling | pubmed-99280502023-02-15 ChromDL: A Next-Generation Regulatory DNA Classifier Hill, Christopher Hudaiberdiev, Sanjarbek Ovcharenko, Ivan bioRxiv Article MOTIVATION: Predicting the regulatory function of non-coding DNA using only the DNA sequence continues to be a major challenge in genomics. With the advent of improved optimization algorithms, faster GPU speeds, and more intricate machine learning libraries, hybrid convolutional and recurrent neural network architectures can be constructed and applied to extract crucial information from non-coding DNA. RESULTS: Using a comparative analysis of the performance of thousands of Deep Learning (DL) architectures, we developed ChromDL, a neural network architecture combining bidirectional gated recurrent units (BiGRU), convolutional neural networks (CNNs), and bidirectional long short-term memory units (BiLSTM), which significantly improves upon a range of prediction metrics compared to its predecessors in transcription factor binding site (TFBS), histone modification (HM), and DNase-I hypersensitive site (DHS) detection. Combined with a secondary model, it can be utilized for accurate classification of gene regulatory elements. The model can also detect weak transcription factor (TF) binding with higher accuracy as compared to previously developed methods and has the potential to accurately delineate TF binding motif specificities. AVAILABILITY: The ChromDL source code can be found at https://github.com/chrishil1/ChromDL. Cold Spring Harbor Laboratory 2023-01-29 /pmc/articles/PMC9928050/ /pubmed/36789431 http://dx.doi.org/10.1101/2023.01.27.525971 Text en https://creativecommons.org/licenses/by-nd/4.0/This work is licensed under a Creative Commons Attribution-NoDerivatives 4.0 International License (https://creativecommons.org/licenses/by-nd/4.0/) , which allows reusers to copy and distribute the material in any medium or format in unadapted form only, and only so long as attribution is given to the creator. The license allows for commercial use. |
spellingShingle | Article Hill, Christopher Hudaiberdiev, Sanjarbek Ovcharenko, Ivan ChromDL: A Next-Generation Regulatory DNA Classifier |
title | ChromDL: A Next-Generation Regulatory DNA Classifier |
title_full | ChromDL: A Next-Generation Regulatory DNA Classifier |
title_fullStr | ChromDL: A Next-Generation Regulatory DNA Classifier |
title_full_unstemmed | ChromDL: A Next-Generation Regulatory DNA Classifier |
title_short | ChromDL: A Next-Generation Regulatory DNA Classifier |
title_sort | chromdl: a next-generation regulatory dna classifier |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9928050/ https://www.ncbi.nlm.nih.gov/pubmed/36789431 http://dx.doi.org/10.1101/2023.01.27.525971 |
work_keys_str_mv | AT hillchristopher chromdlanextgenerationregulatorydnaclassifier AT hudaiberdievsanjarbek chromdlanextgenerationregulatorydnaclassifier AT ovcharenkoivan chromdlanextgenerationregulatorydnaclassifier |