Cargando…
DeepASDPred: a CNN-LSTM-based deep learning method for Autism spectrum disorders risk RNA identification
BACKGROUND: Autism spectrum disorders (ASD) are a group of neurodevelopmental disorders characterized by difficulty communicating with society and others, behavioral difficulties, and a brain that processes information differently than normal. Genetics has a strong impact on ASD associated with earl...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10286466/ https://www.ncbi.nlm.nih.gov/pubmed/37349705 http://dx.doi.org/10.1186/s12859-023-05378-x |
_version_ | 1785061755912716288 |
---|---|
author | Fan, Yongxian Xiong, Hui Sun, Guicong |
author_facet | Fan, Yongxian Xiong, Hui Sun, Guicong |
author_sort | Fan, Yongxian |
collection | PubMed |
description | BACKGROUND: Autism spectrum disorders (ASD) are a group of neurodevelopmental disorders characterized by difficulty communicating with society and others, behavioral difficulties, and a brain that processes information differently than normal. Genetics has a strong impact on ASD associated with early onset and distinctive signs. Currently, all known ASD risk genes are able to encode proteins, and some de novo mutations disrupting protein-coding genes have been demonstrated to cause ASD. Next-generation sequencing technology enables high-throughput identification of ASD risk RNAs. However, these efforts are time-consuming and expensive, so an efficient computational model for ASD risk gene prediction is necessary. RESULTS: In this study, we propose DeepASDPerd, a predictor for ASD risk RNA based on deep learning. Firstly, we use K-mer to feature encode the RNA transcript sequences, and then fuse them with corresponding gene expression values to construct a feature matrix. After combining chi-square test and logistic regression to select the best feature subset, we input them into a binary classification prediction model constructed by convolutional neural network and long short-term memory for training and classification. The results of the tenfold cross-validation proved our method outperformed the state-of-the-art methods. Dataset and source code are available at https://github.com/Onebear-X/DeepASDPred is freely available. CONCLUSIONS: Our experimental results show that DeepASDPred has outstanding performance in identifying ASD risk RNA genes. |
format | Online Article Text |
id | pubmed-10286466 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-102864662023-06-23 DeepASDPred: a CNN-LSTM-based deep learning method for Autism spectrum disorders risk RNA identification Fan, Yongxian Xiong, Hui Sun, Guicong BMC Bioinformatics Research BACKGROUND: Autism spectrum disorders (ASD) are a group of neurodevelopmental disorders characterized by difficulty communicating with society and others, behavioral difficulties, and a brain that processes information differently than normal. Genetics has a strong impact on ASD associated with early onset and distinctive signs. Currently, all known ASD risk genes are able to encode proteins, and some de novo mutations disrupting protein-coding genes have been demonstrated to cause ASD. Next-generation sequencing technology enables high-throughput identification of ASD risk RNAs. However, these efforts are time-consuming and expensive, so an efficient computational model for ASD risk gene prediction is necessary. RESULTS: In this study, we propose DeepASDPerd, a predictor for ASD risk RNA based on deep learning. Firstly, we use K-mer to feature encode the RNA transcript sequences, and then fuse them with corresponding gene expression values to construct a feature matrix. After combining chi-square test and logistic regression to select the best feature subset, we input them into a binary classification prediction model constructed by convolutional neural network and long short-term memory for training and classification. The results of the tenfold cross-validation proved our method outperformed the state-of-the-art methods. Dataset and source code are available at https://github.com/Onebear-X/DeepASDPred is freely available. CONCLUSIONS: Our experimental results show that DeepASDPred has outstanding performance in identifying ASD risk RNA genes. BioMed Central 2023-06-22 /pmc/articles/PMC10286466/ /pubmed/37349705 http://dx.doi.org/10.1186/s12859-023-05378-x Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Fan, Yongxian Xiong, Hui Sun, Guicong DeepASDPred: a CNN-LSTM-based deep learning method for Autism spectrum disorders risk RNA identification |
title | DeepASDPred: a CNN-LSTM-based deep learning method for Autism spectrum disorders risk RNA identification |
title_full | DeepASDPred: a CNN-LSTM-based deep learning method for Autism spectrum disorders risk RNA identification |
title_fullStr | DeepASDPred: a CNN-LSTM-based deep learning method for Autism spectrum disorders risk RNA identification |
title_full_unstemmed | DeepASDPred: a CNN-LSTM-based deep learning method for Autism spectrum disorders risk RNA identification |
title_short | DeepASDPred: a CNN-LSTM-based deep learning method for Autism spectrum disorders risk RNA identification |
title_sort | deepasdpred: a cnn-lstm-based deep learning method for autism spectrum disorders risk rna identification |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10286466/ https://www.ncbi.nlm.nih.gov/pubmed/37349705 http://dx.doi.org/10.1186/s12859-023-05378-x |
work_keys_str_mv | AT fanyongxian deepasdpredacnnlstmbaseddeeplearningmethodforautismspectrumdisordersriskrnaidentification AT xionghui deepasdpredacnnlstmbaseddeeplearningmethodforautismspectrumdisordersriskrnaidentification AT sunguicong deepasdpredacnnlstmbaseddeeplearningmethodforautismspectrumdisordersriskrnaidentification |