Cargando…

DeepASDPred: a CNN-LSTM-based deep learning method for Autism spectrum disorders risk RNA identification

BACKGROUND: Autism spectrum disorders (ASD) are a group of neurodevelopmental disorders characterized by difficulty communicating with society and others, behavioral difficulties, and a brain that processes information differently than normal. Genetics has a strong impact on ASD associated with earl...

Descripción completa

Detalles Bibliográficos
Autores principales: Fan, Yongxian, Xiong, Hui, Sun, Guicong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10286466/
https://www.ncbi.nlm.nih.gov/pubmed/37349705
http://dx.doi.org/10.1186/s12859-023-05378-x
_version_ 1785061755912716288
author Fan, Yongxian
Xiong, Hui
Sun, Guicong
author_facet Fan, Yongxian
Xiong, Hui
Sun, Guicong
author_sort Fan, Yongxian
collection PubMed
description BACKGROUND: Autism spectrum disorders (ASD) are a group of neurodevelopmental disorders characterized by difficulty communicating with society and others, behavioral difficulties, and a brain that processes information differently than normal. Genetics has a strong impact on ASD associated with early onset and distinctive signs. Currently, all known ASD risk genes are able to encode proteins, and some de novo mutations disrupting protein-coding genes have been demonstrated to cause ASD. Next-generation sequencing technology enables high-throughput identification of ASD risk RNAs. However, these efforts are time-consuming and expensive, so an efficient computational model for ASD risk gene prediction is necessary. RESULTS: In this study, we propose DeepASDPerd, a predictor for ASD risk RNA based on deep learning. Firstly, we use K-mer to feature encode the RNA transcript sequences, and then fuse them with corresponding gene expression values to construct a feature matrix. After combining chi-square test and logistic regression to select the best feature subset, we input them into a binary classification prediction model constructed by convolutional neural network and long short-term memory for training and classification. The results of the tenfold cross-validation proved our method outperformed the state-of-the-art methods. Dataset and source code are available at https://github.com/Onebear-X/DeepASDPred is freely available. CONCLUSIONS: Our experimental results show that DeepASDPred has outstanding performance in identifying ASD risk RNA genes.
format Online
Article
Text
id pubmed-10286466
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-102864662023-06-23 DeepASDPred: a CNN-LSTM-based deep learning method for Autism spectrum disorders risk RNA identification Fan, Yongxian Xiong, Hui Sun, Guicong BMC Bioinformatics Research BACKGROUND: Autism spectrum disorders (ASD) are a group of neurodevelopmental disorders characterized by difficulty communicating with society and others, behavioral difficulties, and a brain that processes information differently than normal. Genetics has a strong impact on ASD associated with early onset and distinctive signs. Currently, all known ASD risk genes are able to encode proteins, and some de novo mutations disrupting protein-coding genes have been demonstrated to cause ASD. Next-generation sequencing technology enables high-throughput identification of ASD risk RNAs. However, these efforts are time-consuming and expensive, so an efficient computational model for ASD risk gene prediction is necessary. RESULTS: In this study, we propose DeepASDPerd, a predictor for ASD risk RNA based on deep learning. Firstly, we use K-mer to feature encode the RNA transcript sequences, and then fuse them with corresponding gene expression values to construct a feature matrix. After combining chi-square test and logistic regression to select the best feature subset, we input them into a binary classification prediction model constructed by convolutional neural network and long short-term memory for training and classification. The results of the tenfold cross-validation proved our method outperformed the state-of-the-art methods. Dataset and source code are available at https://github.com/Onebear-X/DeepASDPred is freely available. CONCLUSIONS: Our experimental results show that DeepASDPred has outstanding performance in identifying ASD risk RNA genes. BioMed Central 2023-06-22 /pmc/articles/PMC10286466/ /pubmed/37349705 http://dx.doi.org/10.1186/s12859-023-05378-x Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Fan, Yongxian
Xiong, Hui
Sun, Guicong
DeepASDPred: a CNN-LSTM-based deep learning method for Autism spectrum disorders risk RNA identification
title DeepASDPred: a CNN-LSTM-based deep learning method for Autism spectrum disorders risk RNA identification
title_full DeepASDPred: a CNN-LSTM-based deep learning method for Autism spectrum disorders risk RNA identification
title_fullStr DeepASDPred: a CNN-LSTM-based deep learning method for Autism spectrum disorders risk RNA identification
title_full_unstemmed DeepASDPred: a CNN-LSTM-based deep learning method for Autism spectrum disorders risk RNA identification
title_short DeepASDPred: a CNN-LSTM-based deep learning method for Autism spectrum disorders risk RNA identification
title_sort deepasdpred: a cnn-lstm-based deep learning method for autism spectrum disorders risk rna identification
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10286466/
https://www.ncbi.nlm.nih.gov/pubmed/37349705
http://dx.doi.org/10.1186/s12859-023-05378-x
work_keys_str_mv AT fanyongxian deepasdpredacnnlstmbaseddeeplearningmethodforautismspectrumdisordersriskrnaidentification
AT xionghui deepasdpredacnnlstmbaseddeeplearningmethodforautismspectrumdisordersriskrnaidentification
AT sunguicong deepasdpredacnnlstmbaseddeeplearningmethodforautismspectrumdisordersriskrnaidentification