Cargando…

NCNet: Deep Learning Network Models for Predicting Function of Non-coding DNA

The human genome consists of 98.5% non-coding DNA sequences, and most of them have no known function. However, a majority of disease-associated variants lie in these regions. Therefore, it is critical to predict the function of non-coding DNA. Hence, we propose the NCNet, which integrates deep resid...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Hanyu, Hung, Che-Lun, Liu, Meiyuan, Hu, Xiaoye, Lin, Yi-Yang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6549219/
https://www.ncbi.nlm.nih.gov/pubmed/31191597
http://dx.doi.org/10.3389/fgene.2019.00432
_version_ 1783423962174193664
author Zhang, Hanyu
Hung, Che-Lun
Liu, Meiyuan
Hu, Xiaoye
Lin, Yi-Yang
author_facet Zhang, Hanyu
Hung, Che-Lun
Liu, Meiyuan
Hu, Xiaoye
Lin, Yi-Yang
author_sort Zhang, Hanyu
collection PubMed
description The human genome consists of 98.5% non-coding DNA sequences, and most of them have no known function. However, a majority of disease-associated variants lie in these regions. Therefore, it is critical to predict the function of non-coding DNA. Hence, we propose the NCNet, which integrates deep residual learning and sequence-to-sequence learning networks, to predict the transcription factor (TF) binding sites, which can then be used to predict non-coding functions. In NCNet, deep residual learning networks are used to enhance the identification rate of regulatory patterns of motifs, so that the sequence-to-sequence learning network may make the most out of the sequential dependency between the patterns. With the identity shortcut technique and deep architectures of the networks, NCNet achieves significant improvement compared to the original hybrid model in identifying regulatory markers.
format Online
Article
Text
id pubmed-6549219
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-65492192019-06-12 NCNet: Deep Learning Network Models for Predicting Function of Non-coding DNA Zhang, Hanyu Hung, Che-Lun Liu, Meiyuan Hu, Xiaoye Lin, Yi-Yang Front Genet Genetics The human genome consists of 98.5% non-coding DNA sequences, and most of them have no known function. However, a majority of disease-associated variants lie in these regions. Therefore, it is critical to predict the function of non-coding DNA. Hence, we propose the NCNet, which integrates deep residual learning and sequence-to-sequence learning networks, to predict the transcription factor (TF) binding sites, which can then be used to predict non-coding functions. In NCNet, deep residual learning networks are used to enhance the identification rate of regulatory patterns of motifs, so that the sequence-to-sequence learning network may make the most out of the sequential dependency between the patterns. With the identity shortcut technique and deep architectures of the networks, NCNet achieves significant improvement compared to the original hybrid model in identifying regulatory markers. Frontiers Media S.A. 2019-05-29 /pmc/articles/PMC6549219/ /pubmed/31191597 http://dx.doi.org/10.3389/fgene.2019.00432 Text en Copyright © 2019 Zhang, Hung, Liu, Hu and Lin. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Zhang, Hanyu
Hung, Che-Lun
Liu, Meiyuan
Hu, Xiaoye
Lin, Yi-Yang
NCNet: Deep Learning Network Models for Predicting Function of Non-coding DNA
title NCNet: Deep Learning Network Models for Predicting Function of Non-coding DNA
title_full NCNet: Deep Learning Network Models for Predicting Function of Non-coding DNA
title_fullStr NCNet: Deep Learning Network Models for Predicting Function of Non-coding DNA
title_full_unstemmed NCNet: Deep Learning Network Models for Predicting Function of Non-coding DNA
title_short NCNet: Deep Learning Network Models for Predicting Function of Non-coding DNA
title_sort ncnet: deep learning network models for predicting function of non-coding dna
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6549219/
https://www.ncbi.nlm.nih.gov/pubmed/31191597
http://dx.doi.org/10.3389/fgene.2019.00432
work_keys_str_mv AT zhanghanyu ncnetdeeplearningnetworkmodelsforpredictingfunctionofnoncodingdna
AT hungchelun ncnetdeeplearningnetworkmodelsforpredictingfunctionofnoncodingdna
AT liumeiyuan ncnetdeeplearningnetworkmodelsforpredictingfunctionofnoncodingdna
AT huxiaoye ncnetdeeplearningnetworkmodelsforpredictingfunctionofnoncodingdna
AT linyiyang ncnetdeeplearningnetworkmodelsforpredictingfunctionofnoncodingdna