Cargando…

MMGraph: a multiple motif predictor based on graph neural network and coexisting probability for ATAC-seq data

MOTIVATION: Transcription factor binding sites (TFBSs) prediction is a crucial step in revealing functions of transcription factors from high-throughput sequencing data. Assay for Transposase-Accessible Chromatin using sequencing (ATAC-seq) provides insight on TFBSs and nucleosome positioning by pro...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Shuangquan, Yang, Lili, Wu, Xiaotian, Sheng, Nan, Fu, Yuan, Ma, Anjun, Wang, Yan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9524997/
https://www.ncbi.nlm.nih.gov/pubmed/35997564
http://dx.doi.org/10.1093/bioinformatics/btac572
_version_ 1784800612695670784
author Zhang, Shuangquan
Yang, Lili
Wu, Xiaotian
Sheng, Nan
Fu, Yuan
Ma, Anjun
Wang, Yan
author_facet Zhang, Shuangquan
Yang, Lili
Wu, Xiaotian
Sheng, Nan
Fu, Yuan
Ma, Anjun
Wang, Yan
author_sort Zhang, Shuangquan
collection PubMed
description MOTIVATION: Transcription factor binding sites (TFBSs) prediction is a crucial step in revealing functions of transcription factors from high-throughput sequencing data. Assay for Transposase-Accessible Chromatin using sequencing (ATAC-seq) provides insight on TFBSs and nucleosome positioning by probing open chromatic, which can simultaneously reveal multiple TFBSs compare to traditional technologies. The existing tools based on convolutional neural network (CNN) only find the fixed length of TFBSs from ATAC-seq data. Graph neural network (GNN) can be considered as the extension of CNN, which has great potential in finding multiple TFBSs with different lengths from ATAC-seq data. RESULTS: We develop a motif predictor called MMGraph based on three-layer GNN and coexisting probability of k-mers for finding multiple motifs from ATAC-seq data. The results of the experiment which has been conducted on 88 ATAC-seq datasets indicate that MMGraph has achieved the best performance on area of eight metrics radar score of 2.31 and could find 207 higher-quality multiple motifs than other existing tools. AVAILABILITY AND IMPLEMENTATION: MMGraph is wrapped in Python package, which is available at https://github.com/zhangsq06/MMGraph.git SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-9524997
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-95249972022-10-03 MMGraph: a multiple motif predictor based on graph neural network and coexisting probability for ATAC-seq data Zhang, Shuangquan Yang, Lili Wu, Xiaotian Sheng, Nan Fu, Yuan Ma, Anjun Wang, Yan Bioinformatics Applications Notes MOTIVATION: Transcription factor binding sites (TFBSs) prediction is a crucial step in revealing functions of transcription factors from high-throughput sequencing data. Assay for Transposase-Accessible Chromatin using sequencing (ATAC-seq) provides insight on TFBSs and nucleosome positioning by probing open chromatic, which can simultaneously reveal multiple TFBSs compare to traditional technologies. The existing tools based on convolutional neural network (CNN) only find the fixed length of TFBSs from ATAC-seq data. Graph neural network (GNN) can be considered as the extension of CNN, which has great potential in finding multiple TFBSs with different lengths from ATAC-seq data. RESULTS: We develop a motif predictor called MMGraph based on three-layer GNN and coexisting probability of k-mers for finding multiple motifs from ATAC-seq data. The results of the experiment which has been conducted on 88 ATAC-seq datasets indicate that MMGraph has achieved the best performance on area of eight metrics radar score of 2.31 and could find 207 higher-quality multiple motifs than other existing tools. AVAILABILITY AND IMPLEMENTATION: MMGraph is wrapped in Python package, which is available at https://github.com/zhangsq06/MMGraph.git SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2022-08-23 /pmc/articles/PMC9524997/ /pubmed/35997564 http://dx.doi.org/10.1093/bioinformatics/btac572 Text en © The Author(s) 2022. Published by Oxford University Press. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Applications Notes
Zhang, Shuangquan
Yang, Lili
Wu, Xiaotian
Sheng, Nan
Fu, Yuan
Ma, Anjun
Wang, Yan
MMGraph: a multiple motif predictor based on graph neural network and coexisting probability for ATAC-seq data
title MMGraph: a multiple motif predictor based on graph neural network and coexisting probability for ATAC-seq data
title_full MMGraph: a multiple motif predictor based on graph neural network and coexisting probability for ATAC-seq data
title_fullStr MMGraph: a multiple motif predictor based on graph neural network and coexisting probability for ATAC-seq data
title_full_unstemmed MMGraph: a multiple motif predictor based on graph neural network and coexisting probability for ATAC-seq data
title_short MMGraph: a multiple motif predictor based on graph neural network and coexisting probability for ATAC-seq data
title_sort mmgraph: a multiple motif predictor based on graph neural network and coexisting probability for atac-seq data
topic Applications Notes
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9524997/
https://www.ncbi.nlm.nih.gov/pubmed/35997564
http://dx.doi.org/10.1093/bioinformatics/btac572
work_keys_str_mv AT zhangshuangquan mmgraphamultiplemotifpredictorbasedongraphneuralnetworkandcoexistingprobabilityforatacseqdata
AT yanglili mmgraphamultiplemotifpredictorbasedongraphneuralnetworkandcoexistingprobabilityforatacseqdata
AT wuxiaotian mmgraphamultiplemotifpredictorbasedongraphneuralnetworkandcoexistingprobabilityforatacseqdata
AT shengnan mmgraphamultiplemotifpredictorbasedongraphneuralnetworkandcoexistingprobabilityforatacseqdata
AT fuyuan mmgraphamultiplemotifpredictorbasedongraphneuralnetworkandcoexistingprobabilityforatacseqdata
AT maanjun mmgraphamultiplemotifpredictorbasedongraphneuralnetworkandcoexistingprobabilityforatacseqdata
AT wangyan mmgraphamultiplemotifpredictorbasedongraphneuralnetworkandcoexistingprobabilityforatacseqdata