Cargando…

YAMDA: thousandfold speedup of EM-based motif discovery using deep learning libraries and GPU

MOTIVATION: Motif discovery in large biopolymer sequence datasets can be computationally demanding, presenting significant challenges for discovery in omics research. MEME, arguably one of the most popular motif discovery software, takes quadratic time with respect to dataset size, leading to excess...

Descripción completa

Detalles Bibliográficos
Autores principales:	Quang, Daniel, Guan, Yuanfang, Parker, Stephen C J
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Oxford University Press 2018
Materias:	Applications Notes
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6184538/ https://www.ncbi.nlm.nih.gov/pubmed/29790915 http://dx.doi.org/10.1093/bioinformatics/bty396

_version_	1783362730242080768
author	Quang, Daniel Guan, Yuanfang Parker, Stephen C J
author_facet	Quang, Daniel Guan, Yuanfang Parker, Stephen C J
author_sort	Quang, Daniel
collection	PubMed
description	MOTIVATION: Motif discovery in large biopolymer sequence datasets can be computationally demanding, presenting significant challenges for discovery in omics research. MEME, arguably one of the most popular motif discovery software, takes quadratic time with respect to dataset size, leading to excessively long runtimes for large datasets. Therefore, there is a demand for fast programs that can generate results of the same quality as MEME. RESULTS: Here we describe YAMDA, a highly scalable motif discovery software package. It is built on Pytorch, a tensor computation deep learning library with strong GPU acceleration that is highly optimized for tensor operations that are also useful for motifs. YAMDA takes linear time to find motifs as accurately as MEME, completing in seconds or minutes, which translates to speedups over a thousandfold. AVAILABILITY AND IMPLEMENTATION: YAMDA is freely available on Github (https://github.com/daquang/YAMDA). SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
format	Online Article Text
id	pubmed-6184538
institution	National Center for Biotechnology Information
language	English
publishDate	2018
publisher	Oxford University Press
record_format	MEDLINE/PubMed
spelling	pubmed-61845382018-10-18 YAMDA: thousandfold speedup of EM-based motif discovery using deep learning libraries and GPU Quang, Daniel Guan, Yuanfang Parker, Stephen C J Bioinformatics Applications Notes MOTIVATION: Motif discovery in large biopolymer sequence datasets can be computationally demanding, presenting significant challenges for discovery in omics research. MEME, arguably one of the most popular motif discovery software, takes quadratic time with respect to dataset size, leading to excessively long runtimes for large datasets. Therefore, there is a demand for fast programs that can generate results of the same quality as MEME. RESULTS: Here we describe YAMDA, a highly scalable motif discovery software package. It is built on Pytorch, a tensor computation deep learning library with strong GPU acceleration that is highly optimized for tensor operations that are also useful for motifs. YAMDA takes linear time to find motifs as accurately as MEME, completing in seconds or minutes, which translates to speedups over a thousandfold. AVAILABILITY AND IMPLEMENTATION: YAMDA is freely available on Github (https://github.com/daquang/YAMDA). SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2018-10-15 2018-05-22 /pmc/articles/PMC6184538/ /pubmed/29790915 http://dx.doi.org/10.1093/bioinformatics/bty396 Text en © The Author(s) 2018. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Applications Notes Quang, Daniel Guan, Yuanfang Parker, Stephen C J YAMDA: thousandfold speedup of EM-based motif discovery using deep learning libraries and GPU
title	YAMDA: thousandfold speedup of EM-based motif discovery using deep learning libraries and GPU
title_full	YAMDA: thousandfold speedup of EM-based motif discovery using deep learning libraries and GPU
title_fullStr	YAMDA: thousandfold speedup of EM-based motif discovery using deep learning libraries and GPU
title_full_unstemmed	YAMDA: thousandfold speedup of EM-based motif discovery using deep learning libraries and GPU
title_short	YAMDA: thousandfold speedup of EM-based motif discovery using deep learning libraries and GPU
title_sort	yamda: thousandfold speedup of em-based motif discovery using deep learning libraries and gpu
topic	Applications Notes
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6184538/ https://www.ncbi.nlm.nih.gov/pubmed/29790915 http://dx.doi.org/10.1093/bioinformatics/bty396
work_keys_str_mv	AT quangdaniel yamdathousandfoldspeedupofembasedmotifdiscoveryusingdeeplearninglibrariesandgpu AT guanyuanfang yamdathousandfoldspeedupofembasedmotifdiscoveryusingdeeplearninglibrariesandgpu AT parkerstephencj yamdathousandfoldspeedupofembasedmotifdiscoveryusingdeeplearninglibrariesandgpu

YAMDA: thousandfold speedup of EM-based motif discovery using deep learning libraries and GPU

Ejemplares similares