Cargando…

ExactSearch: a web-based plant motif search tool

BACKGROUND: Plant biologists frequently need to examine if a sequence motif bound by a specific transcription or translation factor is present in the proximal promoters or 3′ untranslated regions (3′ UTR) of a set of plant genes of interest. To achieve such a task, plant biologists have to not only...

Descripción completa

Detalles Bibliográficos
Autores principales: Gunasekara, Chathura, Subramanian, Avinash, Avvari, Janaki Venkata Ram Kumar, Li, Bin, Chen, Su, Wei, Hairong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4850730/
https://www.ncbi.nlm.nih.gov/pubmed/27134638
http://dx.doi.org/10.1186/s13007-016-0126-6
_version_ 1782429706625220608
author Gunasekara, Chathura
Subramanian, Avinash
Avvari, Janaki Venkata Ram Kumar
Li, Bin
Chen, Su
Wei, Hairong
author_facet Gunasekara, Chathura
Subramanian, Avinash
Avvari, Janaki Venkata Ram Kumar
Li, Bin
Chen, Su
Wei, Hairong
author_sort Gunasekara, Chathura
collection PubMed
description BACKGROUND: Plant biologists frequently need to examine if a sequence motif bound by a specific transcription or translation factor is present in the proximal promoters or 3′ untranslated regions (3′ UTR) of a set of plant genes of interest. To achieve such a task, plant biologists have to not only identify an appropriate algorithm for motif searching, but also manipulate the large volume of sequence data, making it burdensome to carry out or fulfill. RESULT: In this study, we developed a web portal that enables plant molecular biologists to search for DNA motifs especially degenerate ones in custom sequences or the flanking regions of all genes in the 50 plant species whose genomes have been sequenced. A web tool like this is demanded to meet a variety of needs of plant biologists for identifying the potential gene regulatory relationships. We implemented a suffix tree algorithm to accelerate the searching process of a group of motifs in a multitude of target genes. The motifs to be searched can be in the degenerate bases in addition to adenine (A), cytosine (C), guanine (G), and thymine (T). The target sequences to be searched can be custom sequences or the selected proximal gene sequences from any one of the 50 sequenced plant species. The web portal also contains the functionality to facilitate the search of motifs that are represented by position probability matrix in above-mentioned species. Currently, the algorithm can accomplish an exhaust search of 100 motifs in 35,000 target sequences of 2 kb long in 4.2 min. However, the runtime may change in the future depending on the space availability, number of running jobs, network traffic, data loading, and output packing and delivery through electronic mailing. CONCLUSION: A web portal was developed to facilitate searching of motifs presents in custom sequences or the proximal promoters or 3′ UTR of 50 plant species with the sequenced genomes. This web tool is accessible by using this URL: http://sys.bio.mtu.edu/motif/index.php.
format Online
Article
Text
id pubmed-4850730
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-48507302016-04-30 ExactSearch: a web-based plant motif search tool Gunasekara, Chathura Subramanian, Avinash Avvari, Janaki Venkata Ram Kumar Li, Bin Chen, Su Wei, Hairong Plant Methods Software BACKGROUND: Plant biologists frequently need to examine if a sequence motif bound by a specific transcription or translation factor is present in the proximal promoters or 3′ untranslated regions (3′ UTR) of a set of plant genes of interest. To achieve such a task, plant biologists have to not only identify an appropriate algorithm for motif searching, but also manipulate the large volume of sequence data, making it burdensome to carry out or fulfill. RESULT: In this study, we developed a web portal that enables plant molecular biologists to search for DNA motifs especially degenerate ones in custom sequences or the flanking regions of all genes in the 50 plant species whose genomes have been sequenced. A web tool like this is demanded to meet a variety of needs of plant biologists for identifying the potential gene regulatory relationships. We implemented a suffix tree algorithm to accelerate the searching process of a group of motifs in a multitude of target genes. The motifs to be searched can be in the degenerate bases in addition to adenine (A), cytosine (C), guanine (G), and thymine (T). The target sequences to be searched can be custom sequences or the selected proximal gene sequences from any one of the 50 sequenced plant species. The web portal also contains the functionality to facilitate the search of motifs that are represented by position probability matrix in above-mentioned species. Currently, the algorithm can accomplish an exhaust search of 100 motifs in 35,000 target sequences of 2 kb long in 4.2 min. However, the runtime may change in the future depending on the space availability, number of running jobs, network traffic, data loading, and output packing and delivery through electronic mailing. CONCLUSION: A web portal was developed to facilitate searching of motifs presents in custom sequences or the proximal promoters or 3′ UTR of 50 plant species with the sequenced genomes. This web tool is accessible by using this URL: http://sys.bio.mtu.edu/motif/index.php. BioMed Central 2016-04-28 /pmc/articles/PMC4850730/ /pubmed/27134638 http://dx.doi.org/10.1186/s13007-016-0126-6 Text en © Gunasekara et al. 2016 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Software
Gunasekara, Chathura
Subramanian, Avinash
Avvari, Janaki Venkata Ram Kumar
Li, Bin
Chen, Su
Wei, Hairong
ExactSearch: a web-based plant motif search tool
title ExactSearch: a web-based plant motif search tool
title_full ExactSearch: a web-based plant motif search tool
title_fullStr ExactSearch: a web-based plant motif search tool
title_full_unstemmed ExactSearch: a web-based plant motif search tool
title_short ExactSearch: a web-based plant motif search tool
title_sort exactsearch: a web-based plant motif search tool
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4850730/
https://www.ncbi.nlm.nih.gov/pubmed/27134638
http://dx.doi.org/10.1186/s13007-016-0126-6
work_keys_str_mv AT gunasekarachathura exactsearchawebbasedplantmotifsearchtool
AT subramanianavinash exactsearchawebbasedplantmotifsearchtool
AT avvarijanakivenkataramkumar exactsearchawebbasedplantmotifsearchtool
AT libin exactsearchawebbasedplantmotifsearchtool
AT chensu exactsearchawebbasedplantmotifsearchtool
AT weihairong exactsearchawebbasedplantmotifsearchtool