Cargando…

TSSFinder—fast and accurate ab initio prediction of the core promoter in eukaryotic genomes

Promoter annotation is an important task in the analysis of a genome. One of the main challenges for this task is locating the border between the promoter region and the transcribing region of the gene, the transcription start site (TSS). The TSS is the reference point to delimit the DNA sequence re...

Descripción completa

Detalles Bibliográficos
Autores principales: de Medeiros Oliveira, Mauro, Bonadio, Igor, Lie de Melo, Alicia, Mendes Souza, Glaucia, Durham, Alan Mitchell
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8574697/
https://www.ncbi.nlm.nih.gov/pubmed/34050351
http://dx.doi.org/10.1093/bib/bbab198
_version_ 1784595583198035968
author de Medeiros Oliveira, Mauro
Bonadio, Igor
Lie de Melo, Alicia
Mendes Souza, Glaucia
Durham, Alan Mitchell
author_facet de Medeiros Oliveira, Mauro
Bonadio, Igor
Lie de Melo, Alicia
Mendes Souza, Glaucia
Durham, Alan Mitchell
author_sort de Medeiros Oliveira, Mauro
collection PubMed
description Promoter annotation is an important task in the analysis of a genome. One of the main challenges for this task is locating the border between the promoter region and the transcribing region of the gene, the transcription start site (TSS). The TSS is the reference point to delimit the DNA sequence responsible for the assembly of the transcribing complex. As the same gene can have more than one TSS, so to delimit the promoter region, it is important to locate the closest TSS to the site of the beginning of the translation. This paper presents TSSFinder, a new software for the prediction of the TSS signal of eukaryotic genes that is significantly more accurate than other available software. We currently are the only application to offer pre-trained models for six different eukaryotic organisms: Arabidopsis thaliana, Drosophila melanogaster, Gallus gallus, Homo sapiens, Oryza sativa and Saccharomyces cerevisiae. Additionally, our software can be easily customized for specific organisms using only 125 DNA sequences with a validated TSS signal and corresponding genomic locations as a training set. TSSFinder is a valuable new tool for the annotation of genomes. TSSFinder source code and docker container can be downloaded from http://tssfinder.github.io. Alternatively, TSSFinder is also available as a web service at http://sucest-fun.org/wsapp/tssfinder/.
format Online
Article
Text
id pubmed-8574697
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-85746972021-11-09 TSSFinder—fast and accurate ab initio prediction of the core promoter in eukaryotic genomes de Medeiros Oliveira, Mauro Bonadio, Igor Lie de Melo, Alicia Mendes Souza, Glaucia Durham, Alan Mitchell Brief Bioinform Problem Solving Protocol Promoter annotation is an important task in the analysis of a genome. One of the main challenges for this task is locating the border between the promoter region and the transcribing region of the gene, the transcription start site (TSS). The TSS is the reference point to delimit the DNA sequence responsible for the assembly of the transcribing complex. As the same gene can have more than one TSS, so to delimit the promoter region, it is important to locate the closest TSS to the site of the beginning of the translation. This paper presents TSSFinder, a new software for the prediction of the TSS signal of eukaryotic genes that is significantly more accurate than other available software. We currently are the only application to offer pre-trained models for six different eukaryotic organisms: Arabidopsis thaliana, Drosophila melanogaster, Gallus gallus, Homo sapiens, Oryza sativa and Saccharomyces cerevisiae. Additionally, our software can be easily customized for specific organisms using only 125 DNA sequences with a validated TSS signal and corresponding genomic locations as a training set. TSSFinder is a valuable new tool for the annotation of genomes. TSSFinder source code and docker container can be downloaded from http://tssfinder.github.io. Alternatively, TSSFinder is also available as a web service at http://sucest-fun.org/wsapp/tssfinder/. Oxford University Press 2021-05-31 /pmc/articles/PMC8574697/ /pubmed/34050351 http://dx.doi.org/10.1093/bib/bbab198 Text en © The Author(s) 2021. Published by Oxford University Press. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Problem Solving Protocol
de Medeiros Oliveira, Mauro
Bonadio, Igor
Lie de Melo, Alicia
Mendes Souza, Glaucia
Durham, Alan Mitchell
TSSFinder—fast and accurate ab initio prediction of the core promoter in eukaryotic genomes
title TSSFinder—fast and accurate ab initio prediction of the core promoter in eukaryotic genomes
title_full TSSFinder—fast and accurate ab initio prediction of the core promoter in eukaryotic genomes
title_fullStr TSSFinder—fast and accurate ab initio prediction of the core promoter in eukaryotic genomes
title_full_unstemmed TSSFinder—fast and accurate ab initio prediction of the core promoter in eukaryotic genomes
title_short TSSFinder—fast and accurate ab initio prediction of the core promoter in eukaryotic genomes
title_sort tssfinder—fast and accurate ab initio prediction of the core promoter in eukaryotic genomes
topic Problem Solving Protocol
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8574697/
https://www.ncbi.nlm.nih.gov/pubmed/34050351
http://dx.doi.org/10.1093/bib/bbab198
work_keys_str_mv AT demedeirosoliveiramauro tssfinderfastandaccurateabinitiopredictionofthecorepromoterineukaryoticgenomes
AT bonadioigor tssfinderfastandaccurateabinitiopredictionofthecorepromoterineukaryoticgenomes
AT liedemeloalicia tssfinderfastandaccurateabinitiopredictionofthecorepromoterineukaryoticgenomes
AT mendessouzaglaucia tssfinderfastandaccurateabinitiopredictionofthecorepromoterineukaryoticgenomes
AT durhamalanmitchell tssfinderfastandaccurateabinitiopredictionofthecorepromoterineukaryoticgenomes