Cargando…

SalMotifDB: a tool for analyzing putative transcription factor binding sites in salmonid genomes

BACKGROUND: Recently developed genome resources in Salmonid fish provides tools for studying the genomics underlying a wide range of properties including life history trait variation in the wild, economically important traits in aquaculture and the evolutionary consequences of whole genome duplicati...

Descripción completa

Detalles Bibliográficos
Autores principales: Mulugeta, Teshome Dagne, Nome, Torfinn, To, Thu-Hien, Gundappa, Manu Kumar, Macqueen, Daniel J., Våge, Dag Inge, Sandve, Simen Rød, Hvidsten, Torgeir R.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6720087/
https://www.ncbi.nlm.nih.gov/pubmed/31477007
http://dx.doi.org/10.1186/s12864-019-6051-0
_version_ 1783448049248370688
author Mulugeta, Teshome Dagne
Nome, Torfinn
To, Thu-Hien
Gundappa, Manu Kumar
Macqueen, Daniel J.
Våge, Dag Inge
Sandve, Simen Rød
Hvidsten, Torgeir R.
author_facet Mulugeta, Teshome Dagne
Nome, Torfinn
To, Thu-Hien
Gundappa, Manu Kumar
Macqueen, Daniel J.
Våge, Dag Inge
Sandve, Simen Rød
Hvidsten, Torgeir R.
author_sort Mulugeta, Teshome Dagne
collection PubMed
description BACKGROUND: Recently developed genome resources in Salmonid fish provides tools for studying the genomics underlying a wide range of properties including life history trait variation in the wild, economically important traits in aquaculture and the evolutionary consequences of whole genome duplications. Although genome assemblies now exist for a number of salmonid species, the lack of regulatory annotations are holding back our mechanistic understanding of how genetic variation in non-coding regulatory regions affect gene expression and the downstream phenotypic effects. RESULTS: We present SalMotifDB, a database and associated web and R interface for the analysis of transcription factors (TFs) and their cis-regulatory binding sites in five salmonid genomes. SalMotifDB integrates TF-binding site information for 3072 non-redundant DNA patterns (motifs) assembled from a large number of metazoan motif databases. Through motif matching and TF prediction, we have used these multi-species databases to construct putative regulatory networks in salmonid species. The utility of SalMotifDB is demonstrated by showing that key lipid metabolism regulators are predicted to regulate a set of genes affected by different lipid and fatty acid content in the feed, and by showing that our motif database explains a significant proportion of gene expression divergence in gene duplicates originating from the salmonid specific whole genome duplication. CONCLUSIONS: SalMotifDB is an effective tool for analyzing transcription factors, their binding sites and the resulting gene regulatory networks in salmonid species, and will be an important tool for gaining a better mechanistic understanding of gene regulation and the associated phenotypes in salmonids. SalMotifDB is available at https://salmobase.org/apps/SalMotifDB. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12864-019-6051-0) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-6720087
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-67200872019-09-06 SalMotifDB: a tool for analyzing putative transcription factor binding sites in salmonid genomes Mulugeta, Teshome Dagne Nome, Torfinn To, Thu-Hien Gundappa, Manu Kumar Macqueen, Daniel J. Våge, Dag Inge Sandve, Simen Rød Hvidsten, Torgeir R. BMC Genomics Database BACKGROUND: Recently developed genome resources in Salmonid fish provides tools for studying the genomics underlying a wide range of properties including life history trait variation in the wild, economically important traits in aquaculture and the evolutionary consequences of whole genome duplications. Although genome assemblies now exist for a number of salmonid species, the lack of regulatory annotations are holding back our mechanistic understanding of how genetic variation in non-coding regulatory regions affect gene expression and the downstream phenotypic effects. RESULTS: We present SalMotifDB, a database and associated web and R interface for the analysis of transcription factors (TFs) and their cis-regulatory binding sites in five salmonid genomes. SalMotifDB integrates TF-binding site information for 3072 non-redundant DNA patterns (motifs) assembled from a large number of metazoan motif databases. Through motif matching and TF prediction, we have used these multi-species databases to construct putative regulatory networks in salmonid species. The utility of SalMotifDB is demonstrated by showing that key lipid metabolism regulators are predicted to regulate a set of genes affected by different lipid and fatty acid content in the feed, and by showing that our motif database explains a significant proportion of gene expression divergence in gene duplicates originating from the salmonid specific whole genome duplication. CONCLUSIONS: SalMotifDB is an effective tool for analyzing transcription factors, their binding sites and the resulting gene regulatory networks in salmonid species, and will be an important tool for gaining a better mechanistic understanding of gene regulation and the associated phenotypes in salmonids. SalMotifDB is available at https://salmobase.org/apps/SalMotifDB. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12864-019-6051-0) contains supplementary material, which is available to authorized users. BioMed Central 2019-09-02 /pmc/articles/PMC6720087/ /pubmed/31477007 http://dx.doi.org/10.1186/s12864-019-6051-0 Text en © The Author(s). 2019 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Database
Mulugeta, Teshome Dagne
Nome, Torfinn
To, Thu-Hien
Gundappa, Manu Kumar
Macqueen, Daniel J.
Våge, Dag Inge
Sandve, Simen Rød
Hvidsten, Torgeir R.
SalMotifDB: a tool for analyzing putative transcription factor binding sites in salmonid genomes
title SalMotifDB: a tool for analyzing putative transcription factor binding sites in salmonid genomes
title_full SalMotifDB: a tool for analyzing putative transcription factor binding sites in salmonid genomes
title_fullStr SalMotifDB: a tool for analyzing putative transcription factor binding sites in salmonid genomes
title_full_unstemmed SalMotifDB: a tool for analyzing putative transcription factor binding sites in salmonid genomes
title_short SalMotifDB: a tool for analyzing putative transcription factor binding sites in salmonid genomes
title_sort salmotifdb: a tool for analyzing putative transcription factor binding sites in salmonid genomes
topic Database
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6720087/
https://www.ncbi.nlm.nih.gov/pubmed/31477007
http://dx.doi.org/10.1186/s12864-019-6051-0
work_keys_str_mv AT mulugetateshomedagne salmotifdbatoolforanalyzingputativetranscriptionfactorbindingsitesinsalmonidgenomes
AT nometorfinn salmotifdbatoolforanalyzingputativetranscriptionfactorbindingsitesinsalmonidgenomes
AT tothuhien salmotifdbatoolforanalyzingputativetranscriptionfactorbindingsitesinsalmonidgenomes
AT gundappamanukumar salmotifdbatoolforanalyzingputativetranscriptionfactorbindingsitesinsalmonidgenomes
AT macqueendanielj salmotifdbatoolforanalyzingputativetranscriptionfactorbindingsitesinsalmonidgenomes
AT vagedaginge salmotifdbatoolforanalyzingputativetranscriptionfactorbindingsitesinsalmonidgenomes
AT sandvesimenrød salmotifdbatoolforanalyzingputativetranscriptionfactorbindingsitesinsalmonidgenomes
AT hvidstentorgeirr salmotifdbatoolforanalyzingputativetranscriptionfactorbindingsitesinsalmonidgenomes