Cargando…

CoSREM: a graph mining algorithm for the discovery of combinatorial splicing regulatory elements

BACKGROUND: Alternative splicing (AS) is a post-transcriptional regulatory mechanism for gene expression regulation. Splicing decisions are affected by the combinatorial behavior of different splicing factors that bind to multiple binding sites in exons and introns. These binding sites are called sp...

Descripción completa

Detalles Bibliográficos
Autores principales: Badr, Eman, Heath, Lenwood S.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4559876/
https://www.ncbi.nlm.nih.gov/pubmed/26337677
http://dx.doi.org/10.1186/s12859-015-0698-6
_version_ 1782388847490891776
author Badr, Eman
Heath, Lenwood S.
author_facet Badr, Eman
Heath, Lenwood S.
author_sort Badr, Eman
collection PubMed
description BACKGROUND: Alternative splicing (AS) is a post-transcriptional regulatory mechanism for gene expression regulation. Splicing decisions are affected by the combinatorial behavior of different splicing factors that bind to multiple binding sites in exons and introns. These binding sites are called splicing regulatory elements (SREs). Here we develop CoSREM (Combinatorial SRE Miner), a graph mining algorithm to discover combinatorial SREs in human exons. Our model does not assume a fixed length of SREs and incorporates experimental evidence as well to increase accuracy. CoSREM is able to identify sets of SREs and is not limited to SRE pairs as are current approaches. RESULTS: We identified 37 SRE sets that include both enhancer and silencer elements. We show that our results intersect with previous results, including some that are experimental. We also show that the SRE set GGGAGG and GAGGAC identified by CoSREM may play a role in exon skipping events in several tumor samples. We applied CoSREM to RNA-Seq data for multiple tissues to identify combinatorial SREs which may be responsible for exon inclusion or exclusion across tissues. CONCLUSION: The new algorithm can identify different combinations of splicing enhancers and silencers without assuming a predefined size or limiting the algorithm to find only pairs of SREs. Our approach opens new directions to study SREs and the roles that AS may play in diseases and tissue specificity. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-015-0698-6) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4559876
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-45598762015-09-05 CoSREM: a graph mining algorithm for the discovery of combinatorial splicing regulatory elements Badr, Eman Heath, Lenwood S. BMC Bioinformatics Research BACKGROUND: Alternative splicing (AS) is a post-transcriptional regulatory mechanism for gene expression regulation. Splicing decisions are affected by the combinatorial behavior of different splicing factors that bind to multiple binding sites in exons and introns. These binding sites are called splicing regulatory elements (SREs). Here we develop CoSREM (Combinatorial SRE Miner), a graph mining algorithm to discover combinatorial SREs in human exons. Our model does not assume a fixed length of SREs and incorporates experimental evidence as well to increase accuracy. CoSREM is able to identify sets of SREs and is not limited to SRE pairs as are current approaches. RESULTS: We identified 37 SRE sets that include both enhancer and silencer elements. We show that our results intersect with previous results, including some that are experimental. We also show that the SRE set GGGAGG and GAGGAC identified by CoSREM may play a role in exon skipping events in several tumor samples. We applied CoSREM to RNA-Seq data for multiple tissues to identify combinatorial SREs which may be responsible for exon inclusion or exclusion across tissues. CONCLUSION: The new algorithm can identify different combinations of splicing enhancers and silencers without assuming a predefined size or limiting the algorithm to find only pairs of SREs. Our approach opens new directions to study SREs and the roles that AS may play in diseases and tissue specificity. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-015-0698-6) contains supplementary material, which is available to authorized users. BioMed Central 2015-09-04 /pmc/articles/PMC4559876/ /pubmed/26337677 http://dx.doi.org/10.1186/s12859-015-0698-6 Text en © Badr and Heath. 2015 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Badr, Eman
Heath, Lenwood S.
CoSREM: a graph mining algorithm for the discovery of combinatorial splicing regulatory elements
title CoSREM: a graph mining algorithm for the discovery of combinatorial splicing regulatory elements
title_full CoSREM: a graph mining algorithm for the discovery of combinatorial splicing regulatory elements
title_fullStr CoSREM: a graph mining algorithm for the discovery of combinatorial splicing regulatory elements
title_full_unstemmed CoSREM: a graph mining algorithm for the discovery of combinatorial splicing regulatory elements
title_short CoSREM: a graph mining algorithm for the discovery of combinatorial splicing regulatory elements
title_sort cosrem: a graph mining algorithm for the discovery of combinatorial splicing regulatory elements
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4559876/
https://www.ncbi.nlm.nih.gov/pubmed/26337677
http://dx.doi.org/10.1186/s12859-015-0698-6
work_keys_str_mv AT badreman cosremagraphminingalgorithmforthediscoveryofcombinatorialsplicingregulatoryelements
AT heathlenwoods cosremagraphminingalgorithmforthediscoveryofcombinatorialsplicingregulatoryelements