Cargando…

Conserved and divergent signals in 5’ splice site sequences across fungi, metazoa and plants

In eukaryotic organisms the ensemble of 5’ splice site sequences reflects the balance between natural nucleotide variability and minimal molecular constraints necessary to ensure splicing fidelity. This compromise shapes the underlying statistical patterns in the composition of donor splice site seq...

Descripción completa

Detalles Bibliográficos
Autores principales: Beckel, Maximiliano S., Kaufman, Bruno, Yanovsky, Marcelo, Chernomoretz, Ariel
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10599564/
https://www.ncbi.nlm.nih.gov/pubmed/37831726
http://dx.doi.org/10.1371/journal.pcbi.1011540
_version_ 1785125792057917440
author Beckel, Maximiliano S.
Kaufman, Bruno
Yanovsky, Marcelo
Chernomoretz, Ariel
author_facet Beckel, Maximiliano S.
Kaufman, Bruno
Yanovsky, Marcelo
Chernomoretz, Ariel
author_sort Beckel, Maximiliano S.
collection PubMed
description In eukaryotic organisms the ensemble of 5’ splice site sequences reflects the balance between natural nucleotide variability and minimal molecular constraints necessary to ensure splicing fidelity. This compromise shapes the underlying statistical patterns in the composition of donor splice site sequences. The scope of this study was to mine conserved and divergent signals in the composition of 5’ splice site sequences. Because 5’ donor sequences are a major cue for proper recognition of splice sites, we reasoned that statistical regularities in their composition could reflect the biological functionality and evolutionary history associated with splicing mechanisms. Results: We considered a regularized maximum entropy modeling framework to mine for non-trivial two-site correlations in donor sequence datasets corresponding to 30 different eukaryotes. For each analyzed species, we identified minimal sets of two-site coupling patterns that were able to replicate, at a given regularization level, the observed one-site and two-site frequencies in donor sequences. By performing a systematic and comparative analysis of 5’splice sites we showed that lineage information could be traced from joint di-nucleotide probabilities. We were able to identify characteristic two-site coupling patterns for plants and animals, and propose that they may echo differences in splicing regulation previously reported between these groups.
format Online
Article
Text
id pubmed-10599564
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-105995642023-10-26 Conserved and divergent signals in 5’ splice site sequences across fungi, metazoa and plants Beckel, Maximiliano S. Kaufman, Bruno Yanovsky, Marcelo Chernomoretz, Ariel PLoS Comput Biol Research Article In eukaryotic organisms the ensemble of 5’ splice site sequences reflects the balance between natural nucleotide variability and minimal molecular constraints necessary to ensure splicing fidelity. This compromise shapes the underlying statistical patterns in the composition of donor splice site sequences. The scope of this study was to mine conserved and divergent signals in the composition of 5’ splice site sequences. Because 5’ donor sequences are a major cue for proper recognition of splice sites, we reasoned that statistical regularities in their composition could reflect the biological functionality and evolutionary history associated with splicing mechanisms. Results: We considered a regularized maximum entropy modeling framework to mine for non-trivial two-site correlations in donor sequence datasets corresponding to 30 different eukaryotes. For each analyzed species, we identified minimal sets of two-site coupling patterns that were able to replicate, at a given regularization level, the observed one-site and two-site frequencies in donor sequences. By performing a systematic and comparative analysis of 5’splice sites we showed that lineage information could be traced from joint di-nucleotide probabilities. We were able to identify characteristic two-site coupling patterns for plants and animals, and propose that they may echo differences in splicing regulation previously reported between these groups. Public Library of Science 2023-10-13 /pmc/articles/PMC10599564/ /pubmed/37831726 http://dx.doi.org/10.1371/journal.pcbi.1011540 Text en © 2023 Beckel et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Beckel, Maximiliano S.
Kaufman, Bruno
Yanovsky, Marcelo
Chernomoretz, Ariel
Conserved and divergent signals in 5’ splice site sequences across fungi, metazoa and plants
title Conserved and divergent signals in 5’ splice site sequences across fungi, metazoa and plants
title_full Conserved and divergent signals in 5’ splice site sequences across fungi, metazoa and plants
title_fullStr Conserved and divergent signals in 5’ splice site sequences across fungi, metazoa and plants
title_full_unstemmed Conserved and divergent signals in 5’ splice site sequences across fungi, metazoa and plants
title_short Conserved and divergent signals in 5’ splice site sequences across fungi, metazoa and plants
title_sort conserved and divergent signals in 5’ splice site sequences across fungi, metazoa and plants
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10599564/
https://www.ncbi.nlm.nih.gov/pubmed/37831726
http://dx.doi.org/10.1371/journal.pcbi.1011540
work_keys_str_mv AT beckelmaximilianos conservedanddivergentsignalsin5splicesitesequencesacrossfungimetazoaandplants
AT kaufmanbruno conservedanddivergentsignalsin5splicesitesequencesacrossfungimetazoaandplants
AT yanovskymarcelo conservedanddivergentsignalsin5splicesitesequencesacrossfungimetazoaandplants
AT chernomoretzariel conservedanddivergentsignalsin5splicesitesequencesacrossfungimetazoaandplants