Cargando…

Comprehensive splice-site analysis using comparative genomics

We have collected over half a million splice sites from five species—Homo sapiens, Mus musculus, Drosophila melanogaster, Caenorhabditis elegans and Arabidopsis thaliana—and classified them into four subtypes: U2-type GT–AG and GC–AG and U12-type GT–AG and AT–AC. We have also found new examples of r...

Descripción completa

Detalles Bibliográficos
Autores principales: Sheth, Nihar, Roca, Xavier, Hastings, Michelle L., Roeder, Ted, Krainer, Adrian R., Sachidanandam, Ravi
Formato: Texto
Lenguaje:English
Publicado: Oxford University Press 2006
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1557818/
https://www.ncbi.nlm.nih.gov/pubmed/16914448
http://dx.doi.org/10.1093/nar/gkl556
_version_ 1782129415456555008
author Sheth, Nihar
Roca, Xavier
Hastings, Michelle L.
Roeder, Ted
Krainer, Adrian R.
Sachidanandam, Ravi
author_facet Sheth, Nihar
Roca, Xavier
Hastings, Michelle L.
Roeder, Ted
Krainer, Adrian R.
Sachidanandam, Ravi
author_sort Sheth, Nihar
collection PubMed
description We have collected over half a million splice sites from five species—Homo sapiens, Mus musculus, Drosophila melanogaster, Caenorhabditis elegans and Arabidopsis thaliana—and classified them into four subtypes: U2-type GT–AG and GC–AG and U12-type GT–AG and AT–AC. We have also found new examples of rare splice-site categories, such as U12-type introns without canonical borders, and U2-dependent AT–AC introns. The splice-site sequences and several tools to explore them are available on a public website (SpliceRack). For the U12-type introns, we find several features conserved across species, as well as a clustering of these introns on genes. Using the information content of the splice-site motifs, and the phylogenetic distance between them, we identify: (i) a higher degree of conservation in the exonic portion of the U2-type splice sites in more complex organisms; (ii) conservation of exonic nucleotides for U12-type splice sites; (iii) divergent evolution of C.elegans 3′ splice sites (3′ss) and (iv) distinct evolutionary histories of 5′ and 3′ss. Our study proves that the identification of broad patterns in naturally-occurring splice sites, through the analysis of genomic datasets, provides mechanistic and evolutionary insights into pre-mRNA splicing.
format Text
id pubmed-1557818
institution National Center for Biotechnology Information
language English
publishDate 2006
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-15578182006-09-06 Comprehensive splice-site analysis using comparative genomics Sheth, Nihar Roca, Xavier Hastings, Michelle L. Roeder, Ted Krainer, Adrian R. Sachidanandam, Ravi Nucleic Acids Res Genomics We have collected over half a million splice sites from five species—Homo sapiens, Mus musculus, Drosophila melanogaster, Caenorhabditis elegans and Arabidopsis thaliana—and classified them into four subtypes: U2-type GT–AG and GC–AG and U12-type GT–AG and AT–AC. We have also found new examples of rare splice-site categories, such as U12-type introns without canonical borders, and U2-dependent AT–AC introns. The splice-site sequences and several tools to explore them are available on a public website (SpliceRack). For the U12-type introns, we find several features conserved across species, as well as a clustering of these introns on genes. Using the information content of the splice-site motifs, and the phylogenetic distance between them, we identify: (i) a higher degree of conservation in the exonic portion of the U2-type splice sites in more complex organisms; (ii) conservation of exonic nucleotides for U12-type splice sites; (iii) divergent evolution of C.elegans 3′ splice sites (3′ss) and (iv) distinct evolutionary histories of 5′ and 3′ss. Our study proves that the identification of broad patterns in naturally-occurring splice sites, through the analysis of genomic datasets, provides mechanistic and evolutionary insights into pre-mRNA splicing. Oxford University Press 2006 2006-08-12 /pmc/articles/PMC1557818/ /pubmed/16914448 http://dx.doi.org/10.1093/nar/gkl556 Text en © 2006 The Author(s).
spellingShingle Genomics
Sheth, Nihar
Roca, Xavier
Hastings, Michelle L.
Roeder, Ted
Krainer, Adrian R.
Sachidanandam, Ravi
Comprehensive splice-site analysis using comparative genomics
title Comprehensive splice-site analysis using comparative genomics
title_full Comprehensive splice-site analysis using comparative genomics
title_fullStr Comprehensive splice-site analysis using comparative genomics
title_full_unstemmed Comprehensive splice-site analysis using comparative genomics
title_short Comprehensive splice-site analysis using comparative genomics
title_sort comprehensive splice-site analysis using comparative genomics
topic Genomics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1557818/
https://www.ncbi.nlm.nih.gov/pubmed/16914448
http://dx.doi.org/10.1093/nar/gkl556
work_keys_str_mv AT shethnihar comprehensivesplicesiteanalysisusingcomparativegenomics
AT rocaxavier comprehensivesplicesiteanalysisusingcomparativegenomics
AT hastingsmichellel comprehensivesplicesiteanalysisusingcomparativegenomics
AT roederted comprehensivesplicesiteanalysisusingcomparativegenomics
AT kraineradrianr comprehensivesplicesiteanalysisusingcomparativegenomics
AT sachidanandamravi comprehensivesplicesiteanalysisusingcomparativegenomics