Cargando…
Comprehensive splice-site analysis using comparative genomics
We have collected over half a million splice sites from five species—Homo sapiens, Mus musculus, Drosophila melanogaster, Caenorhabditis elegans and Arabidopsis thaliana—and classified them into four subtypes: U2-type GT–AG and GC–AG and U12-type GT–AG and AT–AC. We have also found new examples of r...
Autores principales: | , , , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2006
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1557818/ https://www.ncbi.nlm.nih.gov/pubmed/16914448 http://dx.doi.org/10.1093/nar/gkl556 |
_version_ | 1782129415456555008 |
---|---|
author | Sheth, Nihar Roca, Xavier Hastings, Michelle L. Roeder, Ted Krainer, Adrian R. Sachidanandam, Ravi |
author_facet | Sheth, Nihar Roca, Xavier Hastings, Michelle L. Roeder, Ted Krainer, Adrian R. Sachidanandam, Ravi |
author_sort | Sheth, Nihar |
collection | PubMed |
description | We have collected over half a million splice sites from five species—Homo sapiens, Mus musculus, Drosophila melanogaster, Caenorhabditis elegans and Arabidopsis thaliana—and classified them into four subtypes: U2-type GT–AG and GC–AG and U12-type GT–AG and AT–AC. We have also found new examples of rare splice-site categories, such as U12-type introns without canonical borders, and U2-dependent AT–AC introns. The splice-site sequences and several tools to explore them are available on a public website (SpliceRack). For the U12-type introns, we find several features conserved across species, as well as a clustering of these introns on genes. Using the information content of the splice-site motifs, and the phylogenetic distance between them, we identify: (i) a higher degree of conservation in the exonic portion of the U2-type splice sites in more complex organisms; (ii) conservation of exonic nucleotides for U12-type splice sites; (iii) divergent evolution of C.elegans 3′ splice sites (3′ss) and (iv) distinct evolutionary histories of 5′ and 3′ss. Our study proves that the identification of broad patterns in naturally-occurring splice sites, through the analysis of genomic datasets, provides mechanistic and evolutionary insights into pre-mRNA splicing. |
format | Text |
id | pubmed-1557818 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2006 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-15578182006-09-06 Comprehensive splice-site analysis using comparative genomics Sheth, Nihar Roca, Xavier Hastings, Michelle L. Roeder, Ted Krainer, Adrian R. Sachidanandam, Ravi Nucleic Acids Res Genomics We have collected over half a million splice sites from five species—Homo sapiens, Mus musculus, Drosophila melanogaster, Caenorhabditis elegans and Arabidopsis thaliana—and classified them into four subtypes: U2-type GT–AG and GC–AG and U12-type GT–AG and AT–AC. We have also found new examples of rare splice-site categories, such as U12-type introns without canonical borders, and U2-dependent AT–AC introns. The splice-site sequences and several tools to explore them are available on a public website (SpliceRack). For the U12-type introns, we find several features conserved across species, as well as a clustering of these introns on genes. Using the information content of the splice-site motifs, and the phylogenetic distance between them, we identify: (i) a higher degree of conservation in the exonic portion of the U2-type splice sites in more complex organisms; (ii) conservation of exonic nucleotides for U12-type splice sites; (iii) divergent evolution of C.elegans 3′ splice sites (3′ss) and (iv) distinct evolutionary histories of 5′ and 3′ss. Our study proves that the identification of broad patterns in naturally-occurring splice sites, through the analysis of genomic datasets, provides mechanistic and evolutionary insights into pre-mRNA splicing. Oxford University Press 2006 2006-08-12 /pmc/articles/PMC1557818/ /pubmed/16914448 http://dx.doi.org/10.1093/nar/gkl556 Text en © 2006 The Author(s). |
spellingShingle | Genomics Sheth, Nihar Roca, Xavier Hastings, Michelle L. Roeder, Ted Krainer, Adrian R. Sachidanandam, Ravi Comprehensive splice-site analysis using comparative genomics |
title | Comprehensive splice-site analysis using comparative genomics |
title_full | Comprehensive splice-site analysis using comparative genomics |
title_fullStr | Comprehensive splice-site analysis using comparative genomics |
title_full_unstemmed | Comprehensive splice-site analysis using comparative genomics |
title_short | Comprehensive splice-site analysis using comparative genomics |
title_sort | comprehensive splice-site analysis using comparative genomics |
topic | Genomics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1557818/ https://www.ncbi.nlm.nih.gov/pubmed/16914448 http://dx.doi.org/10.1093/nar/gkl556 |
work_keys_str_mv | AT shethnihar comprehensivesplicesiteanalysisusingcomparativegenomics AT rocaxavier comprehensivesplicesiteanalysisusingcomparativegenomics AT hastingsmichellel comprehensivesplicesiteanalysisusingcomparativegenomics AT roederted comprehensivesplicesiteanalysisusingcomparativegenomics AT kraineradrianr comprehensivesplicesiteanalysisusingcomparativegenomics AT sachidanandamravi comprehensivesplicesiteanalysisusingcomparativegenomics |