Distinct patterns of simple sequence repeats and GC distribution in intragenic and intergenic regions of primate genomes


Bibliographic Details
Main authors: Qi, Wen-Hua, Yan, Chao-chao, Li, Wu-Jiao, Jiang, Xue-Mei, Li, Guang-Zhou, Zhang, Xiu-Yue, Hu, Ting-Zhang, Li, Jing, Yue, Bi-Song
Format: Online Article Text
Language: English
Published: Impact Journals LLC 2016
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5191860/
https://www.ncbi.nlm.nih.gov/pubmed/27644032
http://dx.doi.org/10.18632/aging.101025
Description
Summary: As the first systematic examination of simple sequence repeats (SSRs) and guanine-cytosine (GC) distribution in intragenic and intergenic regions of ten primates, our study showed that SSRs and GC content were nonrandomly distributed in both intragenic and intergenic regions, suggesting that they have potential roles in transcriptional or translational regulation. Our results suggest that the majority of SSRs are located in non-coding regions, such as introns, TEs, and intergenic regions. In these primates, trinucleotide perfect (P) SSRs were the most abundant repeat type in the 5′UTRs and CDSs, whereas mononucleotide P-SSRs were the most abundant in the introns, 3′UTRs, TEs, and intergenic regions. GC content varied greatly among the intragenic and intergenic regions, in the order 5′UTRs > CDSs > 3′UTRs > TEs > introns > intergenic regions, and high GC content was frequently found in exon-rich regions. Our results also showed that, within the same intragenic or intergenic region, the distribution of GC content was highly similar across the different primates. Tri- and hexanucleotide P-SSRs had the highest GC content in the 5′UTRs and CDSs, whereas mononucleotide P-SSRs had the lowest GC content in all six genomic regions of these primates. The most frequent motifs for each repeat length varied markedly among the genomic regions.