Cargando…

Microsatellite diversity and complexity in the viral genomes of the family Caliciviridae

BACKGROUND: Microsatellites or simple sequence repeats (SSR) consist of 1–6 nucleotide motifs of DNA or RNA which are ubiquitously present in tandem repeated sequences across genome in viruses: prokaryotes and eukaryotes. They may be localized to both the coding and non-coding regions. SSRs play an...

Descripción completa

Detalles Bibliográficos
Autores principales: Jilani, Md Gulam, Hoque, Mehboob, Ali, Safdar
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer Berlin Heidelberg 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10673786/
https://www.ncbi.nlm.nih.gov/pubmed/37999808
http://dx.doi.org/10.1186/s43141-023-00582-x
_version_ 1785149634893578240
author Jilani, Md Gulam
Hoque, Mehboob
Ali, Safdar
author_facet Jilani, Md Gulam
Hoque, Mehboob
Ali, Safdar
author_sort Jilani, Md Gulam
collection PubMed
description BACKGROUND: Microsatellites or simple sequence repeats (SSR) consist of 1–6 nucleotide motifs of DNA or RNA which are ubiquitously present in tandem repeated sequences across genome in viruses: prokaryotes and eukaryotes. They may be localized to both the coding and non-coding regions. SSRs play an important role in replication, gene regulation, transcription, and protein function. The Caliciviridae (CLV) family of viruses have ss-RNA, non-enveloped, icosahedral symmetry 27–35 nm in diameter in size. The size of the genome lies between 6.4 and 8.6 kb. RESULTS: The incidence, composition, diversity, complexity, and host range of different microsatellites in 62 representatives of the family of Caliciviridae were systematically analyzed. The full-length genome sequences were assessed from NCBI (https://www.ncbi.nlm.nih.gov), and microsatellites were extracted through MISA software. The average genome size is about 7538 bp ranging from 6273 (CLV61) to 8798 (CLV47) bp. The average GC content of the genomes was ~ 51%. There are a total of 1317 SSRs and 53 cSSRs in the studied genomes. CLV 41 and CLV 49 contain the highest and lowest value of SSRs with 32 and 10 respectively, while CLV16 had maximum cSSR incidence of 4. There were 29 species which do not contain any cSSR. The incidence of mono-, di-, and tri-nucleotide SSRs was 219, 884, and 206, respectively. The most prevalent mono-, di-, and tri-nucleotide repeat motifs were “C” (126 SSRs), AC/CA (240 SSRs), and TGA/ACT (23 SSRs), respectively. Most of the SSRs and cSSRs are biased toward the coding region with a minimum of ~ 90% incident SSRs in the genomes’ coding region. Viruses with similar host are found close to each other on the phylogenetic tree suggesting virus host being one of the driving forces for their evolution. CONCLUSIONS: The Caliciviridae genomes does not conform to any pattern of SSR signature in terms of incidence, composition, and localization. This unique property of SSR plays an important role in viral evolution. Clustering of similar host in the phylogenetic tree is the evidence of the uniqueness of SSR signature. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s43141-023-00582-x.
format Online
Article
Text
id pubmed-10673786
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Springer Berlin Heidelberg
record_format MEDLINE/PubMed
spelling pubmed-106737862023-11-24 Microsatellite diversity and complexity in the viral genomes of the family Caliciviridae Jilani, Md Gulam Hoque, Mehboob Ali, Safdar J Genet Eng Biotechnol Research BACKGROUND: Microsatellites or simple sequence repeats (SSR) consist of 1–6 nucleotide motifs of DNA or RNA which are ubiquitously present in tandem repeated sequences across genome in viruses: prokaryotes and eukaryotes. They may be localized to both the coding and non-coding regions. SSRs play an important role in replication, gene regulation, transcription, and protein function. The Caliciviridae (CLV) family of viruses have ss-RNA, non-enveloped, icosahedral symmetry 27–35 nm in diameter in size. The size of the genome lies between 6.4 and 8.6 kb. RESULTS: The incidence, composition, diversity, complexity, and host range of different microsatellites in 62 representatives of the family of Caliciviridae were systematically analyzed. The full-length genome sequences were assessed from NCBI (https://www.ncbi.nlm.nih.gov), and microsatellites were extracted through MISA software. The average genome size is about 7538 bp ranging from 6273 (CLV61) to 8798 (CLV47) bp. The average GC content of the genomes was ~ 51%. There are a total of 1317 SSRs and 53 cSSRs in the studied genomes. CLV 41 and CLV 49 contain the highest and lowest value of SSRs with 32 and 10 respectively, while CLV16 had maximum cSSR incidence of 4. There were 29 species which do not contain any cSSR. The incidence of mono-, di-, and tri-nucleotide SSRs was 219, 884, and 206, respectively. The most prevalent mono-, di-, and tri-nucleotide repeat motifs were “C” (126 SSRs), AC/CA (240 SSRs), and TGA/ACT (23 SSRs), respectively. Most of the SSRs and cSSRs are biased toward the coding region with a minimum of ~ 90% incident SSRs in the genomes’ coding region. Viruses with similar host are found close to each other on the phylogenetic tree suggesting virus host being one of the driving forces for their evolution. CONCLUSIONS: The Caliciviridae genomes does not conform to any pattern of SSR signature in terms of incidence, composition, and localization. This unique property of SSR plays an important role in viral evolution. Clustering of similar host in the phylogenetic tree is the evidence of the uniqueness of SSR signature. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s43141-023-00582-x. Springer Berlin Heidelberg 2023-11-24 /pmc/articles/PMC10673786/ /pubmed/37999808 http://dx.doi.org/10.1186/s43141-023-00582-x Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Research
Jilani, Md Gulam
Hoque, Mehboob
Ali, Safdar
Microsatellite diversity and complexity in the viral genomes of the family Caliciviridae
title Microsatellite diversity and complexity in the viral genomes of the family Caliciviridae
title_full Microsatellite diversity and complexity in the viral genomes of the family Caliciviridae
title_fullStr Microsatellite diversity and complexity in the viral genomes of the family Caliciviridae
title_full_unstemmed Microsatellite diversity and complexity in the viral genomes of the family Caliciviridae
title_short Microsatellite diversity and complexity in the viral genomes of the family Caliciviridae
title_sort microsatellite diversity and complexity in the viral genomes of the family caliciviridae
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10673786/
https://www.ncbi.nlm.nih.gov/pubmed/37999808
http://dx.doi.org/10.1186/s43141-023-00582-x
work_keys_str_mv AT jilanimdgulam microsatellitediversityandcomplexityintheviralgenomesofthefamilycaliciviridae
AT hoquemehboob microsatellitediversityandcomplexityintheviralgenomesofthefamilycaliciviridae
AT alisafdar microsatellitediversityandcomplexityintheviralgenomesofthefamilycaliciviridae