Cargando…

Assessment of simple sequence repeats signature in hepatitis E virus (HEV) genomes

BACKGROUND: Hepatitis E virus (HEV) is small (27–34 nm diameter) non-enveloped with positive sense ssRNA genome. Microsatellites or simple sequence repeats (SSR) are short tandem repeat sequences present across coding and non-coding regions of both prokaryotes and eukaryotes. They are involved with...

Descripción completa

Detalles Bibliográficos
Autores principales: Jilani, Md Gulam, Ali, Safdar
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer Berlin Heidelberg 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9114184/
https://www.ncbi.nlm.nih.gov/pubmed/35579724
http://dx.doi.org/10.1186/s43141-022-00365-w
_version_ 1784709726350606336
author Jilani, Md Gulam
Ali, Safdar
author_facet Jilani, Md Gulam
Ali, Safdar
author_sort Jilani, Md Gulam
collection PubMed
description BACKGROUND: Hepatitis E virus (HEV) is small (27–34 nm diameter) non-enveloped with positive sense ssRNA genome. Microsatellites or simple sequence repeats (SSR) are short tandem repeat sequences present across coding and non-coding regions of both prokaryotes and eukaryotes. They are involved with genome function and evolution at multiple levels. RESULTS: The complete genome sequences of 22 HEV genomes of the family Hepeviridae and genus Orthohepevirus (21 species) and Piscihepevirus (1 species) were extracted from NCBI database (http://www.ncbi.nlm.nih.gov/). The extraction of microsatellites was done using Imperfect Microsatellite Extractor (IMEx) in ‘Advance-Mode’. The average genome size of the studied HEV genomes was 7003nt and it ranged from 6649nt (HEV11) to 7310nt (HEV22). The average GC content of the genomes was ~ 55%. A total of 519 SSRs and 21 cSSRS were extracted from the HEV genomes with an average incidence of 24 per genome ranging from 14 (HEV13) to 34 (HEV19). The cSSR incidence ranged from 0 (eight species) to 4 (HEV19). The genomes with no cSSR incidence had an SSR incidence range from 14 to 28. There were just four hexa-nucleotide repeat motifs and 5 penta-nucleotide repeat motifs observed. The most prevalent mono-, di-, and tri-nucleotide repeat motifs were “C”, “GT/TG”, and “GAC/CTG” respectively. The studied genomes had a minimum of ~ 90% incident SSRs present in the coding regions. Viruses with same or similar hosts are placed together on the phylogenetic tree implicating viral host being one of the driving forces for evolution. Conclusions Host range in viruses is being decided by multiple factors aided by the unique genome SSR signature and genomes of varied compositions need to be analyzed to forge a widely acceptable rule for predicting the same. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s43141-022-00365-w.
format Online
Article
Text
id pubmed-9114184
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Springer Berlin Heidelberg
record_format MEDLINE/PubMed
spelling pubmed-91141842022-06-04 Assessment of simple sequence repeats signature in hepatitis E virus (HEV) genomes Jilani, Md Gulam Ali, Safdar J Genet Eng Biotechnol Research BACKGROUND: Hepatitis E virus (HEV) is small (27–34 nm diameter) non-enveloped with positive sense ssRNA genome. Microsatellites or simple sequence repeats (SSR) are short tandem repeat sequences present across coding and non-coding regions of both prokaryotes and eukaryotes. They are involved with genome function and evolution at multiple levels. RESULTS: The complete genome sequences of 22 HEV genomes of the family Hepeviridae and genus Orthohepevirus (21 species) and Piscihepevirus (1 species) were extracted from NCBI database (http://www.ncbi.nlm.nih.gov/). The extraction of microsatellites was done using Imperfect Microsatellite Extractor (IMEx) in ‘Advance-Mode’. The average genome size of the studied HEV genomes was 7003nt and it ranged from 6649nt (HEV11) to 7310nt (HEV22). The average GC content of the genomes was ~ 55%. A total of 519 SSRs and 21 cSSRS were extracted from the HEV genomes with an average incidence of 24 per genome ranging from 14 (HEV13) to 34 (HEV19). The cSSR incidence ranged from 0 (eight species) to 4 (HEV19). The genomes with no cSSR incidence had an SSR incidence range from 14 to 28. There were just four hexa-nucleotide repeat motifs and 5 penta-nucleotide repeat motifs observed. The most prevalent mono-, di-, and tri-nucleotide repeat motifs were “C”, “GT/TG”, and “GAC/CTG” respectively. The studied genomes had a minimum of ~ 90% incident SSRs present in the coding regions. Viruses with same or similar hosts are placed together on the phylogenetic tree implicating viral host being one of the driving forces for evolution. Conclusions Host range in viruses is being decided by multiple factors aided by the unique genome SSR signature and genomes of varied compositions need to be analyzed to forge a widely acceptable rule for predicting the same. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s43141-022-00365-w. Springer Berlin Heidelberg 2022-05-17 /pmc/articles/PMC9114184/ /pubmed/35579724 http://dx.doi.org/10.1186/s43141-022-00365-w Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Research
Jilani, Md Gulam
Ali, Safdar
Assessment of simple sequence repeats signature in hepatitis E virus (HEV) genomes
title Assessment of simple sequence repeats signature in hepatitis E virus (HEV) genomes
title_full Assessment of simple sequence repeats signature in hepatitis E virus (HEV) genomes
title_fullStr Assessment of simple sequence repeats signature in hepatitis E virus (HEV) genomes
title_full_unstemmed Assessment of simple sequence repeats signature in hepatitis E virus (HEV) genomes
title_short Assessment of simple sequence repeats signature in hepatitis E virus (HEV) genomes
title_sort assessment of simple sequence repeats signature in hepatitis e virus (hev) genomes
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9114184/
https://www.ncbi.nlm.nih.gov/pubmed/35579724
http://dx.doi.org/10.1186/s43141-022-00365-w
work_keys_str_mv AT jilanimdgulam assessmentofsimplesequencerepeatssignatureinhepatitisevirushevgenomes
AT alisafdar assessmentofsimplesequencerepeatssignatureinhepatitisevirushevgenomes