Cargando…

Mining and analysis of microsatellites in human coronavirus genomes using the in-house built Java pipeline

Microsatellites or simple sequence repeats are motifs of 1 to 6 nucleotides in length present in both coding and non-coding regions of DNA. These are found widely distributed in the whole genome of prokaryotes, eukaryotes, bacteria, and viruses and are used as molecular markers in studying DNA varia...

Descripción completa

Detalles Bibliográficos
Autores principales:	Umang, Bharti, Pawan Kumar, Husain, Akhtar
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Korea Genome Organization 2022
Materias:	Original Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9576472/ https://www.ncbi.nlm.nih.gov/pubmed/36239112 http://dx.doi.org/10.5808/gi.20033

_version_	1784811535829303296
author	Umang, Bharti, Pawan Kumar Husain, Akhtar
author_facet	Umang, Bharti, Pawan Kumar Husain, Akhtar
author_sort	Umang,
collection	PubMed
description	Microsatellites or simple sequence repeats are motifs of 1 to 6 nucleotides in length present in both coding and non-coding regions of DNA. These are found widely distributed in the whole genome of prokaryotes, eukaryotes, bacteria, and viruses and are used as molecular markers in studying DNA variations, gene regulation, genetic diversity and evolutionary studies, etc. However, in vitro microsatellite identification proves to be time-consuming and expensive. Therefore, the present research has been focused on using an in-house built java pipeline to identify, analyse, design primers and find related statistics of perfect and compound microsatellites in the seven complete genome sequences of coronavirus, including the genome of coronavirus disease 2019, where the host is Homo sapiens. Based on search criteria among seven genomic sequences, it was revealed that the total number of perfect simple sequence repeats (SSRs) found to be in the range of 76 to 118 and compound SSRs from 01 to10, thus reflecting the low conversion of perfect simple sequence to compound repeats. Furthermore, the incidence of SSRs was insignificant but positively correlated with genome size (R(2) = 0.45, p > 0.05), with simple sequence repeats relative abundance (R(2) = 0.18, p > 0.05) and relative density (R(2) = 0.23, p > 0.05). Dinucleotide repeats were the most abundant in the coding region of the genome, followed by tri, mono, and tetra. This comparative study would help us understand the evolutionary relationship, genetic diversity, and hypervariability in minimal time and cost.
format	Online Article Text
id	pubmed-9576472
institution	National Center for Biotechnology Information
language	English
publishDate	2022
publisher	Korea Genome Organization
record_format	MEDLINE/PubMed
spelling	pubmed-95764722022-10-19 Mining and analysis of microsatellites in human coronavirus genomes using the in-house built Java pipeline Umang, Bharti, Pawan Kumar Husain, Akhtar Genomics Inform Original Article Microsatellites or simple sequence repeats are motifs of 1 to 6 nucleotides in length present in both coding and non-coding regions of DNA. These are found widely distributed in the whole genome of prokaryotes, eukaryotes, bacteria, and viruses and are used as molecular markers in studying DNA variations, gene regulation, genetic diversity and evolutionary studies, etc. However, in vitro microsatellite identification proves to be time-consuming and expensive. Therefore, the present research has been focused on using an in-house built java pipeline to identify, analyse, design primers and find related statistics of perfect and compound microsatellites in the seven complete genome sequences of coronavirus, including the genome of coronavirus disease 2019, where the host is Homo sapiens. Based on search criteria among seven genomic sequences, it was revealed that the total number of perfect simple sequence repeats (SSRs) found to be in the range of 76 to 118 and compound SSRs from 01 to10, thus reflecting the low conversion of perfect simple sequence to compound repeats. Furthermore, the incidence of SSRs was insignificant but positively correlated with genome size (R(2) = 0.45, p > 0.05), with simple sequence repeats relative abundance (R(2) = 0.18, p > 0.05) and relative density (R(2) = 0.23, p > 0.05). Dinucleotide repeats were the most abundant in the coding region of the genome, followed by tri, mono, and tetra. This comparative study would help us understand the evolutionary relationship, genetic diversity, and hypervariability in minimal time and cost. Korea Genome Organization 2022-09-30 /pmc/articles/PMC9576472/ /pubmed/36239112 http://dx.doi.org/10.5808/gi.20033 Text en (c) 2022, Korea Genome Organization https://creativecommons.org/licenses/by/4.0/(CC) This is an open-access article distributed under the terms of the Creative Commons Attribution license(https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Original Article Umang, Bharti, Pawan Kumar Husain, Akhtar Mining and analysis of microsatellites in human coronavirus genomes using the in-house built Java pipeline
title	Mining and analysis of microsatellites in human coronavirus genomes using the in-house built Java pipeline
title_full	Mining and analysis of microsatellites in human coronavirus genomes using the in-house built Java pipeline
title_fullStr	Mining and analysis of microsatellites in human coronavirus genomes using the in-house built Java pipeline
title_full_unstemmed	Mining and analysis of microsatellites in human coronavirus genomes using the in-house built Java pipeline
title_short	Mining and analysis of microsatellites in human coronavirus genomes using the in-house built Java pipeline
title_sort	mining and analysis of microsatellites in human coronavirus genomes using the in-house built java pipeline
topic	Original Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9576472/ https://www.ncbi.nlm.nih.gov/pubmed/36239112 http://dx.doi.org/10.5808/gi.20033
work_keys_str_mv	AT umang miningandanalysisofmicrosatellitesinhumancoronavirusgenomesusingtheinhousebuiltjavapipeline AT bhartipawankumar miningandanalysisofmicrosatellitesinhumancoronavirusgenomesusingtheinhousebuiltjavapipeline AT husainakhtar miningandanalysisofmicrosatellitesinhumancoronavirusgenomesusingtheinhousebuiltjavapipeline

Mining and analysis of microsatellites in human coronavirus genomes using the in-house built Java pipeline

Ejemplares similares