Gene Presence/Absence Variation analysis of coronavirus family displays its pan-genomic diversity

SARS-CoV-2 belongs to the coronavirus family. Comparing genomic features of viral genomes of the coronavirus family can improve our understanding of SARS-CoV-2. Here we present the first pan-genome analysis of 3,932 whole genomes of 101 species from 4 genera of the coronavirus family. We found th...

Bibliographic Details
Main authors: Jiao, Du, Dong, Xiaorui, Yu, Yingyan, Wei, Chaochun
Format: Online Article Text
Language: English
Published: Ivyspring International Publisher 2021
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8495401/
https://www.ncbi.nlm.nih.gov/pubmed/34671195
http://dx.doi.org/10.7150/ijbs.58220
_version_ 1784579542794371072
author Jiao, Du
Dong, Xiaorui
Yu, Yingyan
Wei, Chaochun
author_facet Jiao, Du
Dong, Xiaorui
Yu, Yingyan
Wei, Chaochun
author_sort Jiao, Du
collection PubMed
description SARS-CoV-2 belongs to the coronavirus family. Comparing genomic features of viral genomes of the coronavirus family can improve our understanding of SARS-CoV-2. Here we present the first pan-genome analysis of 3,932 whole genomes of 101 species from 4 genera of the coronavirus family. We found a total of 181 genes in the pan-genome of the coronavirus family, among which only 3 genes, the S gene, M gene and N gene, are highly conserved. We also constructed a pan-genome from 23,539 whole genomes of SARS-CoV-2. There are 13 genes in total in the SARS-CoV-2 pan-genome, and all 13 are core genes for SARS-CoV-2. The pan-genome of coronaviruses shows a lower level of diversity than the pan-genomes of other RNA viruses, which contain no core gene. The three highly conserved genes in the coronavirus family, which are also core genes in the SARS-CoV-2 pan-genome, could be potential targets for developing nucleic acid diagnostic reagents with a decreased possibility of cross-reaction with other coronavirus species.
format Online
Article
Text
id pubmed-8495401
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Ivyspring International Publisher
record_format MEDLINE/PubMed
spelling pubmed-84954012021-10-19 Gene Presence/Absence Variation analysis of coronavirus family displays its pan-genomic diversity Jiao, Du Dong, Xiaorui Yu, Yingyan Wei, Chaochun Int J Biol Sci Research Paper SARS-CoV-2 belongs to the coronavirus family. Comparing genomic features of viral genomes of coronavirus family can improve our understanding about SARS-CoV-2. Here we present the first pan-genome analysis of 3,932 whole genomes of 101 species out of 4 genera from the coronavirus family. We found that a total of 181 genes in the pan-genome of coronavirus family, among which only 3 genes, the S gene, M gene and N gene, are highly conserved. We also constructed a pan-genome from 23,539 whole genomes of SARS-CoV-2. There are 13 genes in total in the SARS-CoV-2 pan-genome. All of the 13 genes are core genes for SARS-CoV-2. The pan-genome of coronaviruses shows a lower level of diversity than the pan-genomes of other RNA viruses, which contain no core gene. The three highly conserved genes in coronavirus family, which are also core genes in SARS-CoV-2 pan-genome, could be potential targets in developing nucleic acid diagnostic reagents with a decreased possibility of cross-reaction with other coronavirus species. Ivyspring International Publisher 2021-08-27 /pmc/articles/PMC8495401/ /pubmed/34671195 http://dx.doi.org/10.7150/ijbs.58220 Text en © The author(s) https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/). See http://ivyspring.com/terms for full terms and conditions.
spellingShingle Research Paper
Jiao, Du
Dong, Xiaorui
Yu, Yingyan
Wei, Chaochun
Gene Presence/Absence Variation analysis of coronavirus family displays its pan-genomic diversity
title Gene Presence/Absence Variation analysis of coronavirus family displays its pan-genomic diversity
title_full Gene Presence/Absence Variation analysis of coronavirus family displays its pan-genomic diversity
title_fullStr Gene Presence/Absence Variation analysis of coronavirus family displays its pan-genomic diversity
title_full_unstemmed Gene Presence/Absence Variation analysis of coronavirus family displays its pan-genomic diversity
title_short Gene Presence/Absence Variation analysis of coronavirus family displays its pan-genomic diversity
title_sort gene presence/absence variation analysis of coronavirus family displays its pan-genomic diversity
topic Research Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8495401/
https://www.ncbi.nlm.nih.gov/pubmed/34671195
http://dx.doi.org/10.7150/ijbs.58220
work_keys_str_mv AT jiaodu genepresenceabsencevariationanalysisofcoronavirusfamilydisplaysitspangenomicdiversity
AT dongxiaorui genepresenceabsencevariationanalysisofcoronavirusfamilydisplaysitspangenomicdiversity
AT yuyingyan genepresenceabsencevariationanalysisofcoronavirusfamilydisplaysitspangenomicdiversity
AT weichaochun genepresenceabsencevariationanalysisofcoronavirusfamilydisplaysitspangenomicdiversity