Cargando…

Large-Scale Genomics Reveals the Genetic Characteristics of Seven Species and Importance of Phylogenetic Distance for Estimating Pan-Genome Size

For more than a decade, pan-genome analysis has been applied as an effective method for explaining the genetic contents variation of prokaryotic species. However, genomic characteristics and detailed structures of gene pools have not been fully clarified, because most studies have used a small numbe...

Descripción completa

Detalles Bibliográficos
Autores principales: Park, Sang-Cheol, Lee, Kihyun, Kim, Yeong Ouk, Won, Sungho, Chun, Jongsik
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6491781/
https://www.ncbi.nlm.nih.gov/pubmed/31068915
http://dx.doi.org/10.3389/fmicb.2019.00834
_version_ 1783415014015631360
author Park, Sang-Cheol
Lee, Kihyun
Kim, Yeong Ouk
Won, Sungho
Chun, Jongsik
author_facet Park, Sang-Cheol
Lee, Kihyun
Kim, Yeong Ouk
Won, Sungho
Chun, Jongsik
author_sort Park, Sang-Cheol
collection PubMed
description For more than a decade, pan-genome analysis has been applied as an effective method for explaining the genetic contents variation of prokaryotic species. However, genomic characteristics and detailed structures of gene pools have not been fully clarified, because most studies have used a small number of genomes. Here, we constructed pan-genomes of seven species in order to elucidate variations in the genetic contents of >27,000 genomes belonging to Streptococcus pneumoniae, Staphylococcus aureus subsp. aureus, Salmonella enterica subsp. enterica, Escherichia coli and Shigella spp., Mycobacterium tuberculosis complex, Pseudomonas aeruginosa, and Acinetobacter baumannii. This work showed the pan-genomes of all seven species has open property. Additionally, systematic evaluation of the characteristics of their pan-genome revealed that phylogenetic distance provided valuable information for estimating the parameters for pan-genome size among several models including Heaps’ law. Our results provide a better understanding of the species and a solution to minimize sampling biases associated with genome-sequencing preferences for pathogenic strains.
format Online
Article
Text
id pubmed-6491781
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-64917812019-05-08 Large-Scale Genomics Reveals the Genetic Characteristics of Seven Species and Importance of Phylogenetic Distance for Estimating Pan-Genome Size Park, Sang-Cheol Lee, Kihyun Kim, Yeong Ouk Won, Sungho Chun, Jongsik Front Microbiol Microbiology For more than a decade, pan-genome analysis has been applied as an effective method for explaining the genetic contents variation of prokaryotic species. However, genomic characteristics and detailed structures of gene pools have not been fully clarified, because most studies have used a small number of genomes. Here, we constructed pan-genomes of seven species in order to elucidate variations in the genetic contents of >27,000 genomes belonging to Streptococcus pneumoniae, Staphylococcus aureus subsp. aureus, Salmonella enterica subsp. enterica, Escherichia coli and Shigella spp., Mycobacterium tuberculosis complex, Pseudomonas aeruginosa, and Acinetobacter baumannii. This work showed the pan-genomes of all seven species has open property. Additionally, systematic evaluation of the characteristics of their pan-genome revealed that phylogenetic distance provided valuable information for estimating the parameters for pan-genome size among several models including Heaps’ law. Our results provide a better understanding of the species and a solution to minimize sampling biases associated with genome-sequencing preferences for pathogenic strains. Frontiers Media S.A. 2019-04-24 /pmc/articles/PMC6491781/ /pubmed/31068915 http://dx.doi.org/10.3389/fmicb.2019.00834 Text en Copyright © 2019 Park, Lee, Kim, Won and Chun. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Microbiology
Park, Sang-Cheol
Lee, Kihyun
Kim, Yeong Ouk
Won, Sungho
Chun, Jongsik
Large-Scale Genomics Reveals the Genetic Characteristics of Seven Species and Importance of Phylogenetic Distance for Estimating Pan-Genome Size
title Large-Scale Genomics Reveals the Genetic Characteristics of Seven Species and Importance of Phylogenetic Distance for Estimating Pan-Genome Size
title_full Large-Scale Genomics Reveals the Genetic Characteristics of Seven Species and Importance of Phylogenetic Distance for Estimating Pan-Genome Size
title_fullStr Large-Scale Genomics Reveals the Genetic Characteristics of Seven Species and Importance of Phylogenetic Distance for Estimating Pan-Genome Size
title_full_unstemmed Large-Scale Genomics Reveals the Genetic Characteristics of Seven Species and Importance of Phylogenetic Distance for Estimating Pan-Genome Size
title_short Large-Scale Genomics Reveals the Genetic Characteristics of Seven Species and Importance of Phylogenetic Distance for Estimating Pan-Genome Size
title_sort large-scale genomics reveals the genetic characteristics of seven species and importance of phylogenetic distance for estimating pan-genome size
topic Microbiology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6491781/
https://www.ncbi.nlm.nih.gov/pubmed/31068915
http://dx.doi.org/10.3389/fmicb.2019.00834
work_keys_str_mv AT parksangcheol largescalegenomicsrevealsthegeneticcharacteristicsofsevenspeciesandimportanceofphylogeneticdistanceforestimatingpangenomesize
AT leekihyun largescalegenomicsrevealsthegeneticcharacteristicsofsevenspeciesandimportanceofphylogeneticdistanceforestimatingpangenomesize
AT kimyeongouk largescalegenomicsrevealsthegeneticcharacteristicsofsevenspeciesandimportanceofphylogeneticdistanceforestimatingpangenomesize
AT wonsungho largescalegenomicsrevealsthegeneticcharacteristicsofsevenspeciesandimportanceofphylogeneticdistanceforestimatingpangenomesize
AT chunjongsik largescalegenomicsrevealsthegeneticcharacteristicsofsevenspeciesandimportanceofphylogeneticdistanceforestimatingpangenomesize