Cargando…

Phenetic Comparison of Prokaryotic Genomes Using k-mers

Bacterial genomics studies are getting more extensive and complex, requiring new ways to envision analyses. Using the Ray Surveyor software, we demonstrate that comparison of genomes based on their k-mer content allows reconstruction of phenetic trees without the need of prior data curation, such as...

Descripción completa

Detalles Bibliográficos
Autores principales: Déraspe, Maxime, Raymond, Frédéric, Boisvert, Sébastien, Culley, Alexander, Roy, Paul H., Laviolette, François, Corbeil, Jacques
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5850840/
https://www.ncbi.nlm.nih.gov/pubmed/28957508
http://dx.doi.org/10.1093/molbev/msx200
_version_ 1783306293625225216
author Déraspe, Maxime
Raymond, Frédéric
Boisvert, Sébastien
Culley, Alexander
Roy, Paul H.
Laviolette, François
Corbeil, Jacques
author_facet Déraspe, Maxime
Raymond, Frédéric
Boisvert, Sébastien
Culley, Alexander
Roy, Paul H.
Laviolette, François
Corbeil, Jacques
author_sort Déraspe, Maxime
collection PubMed
description Bacterial genomics studies are getting more extensive and complex, requiring new ways to envision analyses. Using the Ray Surveyor software, we demonstrate that comparison of genomes based on their k-mer content allows reconstruction of phenetic trees without the need of prior data curation, such as core genome alignment of a species. We validated the methodology using simulated genomes and previously published phylogenomic studies of Streptococcus pneumoniae and Pseudomonas aeruginosa. We also investigated the relationship of specific genetic determinants with bacterial population structures. By comparing clusters from the complete genomic content of a genome population with clusters from specific functional categories of genes, we can determine how the population structures are correlated. Indeed, the strain clustering based on a subset of k-mers allows determination of its similarity with the whole genome clusters. We also applied this methodology on 42 species of bacteria to determine the correlational significance of five important bacterial genomic characteristics. For example, intrinsic resistance is more important in P. aeruginosa than in S. pneumoniae, and the former has increased correlation of its population structure with antibiotic resistance genes. The global view of the pangenome of bacteria also demonstrated the taxa-dependent interaction of population structure with antibiotic resistance, bacteriophage, plasmid, and mobile element k-mer data sets.
format Online
Article
Text
id pubmed-5850840
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-58508402018-03-23 Phenetic Comparison of Prokaryotic Genomes Using k-mers Déraspe, Maxime Raymond, Frédéric Boisvert, Sébastien Culley, Alexander Roy, Paul H. Laviolette, François Corbeil, Jacques Mol Biol Evol Methods Bacterial genomics studies are getting more extensive and complex, requiring new ways to envision analyses. Using the Ray Surveyor software, we demonstrate that comparison of genomes based on their k-mer content allows reconstruction of phenetic trees without the need of prior data curation, such as core genome alignment of a species. We validated the methodology using simulated genomes and previously published phylogenomic studies of Streptococcus pneumoniae and Pseudomonas aeruginosa. We also investigated the relationship of specific genetic determinants with bacterial population structures. By comparing clusters from the complete genomic content of a genome population with clusters from specific functional categories of genes, we can determine how the population structures are correlated. Indeed, the strain clustering based on a subset of k-mers allows determination of its similarity with the whole genome clusters. We also applied this methodology on 42 species of bacteria to determine the correlational significance of five important bacterial genomic characteristics. For example, intrinsic resistance is more important in P. aeruginosa than in S. pneumoniae, and the former has increased correlation of its population structure with antibiotic resistance genes. The global view of the pangenome of bacteria also demonstrated the taxa-dependent interaction of population structure with antibiotic resistance, bacteriophage, plasmid, and mobile element k-mer data sets. Oxford University Press 2017-10 2017-07-16 /pmc/articles/PMC5850840/ /pubmed/28957508 http://dx.doi.org/10.1093/molbev/msx200 Text en © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Methods
Déraspe, Maxime
Raymond, Frédéric
Boisvert, Sébastien
Culley, Alexander
Roy, Paul H.
Laviolette, François
Corbeil, Jacques
Phenetic Comparison of Prokaryotic Genomes Using k-mers
title Phenetic Comparison of Prokaryotic Genomes Using k-mers
title_full Phenetic Comparison of Prokaryotic Genomes Using k-mers
title_fullStr Phenetic Comparison of Prokaryotic Genomes Using k-mers
title_full_unstemmed Phenetic Comparison of Prokaryotic Genomes Using k-mers
title_short Phenetic Comparison of Prokaryotic Genomes Using k-mers
title_sort phenetic comparison of prokaryotic genomes using k-mers
topic Methods
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5850840/
https://www.ncbi.nlm.nih.gov/pubmed/28957508
http://dx.doi.org/10.1093/molbev/msx200
work_keys_str_mv AT deraspemaxime pheneticcomparisonofprokaryoticgenomesusingkmers
AT raymondfrederic pheneticcomparisonofprokaryoticgenomesusingkmers
AT boisvertsebastien pheneticcomparisonofprokaryoticgenomesusingkmers
AT culleyalexander pheneticcomparisonofprokaryoticgenomesusingkmers
AT roypaulh pheneticcomparisonofprokaryoticgenomesusingkmers
AT laviolettefrancois pheneticcomparisonofprokaryoticgenomesusingkmers
AT corbeiljacques pheneticcomparisonofprokaryoticgenomesusingkmers