Cargando…

Lactobacillus paracasei Comparative Genomics: Towards Species Pan-Genome Definition and Exploitation of Diversity

Lactobacillus paracasei is a member of the normal human and animal gut microbiota and is used extensively in the food industry in starter cultures for dairy products or as probiotics. With the development of low-cost, high-throughput sequencing techniques it has become feasible to sequence many diff...

Descripción completa

Detalles Bibliográficos
Autores principales: Smokvina, Tamara, Wels, Michiel, Polka, Justyna, Chervaux, Christian, Brisse, Sylvain, Boekhorst, Jos, Vlieg, Johan E. T. van Hylckama, Siezen, Roland J.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3716772/
https://www.ncbi.nlm.nih.gov/pubmed/23894338
http://dx.doi.org/10.1371/journal.pone.0068731
_version_ 1782277596346580992
author Smokvina, Tamara
Wels, Michiel
Polka, Justyna
Chervaux, Christian
Brisse, Sylvain
Boekhorst, Jos
Vlieg, Johan E. T. van Hylckama
Siezen, Roland J.
author_facet Smokvina, Tamara
Wels, Michiel
Polka, Justyna
Chervaux, Christian
Brisse, Sylvain
Boekhorst, Jos
Vlieg, Johan E. T. van Hylckama
Siezen, Roland J.
author_sort Smokvina, Tamara
collection PubMed
description Lactobacillus paracasei is a member of the normal human and animal gut microbiota and is used extensively in the food industry in starter cultures for dairy products or as probiotics. With the development of low-cost, high-throughput sequencing techniques it has become feasible to sequence many different strains of one species and to determine its “pan-genome”. We have sequenced the genomes of 34 different L. paracasei strains, and performed a comparative genomics analysis. We analysed genome synteny and content, focussing on the pan-genome, core genome and variable genome. Each genome was shown to contain around 2800–3100 protein-coding genes, and comparative analysis identified over 4200 ortholog groups that comprise the pan-genome of this species, of which about 1800 ortholog groups make up the conserved core. Several factors previously associated with host-microbe interactions such as pili, cell-envelope proteinase, hydrolases p40 and p75 or the capacity to produce short branched-chain fatty acids (bkd operon) are part of the L. paracasei core genome present in all analysed strains. The variome consists mainly of hypothetical proteins, phages, plasmids, transposon/conjugative elements, and known functions such as sugar metabolism, cell-surface proteins, transporters, CRISPR-associated proteins, and EPS biosynthesis proteins. An enormous variety and variability of sugar utilization gene cassettes were identified, with each strain harbouring between 25–53 cassettes, reflecting the high adaptability of L. paracasei to different niches. A phylogenomic tree was constructed based on total genome contents, and together with an analysis of horizontal gene transfer events we conclude that evolution of these L. paracasei strains is complex and not always related to niche adaptation. The results of this genome content comparison was used, together with high-throughput growth experiments on various carbohydrates, to perform gene-trait matching analysis, in order to link the distribution pattern of a specific phenotype to the presence/absence of specific sets of genes.
format Online
Article
Text
id pubmed-3716772
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-37167722013-07-26 Lactobacillus paracasei Comparative Genomics: Towards Species Pan-Genome Definition and Exploitation of Diversity Smokvina, Tamara Wels, Michiel Polka, Justyna Chervaux, Christian Brisse, Sylvain Boekhorst, Jos Vlieg, Johan E. T. van Hylckama Siezen, Roland J. PLoS One Research Article Lactobacillus paracasei is a member of the normal human and animal gut microbiota and is used extensively in the food industry in starter cultures for dairy products or as probiotics. With the development of low-cost, high-throughput sequencing techniques it has become feasible to sequence many different strains of one species and to determine its “pan-genome”. We have sequenced the genomes of 34 different L. paracasei strains, and performed a comparative genomics analysis. We analysed genome synteny and content, focussing on the pan-genome, core genome and variable genome. Each genome was shown to contain around 2800–3100 protein-coding genes, and comparative analysis identified over 4200 ortholog groups that comprise the pan-genome of this species, of which about 1800 ortholog groups make up the conserved core. Several factors previously associated with host-microbe interactions such as pili, cell-envelope proteinase, hydrolases p40 and p75 or the capacity to produce short branched-chain fatty acids (bkd operon) are part of the L. paracasei core genome present in all analysed strains. The variome consists mainly of hypothetical proteins, phages, plasmids, transposon/conjugative elements, and known functions such as sugar metabolism, cell-surface proteins, transporters, CRISPR-associated proteins, and EPS biosynthesis proteins. An enormous variety and variability of sugar utilization gene cassettes were identified, with each strain harbouring between 25–53 cassettes, reflecting the high adaptability of L. paracasei to different niches. A phylogenomic tree was constructed based on total genome contents, and together with an analysis of horizontal gene transfer events we conclude that evolution of these L. paracasei strains is complex and not always related to niche adaptation. The results of this genome content comparison was used, together with high-throughput growth experiments on various carbohydrates, to perform gene-trait matching analysis, in order to link the distribution pattern of a specific phenotype to the presence/absence of specific sets of genes. Public Library of Science 2013-07-19 /pmc/articles/PMC3716772/ /pubmed/23894338 http://dx.doi.org/10.1371/journal.pone.0068731 Text en © 2013 Smokvina et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Smokvina, Tamara
Wels, Michiel
Polka, Justyna
Chervaux, Christian
Brisse, Sylvain
Boekhorst, Jos
Vlieg, Johan E. T. van Hylckama
Siezen, Roland J.
Lactobacillus paracasei Comparative Genomics: Towards Species Pan-Genome Definition and Exploitation of Diversity
title Lactobacillus paracasei Comparative Genomics: Towards Species Pan-Genome Definition and Exploitation of Diversity
title_full Lactobacillus paracasei Comparative Genomics: Towards Species Pan-Genome Definition and Exploitation of Diversity
title_fullStr Lactobacillus paracasei Comparative Genomics: Towards Species Pan-Genome Definition and Exploitation of Diversity
title_full_unstemmed Lactobacillus paracasei Comparative Genomics: Towards Species Pan-Genome Definition and Exploitation of Diversity
title_short Lactobacillus paracasei Comparative Genomics: Towards Species Pan-Genome Definition and Exploitation of Diversity
title_sort lactobacillus paracasei comparative genomics: towards species pan-genome definition and exploitation of diversity
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3716772/
https://www.ncbi.nlm.nih.gov/pubmed/23894338
http://dx.doi.org/10.1371/journal.pone.0068731
work_keys_str_mv AT smokvinatamara lactobacillusparacaseicomparativegenomicstowardsspeciespangenomedefinitionandexploitationofdiversity
AT welsmichiel lactobacillusparacaseicomparativegenomicstowardsspeciespangenomedefinitionandexploitationofdiversity
AT polkajustyna lactobacillusparacaseicomparativegenomicstowardsspeciespangenomedefinitionandexploitationofdiversity
AT chervauxchristian lactobacillusparacaseicomparativegenomicstowardsspeciespangenomedefinitionandexploitationofdiversity
AT brissesylvain lactobacillusparacaseicomparativegenomicstowardsspeciespangenomedefinitionandexploitationofdiversity
AT boekhorstjos lactobacillusparacaseicomparativegenomicstowardsspeciespangenomedefinitionandexploitationofdiversity
AT vliegjohanetvanhylckama lactobacillusparacaseicomparativegenomicstowardsspeciespangenomedefinitionandexploitationofdiversity
AT siezenrolandj lactobacillusparacaseicomparativegenomicstowardsspeciespangenomedefinitionandexploitationofdiversity