Cargando…

Piggy: a rapid, large-scale pan-genome analysis tool for intergenic regions in bacteria

BACKGROUND: The concept of the “pan-genome,” which refers to the total complement of genes within a given sample or species, is well established in bacterial genomics. Rapid and scalable pipelines are available for managing and interpreting pan-genomes from large batches of annotated assemblies. How...

Descripción completa

Detalles Bibliográficos
Autores principales: Thorpe, Harry A, Bayliss, Sion C, Sheppard, Samuel K, Feil, Edward J
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5890482/
https://www.ncbi.nlm.nih.gov/pubmed/29635296
http://dx.doi.org/10.1093/gigascience/giy015
_version_ 1783312875695112192
author Thorpe, Harry A
Bayliss, Sion C
Sheppard, Samuel K
Feil, Edward J
author_facet Thorpe, Harry A
Bayliss, Sion C
Sheppard, Samuel K
Feil, Edward J
author_sort Thorpe, Harry A
collection PubMed
description BACKGROUND: The concept of the “pan-genome,” which refers to the total complement of genes within a given sample or species, is well established in bacterial genomics. Rapid and scalable pipelines are available for managing and interpreting pan-genomes from large batches of annotated assemblies. However, despite overwhelming evidence that variation in intergenic regions in bacteria can directly influence phenotypes, most current approaches for analyzing pan-genomes focus exclusively on protein-coding sequences. FINDINGS: To address this we present Piggy, a novel pipeline that emulates Roary except that it is based only on intergenic regions. A key utility provided by Piggy is the detection of highly divergent (“switched”) intergenic regions (IGRs) upstream of genes. We demonstrate the use of Piggy on large datasets of clinically important lineages of Staphylococcus aureus and Escherichia coli. CONCLUSIONS: For S. aureus, we show that highly divergent (switched) IGRs are associated with differences in gene expression and we establish a multilocus reference database of IGR alleles (igMLST; implemented in BIGSdb).
format Online
Article
Text
id pubmed-5890482
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-58904822018-04-13 Piggy: a rapid, large-scale pan-genome analysis tool for intergenic regions in bacteria Thorpe, Harry A Bayliss, Sion C Sheppard, Samuel K Feil, Edward J Gigascience Technical Note BACKGROUND: The concept of the “pan-genome,” which refers to the total complement of genes within a given sample or species, is well established in bacterial genomics. Rapid and scalable pipelines are available for managing and interpreting pan-genomes from large batches of annotated assemblies. However, despite overwhelming evidence that variation in intergenic regions in bacteria can directly influence phenotypes, most current approaches for analyzing pan-genomes focus exclusively on protein-coding sequences. FINDINGS: To address this we present Piggy, a novel pipeline that emulates Roary except that it is based only on intergenic regions. A key utility provided by Piggy is the detection of highly divergent (“switched”) intergenic regions (IGRs) upstream of genes. We demonstrate the use of Piggy on large datasets of clinically important lineages of Staphylococcus aureus and Escherichia coli. CONCLUSIONS: For S. aureus, we show that highly divergent (switched) IGRs are associated with differences in gene expression and we establish a multilocus reference database of IGR alleles (igMLST; implemented in BIGSdb). Oxford University Press 2018-03-04 /pmc/articles/PMC5890482/ /pubmed/29635296 http://dx.doi.org/10.1093/gigascience/giy015 Text en © The Author(s) 2018. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Technical Note
Thorpe, Harry A
Bayliss, Sion C
Sheppard, Samuel K
Feil, Edward J
Piggy: a rapid, large-scale pan-genome analysis tool for intergenic regions in bacteria
title Piggy: a rapid, large-scale pan-genome analysis tool for intergenic regions in bacteria
title_full Piggy: a rapid, large-scale pan-genome analysis tool for intergenic regions in bacteria
title_fullStr Piggy: a rapid, large-scale pan-genome analysis tool for intergenic regions in bacteria
title_full_unstemmed Piggy: a rapid, large-scale pan-genome analysis tool for intergenic regions in bacteria
title_short Piggy: a rapid, large-scale pan-genome analysis tool for intergenic regions in bacteria
title_sort piggy: a rapid, large-scale pan-genome analysis tool for intergenic regions in bacteria
topic Technical Note
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5890482/
https://www.ncbi.nlm.nih.gov/pubmed/29635296
http://dx.doi.org/10.1093/gigascience/giy015
work_keys_str_mv AT thorpeharrya piggyarapidlargescalepangenomeanalysistoolforintergenicregionsinbacteria
AT baylisssionc piggyarapidlargescalepangenomeanalysistoolforintergenicregionsinbacteria
AT sheppardsamuelk piggyarapidlargescalepangenomeanalysistoolforintergenicregionsinbacteria
AT feiledwardj piggyarapidlargescalepangenomeanalysistoolforintergenicregionsinbacteria