Cargando…
Piggy: a rapid, large-scale pan-genome analysis tool for intergenic regions in bacteria
BACKGROUND: The concept of the “pan-genome,” which refers to the total complement of genes within a given sample or species, is well established in bacterial genomics. Rapid and scalable pipelines are available for managing and interpreting pan-genomes from large batches of annotated assemblies. How...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5890482/ https://www.ncbi.nlm.nih.gov/pubmed/29635296 http://dx.doi.org/10.1093/gigascience/giy015 |
_version_ | 1783312875695112192 |
---|---|
author | Thorpe, Harry A Bayliss, Sion C Sheppard, Samuel K Feil, Edward J |
author_facet | Thorpe, Harry A Bayliss, Sion C Sheppard, Samuel K Feil, Edward J |
author_sort | Thorpe, Harry A |
collection | PubMed |
description | BACKGROUND: The concept of the “pan-genome,” which refers to the total complement of genes within a given sample or species, is well established in bacterial genomics. Rapid and scalable pipelines are available for managing and interpreting pan-genomes from large batches of annotated assemblies. However, despite overwhelming evidence that variation in intergenic regions in bacteria can directly influence phenotypes, most current approaches for analyzing pan-genomes focus exclusively on protein-coding sequences. FINDINGS: To address this we present Piggy, a novel pipeline that emulates Roary except that it is based only on intergenic regions. A key utility provided by Piggy is the detection of highly divergent (“switched”) intergenic regions (IGRs) upstream of genes. We demonstrate the use of Piggy on large datasets of clinically important lineages of Staphylococcus aureus and Escherichia coli. CONCLUSIONS: For S. aureus, we show that highly divergent (switched) IGRs are associated with differences in gene expression and we establish a multilocus reference database of IGR alleles (igMLST; implemented in BIGSdb). |
format | Online Article Text |
id | pubmed-5890482 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-58904822018-04-13 Piggy: a rapid, large-scale pan-genome analysis tool for intergenic regions in bacteria Thorpe, Harry A Bayliss, Sion C Sheppard, Samuel K Feil, Edward J Gigascience Technical Note BACKGROUND: The concept of the “pan-genome,” which refers to the total complement of genes within a given sample or species, is well established in bacterial genomics. Rapid and scalable pipelines are available for managing and interpreting pan-genomes from large batches of annotated assemblies. However, despite overwhelming evidence that variation in intergenic regions in bacteria can directly influence phenotypes, most current approaches for analyzing pan-genomes focus exclusively on protein-coding sequences. FINDINGS: To address this we present Piggy, a novel pipeline that emulates Roary except that it is based only on intergenic regions. A key utility provided by Piggy is the detection of highly divergent (“switched”) intergenic regions (IGRs) upstream of genes. We demonstrate the use of Piggy on large datasets of clinically important lineages of Staphylococcus aureus and Escherichia coli. CONCLUSIONS: For S. aureus, we show that highly divergent (switched) IGRs are associated with differences in gene expression and we establish a multilocus reference database of IGR alleles (igMLST; implemented in BIGSdb). Oxford University Press 2018-03-04 /pmc/articles/PMC5890482/ /pubmed/29635296 http://dx.doi.org/10.1093/gigascience/giy015 Text en © The Author(s) 2018. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Technical Note Thorpe, Harry A Bayliss, Sion C Sheppard, Samuel K Feil, Edward J Piggy: a rapid, large-scale pan-genome analysis tool for intergenic regions in bacteria |
title | Piggy: a rapid, large-scale pan-genome analysis tool for intergenic regions in bacteria |
title_full | Piggy: a rapid, large-scale pan-genome analysis tool for intergenic regions in bacteria |
title_fullStr | Piggy: a rapid, large-scale pan-genome analysis tool for intergenic regions in bacteria |
title_full_unstemmed | Piggy: a rapid, large-scale pan-genome analysis tool for intergenic regions in bacteria |
title_short | Piggy: a rapid, large-scale pan-genome analysis tool for intergenic regions in bacteria |
title_sort | piggy: a rapid, large-scale pan-genome analysis tool for intergenic regions in bacteria |
topic | Technical Note |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5890482/ https://www.ncbi.nlm.nih.gov/pubmed/29635296 http://dx.doi.org/10.1093/gigascience/giy015 |
work_keys_str_mv | AT thorpeharrya piggyarapidlargescalepangenomeanalysistoolforintergenicregionsinbacteria AT baylisssionc piggyarapidlargescalepangenomeanalysistoolforintergenicregionsinbacteria AT sheppardsamuelk piggyarapidlargescalepangenomeanalysistoolforintergenicregionsinbacteria AT feiledwardj piggyarapidlargescalepangenomeanalysistoolforintergenicregionsinbacteria |