
micropan: an R-package for microbial pan-genomics

BACKGROUND: A pan-genome is defined as the set of all unique gene families found in one or more strains of a prokaryotic species. Due to the extensive within-species diversity in the microbial world, the pan-genome is often many times larger than a single genome. Studies of pan-genomes have become p...


Bibliographic Details
Main authors: Snipen, Lars, Liland, Kristian Hovde
Format: Online Article Text
Language: English
Published: BioMed Central 2015
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4375852/
https://www.ncbi.nlm.nih.gov/pubmed/25888166
http://dx.doi.org/10.1186/s12859-015-0517-0
_version_ 1782363646080319488
author Snipen, Lars
Liland, Kristian Hovde
author_facet Snipen, Lars
Liland, Kristian Hovde
author_sort Snipen, Lars
collection PubMed
description BACKGROUND: A pan-genome is defined as the set of all unique gene families found in one or more strains of a prokaryotic species. Due to the extensive within-species diversity in the microbial world, the pan-genome is often many times larger than a single genome. Studies of pan-genomes have become popular due to the easy access to whole-genome sequence data for prokaryotes. A pan-genome study reveals species diversity and gene families that may be of special interest, e.g., because of their role in bacterial survival or their ability to discriminate strains. RESULTS: We present an R package for the study of prokaryotic pan-genomes. The R computing environment harbors endless possibilities with respect to statistical analyses and graphics. External free software is used for the heavy computations involved, and the R package provides functions for building a computational pipeline. CONCLUSIONS: We demonstrate parts of the package on a data set for the Gram-positive bacterium Enterococcus faecalis. The package is free to download and install from The Comprehensive R Archive Network. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-015-0517-0) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4375852
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-43758522015-03-28 micropan: an R-package for microbial pan-genomics Snipen, Lars Liland, Kristian Hovde BMC Bioinformatics Software BACKGROUND: A pan-genome is defined as the set of all unique gene families found in one or more strains of a prokaryotic species. Due to the extensive within-species diversity in the microbial world, the pan-genome is often many times larger than a single genome. Studies of pan-genomes have become popular due to the easy access to whole-genome sequence data for prokaryotes. A pan-genome study reveals species diversity and gene families that may be of special interest, e.g., because of their role in bacterial survival or their ability to discriminate strains. RESULTS: We present an R package for the study of prokaryotic pan-genomes. The R computing environment harbors endless possibilities with respect to statistical analyses and graphics. External free software is used for the heavy computations involved, and the R package provides functions for building a computational pipeline. CONCLUSIONS: We demonstrate parts of the package on a data set for the Gram-positive bacterium Enterococcus faecalis. The package is free to download and install from The Comprehensive R Archive Network. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-015-0517-0) contains supplementary material, which is available to authorized users. BioMed Central 2015-03-12 /pmc/articles/PMC4375852/ /pubmed/25888166 http://dx.doi.org/10.1186/s12859-015-0517-0 Text en © Snipen and Liland; licensee BioMed Central. 2015 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited.
The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Software
Snipen, Lars
Liland, Kristian Hovde
micropan: an R-package for microbial pan-genomics
title micropan: an R-package for microbial pan-genomics
title_full micropan: an R-package for microbial pan-genomics
title_fullStr micropan: an R-package for microbial pan-genomics
title_full_unstemmed micropan: an R-package for microbial pan-genomics
title_short micropan: an R-package for microbial pan-genomics
title_sort micropan: an r-package for microbial pan-genomics
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4375852/
https://www.ncbi.nlm.nih.gov/pubmed/25888166
http://dx.doi.org/10.1186/s12859-015-0517-0
work_keys_str_mv AT snipenlars micropananrpackageformicrobialpangenomics
AT lilandkristianhovde micropananrpackageformicrobialpangenomics