micropan: an R-package for microbial pan-genomics
BACKGROUND: A pan-genome is defined as the set of all unique gene families found in one or more strains of a prokaryotic species. Due to the extensive within-species diversity in the microbial world, the pan-genome is often many times larger than a single genome. Studies of pan-genomes have become p...
Main Authors: | Snipen, Lars; Liland, Kristian Hovde |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | BioMed Central 2015 |
Subjects: | Software |
Online Access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4375852/ https://www.ncbi.nlm.nih.gov/pubmed/25888166 http://dx.doi.org/10.1186/s12859-015-0517-0 |
_version_ | 1782363646080319488 |
---|---|
author | Snipen, Lars Liland, Kristian Hovde |
author_facet | Snipen, Lars Liland, Kristian Hovde |
author_sort | Snipen, Lars |
collection | PubMed |
description | BACKGROUND: A pan-genome is defined as the set of all unique gene families found in one or more strains of a prokaryotic species. Due to the extensive within-species diversity in the microbial world, the pan-genome is often many times larger than a single genome. Studies of pan-genomes have become popular due to the easy access to whole-genome sequence data for prokaryotes. A pan-genome study reveals species diversity and gene families that may be of special interest, e.g., because of their role in bacterial survival or their ability to discriminate strains. RESULTS: We present an R package for the study of prokaryotic pan-genomes. The R computing environment harbors endless possibilities with respect to statistical analyses and graphics. External free software is used for the heavy computations involved, and the R package provides functions for building a computational pipeline. CONCLUSIONS: We demonstrate parts of the package on a data set for the Gram-positive bacterium Enterococcus faecalis. The package is free to download and install from The Comprehensive R Archive Network. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-015-0517-0) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-4375852 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-43758522015-03-28 micropan: an R-package for microbial pan-genomics Snipen, Lars Liland, Kristian Hovde BMC Bioinformatics Software BACKGROUND: A pan-genome is defined as the set of all unique gene families found in one or more strains of a prokaryotic species. Due to the extensive within-species diversity in the microbial world, the pan-genome is often many times larger than a single genome. Studies of pan-genomes have become popular due to the easy access to whole-genome sequence data for prokaryotes. A pan-genome study reveals species diversity and gene families that may be of special interest, e.g., because of their role in bacterial survival or their ability to discriminate strains. RESULTS: We present an R package for the study of prokaryotic pan-genomes. The R computing environment harbors endless possibilities with respect to statistical analyses and graphics. External free software is used for the heavy computations involved, and the R package provides functions for building a computational pipeline. CONCLUSIONS: We demonstrate parts of the package on a data set for the Gram-positive bacterium Enterococcus faecalis. The package is free to download and install from The Comprehensive R Archive Network. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-015-0517-0) contains supplementary material, which is available to authorized users. BioMed Central 2015-03-12 /pmc/articles/PMC4375852/ /pubmed/25888166 http://dx.doi.org/10.1186/s12859-015-0517-0 Text en © Snipen and Liland; licensee BioMed Central. 2015 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited.
The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Software Snipen, Lars Liland, Kristian Hovde micropan: an R-package for microbial pan-genomics |
title | micropan: an R-package for microbial pan-genomics |
title_full | micropan: an R-package for microbial pan-genomics |
title_fullStr | micropan: an R-package for microbial pan-genomics |
title_full_unstemmed | micropan: an R-package for microbial pan-genomics |
title_short | micropan: an R-package for microbial pan-genomics |
title_sort | micropan: an r-package for microbial pan-genomics |
topic | Software |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4375852/ https://www.ncbi.nlm.nih.gov/pubmed/25888166 http://dx.doi.org/10.1186/s12859-015-0517-0 |
work_keys_str_mv | AT snipenlars micropananrpackageformicrobialpangenomics AT lilandkristianhovde micropananrpackageformicrobialpangenomics |