Cargando…
Epi-Gene: An R-Package for Easy Pan-Genome Analysis
The main aim of this study was to develop a set of functions that can analyze the genomic data with less time consumption and memory. Epi-gene is presented as a solution to large sequence file handling and computational time problems. It uses less time and less programming skills in order to work wi...
Autores principales: | , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Hindawi
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8478537/ https://www.ncbi.nlm.nih.gov/pubmed/34595238 http://dx.doi.org/10.1155/2021/5585586 |
_version_ | 1784576077718355968 |
---|---|
author | Awan, Furqan Ali, Muhammad Muddassir Hamid, Muhammad Awan, Muhammad Huzair Mushtaq, Muhammad Hassan Kalsoom, Saeeda Ijaz, Muhammad Mehmood, Khalid Liu, Yongjie |
author_facet | Awan, Furqan Ali, Muhammad Muddassir Hamid, Muhammad Awan, Muhammad Huzair Mushtaq, Muhammad Hassan Kalsoom, Saeeda Ijaz, Muhammad Mehmood, Khalid Liu, Yongjie |
author_sort | Awan, Furqan |
collection | PubMed |
description | The main aim of this study was to develop a set of functions that can analyze the genomic data with less time consumption and memory. Epi-gene is presented as a solution to large sequence file handling and computational time problems. It uses less time and less programming skills in order to work with a large number of genomes. In the current study, some features of the Epi-gene R-package were described and illustrated by using a dataset of the 14 Aeromonas hydrophila genomes. The joining, relabeling, and conversion functions were also included in this package to handle the FASTA formatted sequences. To calculate the subsets of core genes, accessory genes, and unique genes, various Epi-gene functions have been used. Heat maps and phylogenetic genome trees were also constructed. This whole procedure was completed in less than 30 minutes. This package can only work on Windows operating systems. Different functions from other packages such as dplyr and ggtree were also used that were available in R computing environment. |
format | Online Article Text |
id | pubmed-8478537 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Hindawi |
record_format | MEDLINE/PubMed |
spelling | pubmed-84785372021-09-29 Epi-Gene: An R-Package for Easy Pan-Genome Analysis Awan, Furqan Ali, Muhammad Muddassir Hamid, Muhammad Awan, Muhammad Huzair Mushtaq, Muhammad Hassan Kalsoom, Saeeda Ijaz, Muhammad Mehmood, Khalid Liu, Yongjie Biomed Res Int Research Article The main aim of this study was to develop a set of functions that can analyze the genomic data with less time consumption and memory. Epi-gene is presented as a solution to large sequence file handling and computational time problems. It uses less time and less programming skills in order to work with a large number of genomes. In the current study, some features of the Epi-gene R-package were described and illustrated by using a dataset of the 14 Aeromonas hydrophila genomes. The joining, relabeling, and conversion functions were also included in this package to handle the FASTA formatted sequences. To calculate the subsets of core genes, accessory genes, and unique genes, various Epi-gene functions have been used. Heat maps and phylogenetic genome trees were also constructed. This whole procedure was completed in less than 30 minutes. This package can only work on Windows operating systems. Different functions from other packages such as dplyr and ggtree were also used that were available in R computing environment. Hindawi 2021-09-20 /pmc/articles/PMC8478537/ /pubmed/34595238 http://dx.doi.org/10.1155/2021/5585586 Text en Copyright © 2021 Furqan Awan et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Awan, Furqan Ali, Muhammad Muddassir Hamid, Muhammad Awan, Muhammad Huzair Mushtaq, Muhammad Hassan Kalsoom, Saeeda Ijaz, Muhammad Mehmood, Khalid Liu, Yongjie Epi-Gene: An R-Package for Easy Pan-Genome Analysis |
title | Epi-Gene: An R-Package for Easy Pan-Genome Analysis |
title_full | Epi-Gene: An R-Package for Easy Pan-Genome Analysis |
title_fullStr | Epi-Gene: An R-Package for Easy Pan-Genome Analysis |
title_full_unstemmed | Epi-Gene: An R-Package for Easy Pan-Genome Analysis |
title_short | Epi-Gene: An R-Package for Easy Pan-Genome Analysis |
title_sort | epi-gene: an r-package for easy pan-genome analysis |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8478537/ https://www.ncbi.nlm.nih.gov/pubmed/34595238 http://dx.doi.org/10.1155/2021/5585586 |
work_keys_str_mv | AT awanfurqan epigeneanrpackageforeasypangenomeanalysis AT alimuhammadmuddassir epigeneanrpackageforeasypangenomeanalysis AT hamidmuhammad epigeneanrpackageforeasypangenomeanalysis AT awanmuhammadhuzair epigeneanrpackageforeasypangenomeanalysis AT mushtaqmuhammadhassan epigeneanrpackageforeasypangenomeanalysis AT kalsoomsaeeda epigeneanrpackageforeasypangenomeanalysis AT ijazmuhammad epigeneanrpackageforeasypangenomeanalysis AT mehmoodkhalid epigeneanrpackageforeasypangenomeanalysis AT liuyongjie epigeneanrpackageforeasypangenomeanalysis |