Epi-Gene: An R-Package for Easy Pan-Genome Analysis

The main aim of this study was to develop a set of functions that analyze genomic data using less time and memory. Epi-gene is presented as a solution to the problems of handling large sequence files and long computation times. It requires less time and fewer programming skills to work wi...

Bibliographic Details
Main authors: Awan, Furqan; Ali, Muhammad Muddassir; Hamid, Muhammad; Awan, Muhammad Huzair; Mushtaq, Muhammad Hassan; Kalsoom, Saeeda; Ijaz, Muhammad; Mehmood, Khalid; Liu, Yongjie
Format: Online Article Text
Language: English
Published: Hindawi 2021
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8478537/
https://www.ncbi.nlm.nih.gov/pubmed/34595238
http://dx.doi.org/10.1155/2021/5585586
author Awan, Furqan
Ali, Muhammad Muddassir
Hamid, Muhammad
Awan, Muhammad Huzair
Mushtaq, Muhammad Hassan
Kalsoom, Saeeda
Ijaz, Muhammad
Mehmood, Khalid
Liu, Yongjie
collection PubMed
description The main aim of this study was to develop a set of functions that analyze genomic data using less time and memory. Epi-gene is presented as a solution to the problems of handling large sequence files and long computation times. It requires less time and fewer programming skills to work with a large number of genomes. In the current study, some features of the Epi-gene R package are described and illustrated using a dataset of 14 Aeromonas hydrophila genomes. Joining, relabeling, and conversion functions are also included in the package to handle FASTA-formatted sequences. Various Epi-gene functions were used to calculate the subsets of core, accessory, and unique genes. Heat maps and phylogenetic genome trees were also constructed. The whole procedure was completed in less than 30 minutes. The package works only on Windows operating systems. Functions from other packages available in the R computing environment, such as dplyr and ggtree, were also used.
format Online
Article
Text
id pubmed-8478537
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Hindawi
record_format MEDLINE/PubMed
spelling pubmed-8478537 2021-09-29 Epi-Gene: An R-Package for Easy Pan-Genome Analysis Awan, Furqan Ali, Muhammad Muddassir Hamid, Muhammad Awan, Muhammad Huzair Mushtaq, Muhammad Hassan Kalsoom, Saeeda Ijaz, Muhammad Mehmood, Khalid Liu, Yongjie Biomed Res Int Research Article The main aim of this study was to develop a set of functions that can analyze the genomic data with less time consumption and memory. Epi-gene is presented as a solution to large sequence file handling and computational time problems. It uses less time and less programming skills in order to work with a large number of genomes. In the current study, some features of the Epi-gene R-package were described and illustrated by using a dataset of the 14 Aeromonas hydrophila genomes. The joining, relabeling, and conversion functions were also included in this package to handle the FASTA formatted sequences. To calculate the subsets of core genes, accessory genes, and unique genes, various Epi-gene functions have been used. Heat maps and phylogenetic genome trees were also constructed. This whole procedure was completed in less than 30 minutes. This package can only work on Windows operating systems. Different functions from other packages such as dplyr and ggtree were also used that were available in R computing environment. Hindawi 2021-09-20 /pmc/articles/PMC8478537/ /pubmed/34595238 http://dx.doi.org/10.1155/2021/5585586 Text en Copyright © 2021 Furqan Awan et al. https://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
title Epi-Gene: An R-Package for Easy Pan-Genome Analysis
topic Research Article