Cargando…

PGAP-X: extension on pan-genome analysis pipeline

BACKGROUND: Since PGAP (pan-genome analysis pipeline) was published in 2012, it has been widely employed in bacterial genomics research. Though PGAP has integrated several modules for pan-genomics analysis, how to properly and effectively interpret and visualize the results data is still a challenge...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhao, Yongbing, Sun, Chen, Zhao, Dongyu, Zhang, Yadong, You, Yang, Jia, Xinmiao, Yang, Junhui, Wang, Lingping, Wang, Jinyue, Fu, Haohuan, Kang, Yu, Chen, Fei, Yu, Jun, Wu, Jiayan, Xiao, Jingfa
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5780747/
https://www.ncbi.nlm.nih.gov/pubmed/29363431
http://dx.doi.org/10.1186/s12864-017-4337-7
_version_ 1783294800163766272
author Zhao, Yongbing
Sun, Chen
Zhao, Dongyu
Zhang, Yadong
You, Yang
Jia, Xinmiao
Yang, Junhui
Wang, Lingping
Wang, Jinyue
Fu, Haohuan
Kang, Yu
Chen, Fei
Yu, Jun
Wu, Jiayan
Xiao, Jingfa
author_facet Zhao, Yongbing
Sun, Chen
Zhao, Dongyu
Zhang, Yadong
You, Yang
Jia, Xinmiao
Yang, Junhui
Wang, Lingping
Wang, Jinyue
Fu, Haohuan
Kang, Yu
Chen, Fei
Yu, Jun
Wu, Jiayan
Xiao, Jingfa
author_sort Zhao, Yongbing
collection PubMed
description BACKGROUND: Since PGAP (pan-genome analysis pipeline) was published in 2012, it has been widely employed in bacterial genomics research. Though PGAP has integrated several modules for pan-genomics analysis, how to properly and effectively interpret and visualize the results data is still a challenge. RESULT: To well present bacterial genomic characteristics, a novel cross-platform software was developed, named PGAP-X. Four kinds of data analysis modules were developed and integrated: whole genome sequences alignment, orthologous genes clustering, pan-genome profile analysis, and genetic variants analysis. The results from these analyses can be directly visualized in PGAP-X. The modules for data visualization in PGAP-X include: comparison of genome structure, gene distribution by conservation, pan-genome profile curve and variation on genic and genomic region. Meanwhile, result data produced by other programs with similar function can be imported to be further analyzed and visualized in PGAP-X. To test the performance of PGAP-X, we comprehensively analyzed 14 Streptococcus pneumonia strains and 14 Chlamydia trachomatis. The results show that, S. pneumonia strains have higher diversity on genome structure and gene contents than C. trachomatis strains. In addition, S. pneumonia strains might have suffered many evolutionary events, such genomic rearrangements, frequent horizontal gene transfer, homologous recombination, and other evolutionary process. CONCLUSION: Briefly, PGAP-X directly presents the characteristics of bacterial genomic diversity with different visualization methods, which could help us to intuitively understand dynamics and evolution in bacterial genomes. The source code and the pre-complied executable programs are freely available from http://pgapx.ybzhao.com. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi: 10.1186/s12864-017-4337-7) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-5780747
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-57807472018-02-06 PGAP-X: extension on pan-genome analysis pipeline Zhao, Yongbing Sun, Chen Zhao, Dongyu Zhang, Yadong You, Yang Jia, Xinmiao Yang, Junhui Wang, Lingping Wang, Jinyue Fu, Haohuan Kang, Yu Chen, Fei Yu, Jun Wu, Jiayan Xiao, Jingfa BMC Genomics Research BACKGROUND: Since PGAP (pan-genome analysis pipeline) was published in 2012, it has been widely employed in bacterial genomics research. Though PGAP has integrated several modules for pan-genomics analysis, how to properly and effectively interpret and visualize the results data is still a challenge. RESULT: To well present bacterial genomic characteristics, a novel cross-platform software was developed, named PGAP-X. Four kinds of data analysis modules were developed and integrated: whole genome sequences alignment, orthologous genes clustering, pan-genome profile analysis, and genetic variants analysis. The results from these analyses can be directly visualized in PGAP-X. The modules for data visualization in PGAP-X include: comparison of genome structure, gene distribution by conservation, pan-genome profile curve and variation on genic and genomic region. Meanwhile, result data produced by other programs with similar function can be imported to be further analyzed and visualized in PGAP-X. To test the performance of PGAP-X, we comprehensively analyzed 14 Streptococcus pneumonia strains and 14 Chlamydia trachomatis. The results show that, S. pneumonia strains have higher diversity on genome structure and gene contents than C. trachomatis strains. In addition, S. pneumonia strains might have suffered many evolutionary events, such genomic rearrangements, frequent horizontal gene transfer, homologous recombination, and other evolutionary process. CONCLUSION: Briefly, PGAP-X directly presents the characteristics of bacterial genomic diversity with different visualization methods, which could help us to intuitively understand dynamics and evolution in bacterial genomes. The source code and the pre-complied executable programs are freely available from http://pgapx.ybzhao.com. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi: 10.1186/s12864-017-4337-7) contains supplementary material, which is available to authorized users. BioMed Central 2018-01-19 /pmc/articles/PMC5780747/ /pubmed/29363431 http://dx.doi.org/10.1186/s12864-017-4337-7 Text en © The Author(s). 2018 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Zhao, Yongbing
Sun, Chen
Zhao, Dongyu
Zhang, Yadong
You, Yang
Jia, Xinmiao
Yang, Junhui
Wang, Lingping
Wang, Jinyue
Fu, Haohuan
Kang, Yu
Chen, Fei
Yu, Jun
Wu, Jiayan
Xiao, Jingfa
PGAP-X: extension on pan-genome analysis pipeline
title PGAP-X: extension on pan-genome analysis pipeline
title_full PGAP-X: extension on pan-genome analysis pipeline
title_fullStr PGAP-X: extension on pan-genome analysis pipeline
title_full_unstemmed PGAP-X: extension on pan-genome analysis pipeline
title_short PGAP-X: extension on pan-genome analysis pipeline
title_sort pgap-x: extension on pan-genome analysis pipeline
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5780747/
https://www.ncbi.nlm.nih.gov/pubmed/29363431
http://dx.doi.org/10.1186/s12864-017-4337-7
work_keys_str_mv AT zhaoyongbing pgapxextensiononpangenomeanalysispipeline
AT sunchen pgapxextensiononpangenomeanalysispipeline
AT zhaodongyu pgapxextensiononpangenomeanalysispipeline
AT zhangyadong pgapxextensiononpangenomeanalysispipeline
AT youyang pgapxextensiononpangenomeanalysispipeline
AT jiaxinmiao pgapxextensiononpangenomeanalysispipeline
AT yangjunhui pgapxextensiononpangenomeanalysispipeline
AT wanglingping pgapxextensiononpangenomeanalysispipeline
AT wangjinyue pgapxextensiononpangenomeanalysispipeline
AT fuhaohuan pgapxextensiononpangenomeanalysispipeline
AT kangyu pgapxextensiononpangenomeanalysispipeline
AT chenfei pgapxextensiononpangenomeanalysispipeline
AT yujun pgapxextensiononpangenomeanalysispipeline
AT wujiayan pgapxextensiononpangenomeanalysispipeline
AT xiaojingfa pgapxextensiononpangenomeanalysispipeline