Cargando…

Predicting genome terminus sequences of Bacillus cereus-group bacteriophage using next generation sequencing data

BACKGROUND: Most tailed bacteriophages (phages) feature linear dsDNA genomes. Characterizing novel phages requires an understanding of complete genome sequences, including the definition of genome physical ends. RESULT: We sequenced 48 Bacillus cereus phage isolates and analyzed Next-generation sequ...

Descripción completa

Detalles Bibliográficos
Autores principales: Chung, Cheng-Han, Walter, Michael H., Yang, Luobin, Chen, Shu-Chuan (Grace), Winston, Vern, Thomas, Michael A.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5418689/
https://www.ncbi.nlm.nih.gov/pubmed/28472946
http://dx.doi.org/10.1186/s12864-017-3744-0
_version_ 1783234089632923648
author Chung, Cheng-Han
Walter, Michael H.
Yang, Luobin
Chen, Shu-Chuan (Grace)
Winston, Vern
Thomas, Michael A.
author_facet Chung, Cheng-Han
Walter, Michael H.
Yang, Luobin
Chen, Shu-Chuan (Grace)
Winston, Vern
Thomas, Michael A.
author_sort Chung, Cheng-Han
collection PubMed
description BACKGROUND: Most tailed bacteriophages (phages) feature linear dsDNA genomes. Characterizing novel phages requires an understanding of complete genome sequences, including the definition of genome physical ends. RESULT: We sequenced 48 Bacillus cereus phage isolates and analyzed Next-generation sequencing (NGS) data to resolve the genome configuration of these novel phages. Most assembled contigs featured reads that mapped to both contig ends and formed circularized contigs. Independent assemblies of 31 nearly identical I48-like Bacillus phage isolates allowed us to observe that the assembly programs tended to produce random cleavage on circularized contigs. However, currently available assemblers were not capable of reporting the underlying phage genome configuration from sequence data. To identify the genome configuration of sequenced phage in silico, a terminus prediction method was developed by means of ‘neighboring coverage ratios’ and ‘read edge frequencies’ from read alignment files. Termini were confirmed by primer walking and supported by phylogenetic inference of large DNA terminase protein sequences. CONCLUSIONS: The Terminus package using phage NGS data along with the contig circularity could efficiently identify the proximal positions of phage genome terminus. Complete phage genome sequences allow a proposed characterization of the potential packaging mechanisms and more precise genome annotation. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-017-3744-0) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-5418689
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-54186892017-05-08 Predicting genome terminus sequences of Bacillus cereus-group bacteriophage using next generation sequencing data Chung, Cheng-Han Walter, Michael H. Yang, Luobin Chen, Shu-Chuan (Grace) Winston, Vern Thomas, Michael A. BMC Genomics Research Article BACKGROUND: Most tailed bacteriophages (phages) feature linear dsDNA genomes. Characterizing novel phages requires an understanding of complete genome sequences, including the definition of genome physical ends. RESULT: We sequenced 48 Bacillus cereus phage isolates and analyzed Next-generation sequencing (NGS) data to resolve the genome configuration of these novel phages. Most assembled contigs featured reads that mapped to both contig ends and formed circularized contigs. Independent assemblies of 31 nearly identical I48-like Bacillus phage isolates allowed us to observe that the assembly programs tended to produce random cleavage on circularized contigs. However, currently available assemblers were not capable of reporting the underlying phage genome configuration from sequence data. To identify the genome configuration of sequenced phage in silico, a terminus prediction method was developed by means of ‘neighboring coverage ratios’ and ‘read edge frequencies’ from read alignment files. Termini were confirmed by primer walking and supported by phylogenetic inference of large DNA terminase protein sequences. CONCLUSIONS: The Terminus package using phage NGS data along with the contig circularity could efficiently identify the proximal positions of phage genome terminus. Complete phage genome sequences allow a proposed characterization of the potential packaging mechanisms and more precise genome annotation. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-017-3744-0) contains supplementary material, which is available to authorized users. BioMed Central 2017-05-04 /pmc/articles/PMC5418689/ /pubmed/28472946 http://dx.doi.org/10.1186/s12864-017-3744-0 Text en © The Author(s). 2017 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Chung, Cheng-Han
Walter, Michael H.
Yang, Luobin
Chen, Shu-Chuan (Grace)
Winston, Vern
Thomas, Michael A.
Predicting genome terminus sequences of Bacillus cereus-group bacteriophage using next generation sequencing data
title Predicting genome terminus sequences of Bacillus cereus-group bacteriophage using next generation sequencing data
title_full Predicting genome terminus sequences of Bacillus cereus-group bacteriophage using next generation sequencing data
title_fullStr Predicting genome terminus sequences of Bacillus cereus-group bacteriophage using next generation sequencing data
title_full_unstemmed Predicting genome terminus sequences of Bacillus cereus-group bacteriophage using next generation sequencing data
title_short Predicting genome terminus sequences of Bacillus cereus-group bacteriophage using next generation sequencing data
title_sort predicting genome terminus sequences of bacillus cereus-group bacteriophage using next generation sequencing data
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5418689/
https://www.ncbi.nlm.nih.gov/pubmed/28472946
http://dx.doi.org/10.1186/s12864-017-3744-0
work_keys_str_mv AT chungchenghan predictinggenometerminussequencesofbacilluscereusgroupbacteriophageusingnextgenerationsequencingdata
AT waltermichaelh predictinggenometerminussequencesofbacilluscereusgroupbacteriophageusingnextgenerationsequencingdata
AT yangluobin predictinggenometerminussequencesofbacilluscereusgroupbacteriophageusingnextgenerationsequencingdata
AT chenshuchuangrace predictinggenometerminussequencesofbacilluscereusgroupbacteriophageusingnextgenerationsequencingdata
AT winstonvern predictinggenometerminussequencesofbacilluscereusgroupbacteriophageusingnextgenerationsequencingdata
AT thomasmichaela predictinggenometerminussequencesofbacilluscereusgroupbacteriophageusingnextgenerationsequencingdata