Cargando…

Whole genome assembly of a natto production strain Bacillus subtilis natto from very short read data

BACKGROUND: Bacillus subtilis natto is closely related to the laboratory standard strain B. subtilis Marburg 168, and functions as a starter for the production of the traditional Japanese food "natto" made from soybeans. Although re-sequencing whole genomes of several laboratory domesticat...

Descripción completa

Detalles Bibliográficos
Autores principales: Nishito, Yukari, Osana, Yasunori, Hachiya, Tsuyoshi, Popendorf, Kris, Toyoda, Atsushi, Fujiyama, Asao, Itaya, Mitsuhiro, Sakakibara, Yasubumi
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2867830/
https://www.ncbi.nlm.nih.gov/pubmed/20398357
http://dx.doi.org/10.1186/1471-2164-11-243
_version_ 1782181005029801984
author Nishito, Yukari
Osana, Yasunori
Hachiya, Tsuyoshi
Popendorf, Kris
Toyoda, Atsushi
Fujiyama, Asao
Itaya, Mitsuhiro
Sakakibara, Yasubumi
author_facet Nishito, Yukari
Osana, Yasunori
Hachiya, Tsuyoshi
Popendorf, Kris
Toyoda, Atsushi
Fujiyama, Asao
Itaya, Mitsuhiro
Sakakibara, Yasubumi
author_sort Nishito, Yukari
collection PubMed
description BACKGROUND: Bacillus subtilis natto is closely related to the laboratory standard strain B. subtilis Marburg 168, and functions as a starter for the production of the traditional Japanese food "natto" made from soybeans. Although re-sequencing whole genomes of several laboratory domesticated B. subtilis 168 derivatives has already been attempted using short read sequencing data, the assembly of the whole genome sequence of a closely related strain, B. subtilis natto, from very short read data is more challenging, particularly with our aim to assemble one fully connected scaffold from short reads around 35 bp in length. RESULTS: We applied a comparative genome assembly method, which combines de novo assembly and reference guided assembly, to one of the B. subtilis natto strains. We successfully assembled 28 scaffolds and managed to avoid substantial fragmentation. Completion of the assembly through long PCR experiments resulted in one connected scaffold for B. subtilis natto. Based on the assembled genome sequence, our orthologous gene analysis between natto BEST195 and Marburg 168 revealed that 82.4% of 4375 predicted genes in BEST195 are one-to-one orthologous to genes in 168, with two genes in-paralog, 3.2% are deleted in 168, 14.3% are inserted in BEST195, and 5.9% of genes present in 168 are deleted in BEST195. The natto genome contains the same alleles in the promoter region of degQ and the coding region of swrAA as the wild strain, RO-FF-1. These are specific for γ-PGA production ability, which is related to natto production. Further, the B. subtilis natto strain completely lacked a polyketide synthesis operon, disrupted the plipastatin production operon, and possesses previously unidentified transposases. CONCLUSIONS: The determination of the whole genome sequence of Bacillus subtilis natto provided detailed analyses of a set of genes related to natto production, demonstrating the number and locations of insertion sequences that B. subtilis natto harbors but B. subtilis 168 lacks. Multiple genome-level comparisons among five closely related Bacillus species were also carried out. The determined genome sequence of B. subtilis natto and gene annotations are available from the Natto genome browser http://natto-genome.org/.
format Text
id pubmed-2867830
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-28678302010-05-12 Whole genome assembly of a natto production strain Bacillus subtilis natto from very short read data Nishito, Yukari Osana, Yasunori Hachiya, Tsuyoshi Popendorf, Kris Toyoda, Atsushi Fujiyama, Asao Itaya, Mitsuhiro Sakakibara, Yasubumi BMC Genomics Research Article BACKGROUND: Bacillus subtilis natto is closely related to the laboratory standard strain B. subtilis Marburg 168, and functions as a starter for the production of the traditional Japanese food "natto" made from soybeans. Although re-sequencing whole genomes of several laboratory domesticated B. subtilis 168 derivatives has already been attempted using short read sequencing data, the assembly of the whole genome sequence of a closely related strain, B. subtilis natto, from very short read data is more challenging, particularly with our aim to assemble one fully connected scaffold from short reads around 35 bp in length. RESULTS: We applied a comparative genome assembly method, which combines de novo assembly and reference guided assembly, to one of the B. subtilis natto strains. We successfully assembled 28 scaffolds and managed to avoid substantial fragmentation. Completion of the assembly through long PCR experiments resulted in one connected scaffold for B. subtilis natto. Based on the assembled genome sequence, our orthologous gene analysis between natto BEST195 and Marburg 168 revealed that 82.4% of 4375 predicted genes in BEST195 are one-to-one orthologous to genes in 168, with two genes in-paralog, 3.2% are deleted in 168, 14.3% are inserted in BEST195, and 5.9% of genes present in 168 are deleted in BEST195. The natto genome contains the same alleles in the promoter region of degQ and the coding region of swrAA as the wild strain, RO-FF-1. These are specific for γ-PGA production ability, which is related to natto production. Further, the B. subtilis natto strain completely lacked a polyketide synthesis operon, disrupted the plipastatin production operon, and possesses previously unidentified transposases. CONCLUSIONS: The determination of the whole genome sequence of Bacillus subtilis natto provided detailed analyses of a set of genes related to natto production, demonstrating the number and locations of insertion sequences that B. subtilis natto harbors but B. subtilis 168 lacks. Multiple genome-level comparisons among five closely related Bacillus species were also carried out. The determined genome sequence of B. subtilis natto and gene annotations are available from the Natto genome browser http://natto-genome.org/. BioMed Central 2010-04-16 /pmc/articles/PMC2867830/ /pubmed/20398357 http://dx.doi.org/10.1186/1471-2164-11-243 Text en Copyright ©2010 Nishito et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Nishito, Yukari
Osana, Yasunori
Hachiya, Tsuyoshi
Popendorf, Kris
Toyoda, Atsushi
Fujiyama, Asao
Itaya, Mitsuhiro
Sakakibara, Yasubumi
Whole genome assembly of a natto production strain Bacillus subtilis natto from very short read data
title Whole genome assembly of a natto production strain Bacillus subtilis natto from very short read data
title_full Whole genome assembly of a natto production strain Bacillus subtilis natto from very short read data
title_fullStr Whole genome assembly of a natto production strain Bacillus subtilis natto from very short read data
title_full_unstemmed Whole genome assembly of a natto production strain Bacillus subtilis natto from very short read data
title_short Whole genome assembly of a natto production strain Bacillus subtilis natto from very short read data
title_sort whole genome assembly of a natto production strain bacillus subtilis natto from very short read data
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2867830/
https://www.ncbi.nlm.nih.gov/pubmed/20398357
http://dx.doi.org/10.1186/1471-2164-11-243
work_keys_str_mv AT nishitoyukari wholegenomeassemblyofanattoproductionstrainbacillussubtilisnattofromveryshortreaddata
AT osanayasunori wholegenomeassemblyofanattoproductionstrainbacillussubtilisnattofromveryshortreaddata
AT hachiyatsuyoshi wholegenomeassemblyofanattoproductionstrainbacillussubtilisnattofromveryshortreaddata
AT popendorfkris wholegenomeassemblyofanattoproductionstrainbacillussubtilisnattofromveryshortreaddata
AT toyodaatsushi wholegenomeassemblyofanattoproductionstrainbacillussubtilisnattofromveryshortreaddata
AT fujiyamaasao wholegenomeassemblyofanattoproductionstrainbacillussubtilisnattofromveryshortreaddata
AT itayamitsuhiro wholegenomeassemblyofanattoproductionstrainbacillussubtilisnattofromveryshortreaddata
AT sakakibarayasubumi wholegenomeassemblyofanattoproductionstrainbacillussubtilisnattofromveryshortreaddata