Cargando…

Whole genome sequence and manual annotation of Clostridium autoethanogenum, an industrially relevant bacterium

BACKGROUND: Clostridium autoethanogenum is an acetogenic bacterium capable of producing high value commodity chemicals and biofuels from the C1 gases present in synthesis gas. This common industrial waste gas can act as the sole energy and carbon source for the bacterium that converts the low value...

Descripción completa

Detalles Bibliográficos
Autores principales: Humphreys, Christopher M., McLean, Samantha, Schatschneider, Sarah, Millat, Thomas, Henstra, Anne M., Annan, Florence J., Breitkopf, Ronja, Pander, Bart, Piatek, Pawel, Rowe, Peter, Wichlacz, Alexander T., Woods, Craig, Norman, Rupert, Blom, Jochen, Goesman, Alexander, Hodgman, Charlie, Barrett, David, Thomas, Neil R., Winzer, Klaus, Minton, Nigel P.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4687164/
https://www.ncbi.nlm.nih.gov/pubmed/26692227
http://dx.doi.org/10.1186/s12864-015-2287-5
_version_ 1782406576115548160
author Humphreys, Christopher M.
McLean, Samantha
Schatschneider, Sarah
Millat, Thomas
Henstra, Anne M.
Annan, Florence J.
Breitkopf, Ronja
Pander, Bart
Piatek, Pawel
Rowe, Peter
Wichlacz, Alexander T.
Woods, Craig
Norman, Rupert
Blom, Jochen
Goesman, Alexander
Hodgman, Charlie
Barrett, David
Thomas, Neil R.
Winzer, Klaus
Minton, Nigel P.
author_facet Humphreys, Christopher M.
McLean, Samantha
Schatschneider, Sarah
Millat, Thomas
Henstra, Anne M.
Annan, Florence J.
Breitkopf, Ronja
Pander, Bart
Piatek, Pawel
Rowe, Peter
Wichlacz, Alexander T.
Woods, Craig
Norman, Rupert
Blom, Jochen
Goesman, Alexander
Hodgman, Charlie
Barrett, David
Thomas, Neil R.
Winzer, Klaus
Minton, Nigel P.
author_sort Humphreys, Christopher M.
collection PubMed
description BACKGROUND: Clostridium autoethanogenum is an acetogenic bacterium capable of producing high value commodity chemicals and biofuels from the C1 gases present in synthesis gas. This common industrial waste gas can act as the sole energy and carbon source for the bacterium that converts the low value gaseous components into cellular building blocks and industrially relevant products via the action of the reductive acetyl-CoA (Wood-Ljungdahl) pathway. Current research efforts are focused on the enhancement and extension of product formation in this organism via synthetic biology approaches. However, crucial to metabolic modelling and directed pathway engineering is a reliable and comprehensively annotated genome sequence. RESULTS: We performed next generation sequencing using Illumina MiSeq technology on the DSM10061 strain of Clostridium autoethanogenum and observed 243 single nucleotide discrepancies when compared to the published finished sequence (NCBI: GCA_000484505.1), with 59.1 % present in coding regions. These variations were confirmed by Sanger sequencing and subsequent analysis suggested that the discrepancies were sequencing errors in the published genome not true single nucleotide polymorphisms. This was corroborated by the observation that over 90 % occurred within homopolymer regions of greater than 4 nucleotides in length. It was also observed that many genes containing these sequencing errors were annotated in the published closed genome as encoding proteins containing frameshift mutations (18 instances) or were annotated despite the coding frame containing stop codons, which if genuine, would severely hinder the organism’s ability to survive. Furthermore, we have completed a comprehensive manual curation to reduce errors in the annotation that occur through serial use of automated annotation pipelines in related species. As a result, different functions were assigned to gene products or previous functional annotations rejected because of missing evidence in various occasions. CONCLUSIONS: We present a revised manually curated full genome sequence for Clostridium autoethanogenum DSM10061, which provides reliable information for genome-scale models that rely heavily on the accuracy of annotation, and represents an important step towards the manipulation and metabolic modelling of this industrially relevant acetogen. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-015-2287-5) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4687164
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-46871642015-12-23 Whole genome sequence and manual annotation of Clostridium autoethanogenum, an industrially relevant bacterium Humphreys, Christopher M. McLean, Samantha Schatschneider, Sarah Millat, Thomas Henstra, Anne M. Annan, Florence J. Breitkopf, Ronja Pander, Bart Piatek, Pawel Rowe, Peter Wichlacz, Alexander T. Woods, Craig Norman, Rupert Blom, Jochen Goesman, Alexander Hodgman, Charlie Barrett, David Thomas, Neil R. Winzer, Klaus Minton, Nigel P. BMC Genomics Research Article BACKGROUND: Clostridium autoethanogenum is an acetogenic bacterium capable of producing high value commodity chemicals and biofuels from the C1 gases present in synthesis gas. This common industrial waste gas can act as the sole energy and carbon source for the bacterium that converts the low value gaseous components into cellular building blocks and industrially relevant products via the action of the reductive acetyl-CoA (Wood-Ljungdahl) pathway. Current research efforts are focused on the enhancement and extension of product formation in this organism via synthetic biology approaches. However, crucial to metabolic modelling and directed pathway engineering is a reliable and comprehensively annotated genome sequence. RESULTS: We performed next generation sequencing using Illumina MiSeq technology on the DSM10061 strain of Clostridium autoethanogenum and observed 243 single nucleotide discrepancies when compared to the published finished sequence (NCBI: GCA_000484505.1), with 59.1 % present in coding regions. These variations were confirmed by Sanger sequencing and subsequent analysis suggested that the discrepancies were sequencing errors in the published genome not true single nucleotide polymorphisms. This was corroborated by the observation that over 90 % occurred within homopolymer regions of greater than 4 nucleotides in length. It was also observed that many genes containing these sequencing errors were annotated in the published closed genome as encoding proteins containing frameshift mutations (18 instances) or were annotated despite the coding frame containing stop codons, which if genuine, would severely hinder the organism’s ability to survive. Furthermore, we have completed a comprehensive manual curation to reduce errors in the annotation that occur through serial use of automated annotation pipelines in related species. As a result, different functions were assigned to gene products or previous functional annotations rejected because of missing evidence in various occasions. CONCLUSIONS: We present a revised manually curated full genome sequence for Clostridium autoethanogenum DSM10061, which provides reliable information for genome-scale models that rely heavily on the accuracy of annotation, and represents an important step towards the manipulation and metabolic modelling of this industrially relevant acetogen. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-015-2287-5) contains supplementary material, which is available to authorized users. BioMed Central 2015-12-21 /pmc/articles/PMC4687164/ /pubmed/26692227 http://dx.doi.org/10.1186/s12864-015-2287-5 Text en © Humphreys et al. 2015 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Humphreys, Christopher M.
McLean, Samantha
Schatschneider, Sarah
Millat, Thomas
Henstra, Anne M.
Annan, Florence J.
Breitkopf, Ronja
Pander, Bart
Piatek, Pawel
Rowe, Peter
Wichlacz, Alexander T.
Woods, Craig
Norman, Rupert
Blom, Jochen
Goesman, Alexander
Hodgman, Charlie
Barrett, David
Thomas, Neil R.
Winzer, Klaus
Minton, Nigel P.
Whole genome sequence and manual annotation of Clostridium autoethanogenum, an industrially relevant bacterium
title Whole genome sequence and manual annotation of Clostridium autoethanogenum, an industrially relevant bacterium
title_full Whole genome sequence and manual annotation of Clostridium autoethanogenum, an industrially relevant bacterium
title_fullStr Whole genome sequence and manual annotation of Clostridium autoethanogenum, an industrially relevant bacterium
title_full_unstemmed Whole genome sequence and manual annotation of Clostridium autoethanogenum, an industrially relevant bacterium
title_short Whole genome sequence and manual annotation of Clostridium autoethanogenum, an industrially relevant bacterium
title_sort whole genome sequence and manual annotation of clostridium autoethanogenum, an industrially relevant bacterium
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4687164/
https://www.ncbi.nlm.nih.gov/pubmed/26692227
http://dx.doi.org/10.1186/s12864-015-2287-5
work_keys_str_mv AT humphreyschristopherm wholegenomesequenceandmanualannotationofclostridiumautoethanogenumanindustriallyrelevantbacterium
AT mcleansamantha wholegenomesequenceandmanualannotationofclostridiumautoethanogenumanindustriallyrelevantbacterium
AT schatschneidersarah wholegenomesequenceandmanualannotationofclostridiumautoethanogenumanindustriallyrelevantbacterium
AT millatthomas wholegenomesequenceandmanualannotationofclostridiumautoethanogenumanindustriallyrelevantbacterium
AT henstraannem wholegenomesequenceandmanualannotationofclostridiumautoethanogenumanindustriallyrelevantbacterium
AT annanflorencej wholegenomesequenceandmanualannotationofclostridiumautoethanogenumanindustriallyrelevantbacterium
AT breitkopfronja wholegenomesequenceandmanualannotationofclostridiumautoethanogenumanindustriallyrelevantbacterium
AT panderbart wholegenomesequenceandmanualannotationofclostridiumautoethanogenumanindustriallyrelevantbacterium
AT piatekpawel wholegenomesequenceandmanualannotationofclostridiumautoethanogenumanindustriallyrelevantbacterium
AT rowepeter wholegenomesequenceandmanualannotationofclostridiumautoethanogenumanindustriallyrelevantbacterium
AT wichlaczalexandert wholegenomesequenceandmanualannotationofclostridiumautoethanogenumanindustriallyrelevantbacterium
AT woodscraig wholegenomesequenceandmanualannotationofclostridiumautoethanogenumanindustriallyrelevantbacterium
AT normanrupert wholegenomesequenceandmanualannotationofclostridiumautoethanogenumanindustriallyrelevantbacterium
AT blomjochen wholegenomesequenceandmanualannotationofclostridiumautoethanogenumanindustriallyrelevantbacterium
AT goesmanalexander wholegenomesequenceandmanualannotationofclostridiumautoethanogenumanindustriallyrelevantbacterium
AT hodgmancharlie wholegenomesequenceandmanualannotationofclostridiumautoethanogenumanindustriallyrelevantbacterium
AT barrettdavid wholegenomesequenceandmanualannotationofclostridiumautoethanogenumanindustriallyrelevantbacterium
AT thomasneilr wholegenomesequenceandmanualannotationofclostridiumautoethanogenumanindustriallyrelevantbacterium
AT winzerklaus wholegenomesequenceandmanualannotationofclostridiumautoethanogenumanindustriallyrelevantbacterium
AT mintonnigelp wholegenomesequenceandmanualannotationofclostridiumautoethanogenumanindustriallyrelevantbacterium