Cargando…

A systematic comparison of chloroplast genome assembly tools

BACKGROUND: Chloroplasts are intracellular organelles that enable plants to conduct photosynthesis. They arose through the symbiotic integration of a prokaryotic cell into an eukaryotic host cell and still contain their own genomes with distinct genomic information. Plastid genomes accommodate essen...

Descripción completa

Detalles Bibliográficos
Autores principales: Freudenthal, Jan A., Pfaff, Simon, Terhoeven, Niklas, Korte, Arthur, Ankenbrand, Markus J., Förster, Frank
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7520963/
https://www.ncbi.nlm.nih.gov/pubmed/32988404
http://dx.doi.org/10.1186/s13059-020-02153-6
_version_ 1783587881679323136
author Freudenthal, Jan A.
Pfaff, Simon
Terhoeven, Niklas
Korte, Arthur
Ankenbrand, Markus J.
Förster, Frank
author_facet Freudenthal, Jan A.
Pfaff, Simon
Terhoeven, Niklas
Korte, Arthur
Ankenbrand, Markus J.
Förster, Frank
author_sort Freudenthal, Jan A.
collection PubMed
description BACKGROUND: Chloroplasts are intracellular organelles that enable plants to conduct photosynthesis. They arose through the symbiotic integration of a prokaryotic cell into an eukaryotic host cell and still contain their own genomes with distinct genomic information. Plastid genomes accommodate essential genes and are regularly utilized in biotechnology or phylogenetics. Different assemblers that are able to assess the plastid genome have been developed. These assemblers often use data of whole genome sequencing experiments, which usually contain reads from the complete chloroplast genome. RESULTS: The performance of different assembly tools has never been systematically compared. Here, we present a benchmark of seven chloroplast assembly tools, capable of succeeding in more than 60% of known real data sets. Our results show significant differences between the tested assemblers in terms of generating whole chloroplast genome sequences and computational requirements. The examination of 105 data sets from species with unknown plastid genomes leads to the assembly of 20 novel chloroplast genomes. CONCLUSIONS: We create docker images for each tested tool that are freely available for the scientific community and ensure reproducibility of the analyses. These containers allow the analysis and screening of data sets for chloroplast genomes using standard computational infrastructure. Thus, large scale screening for chloroplasts within genomic sequencing data is feasible.
format Online
Article
Text
id pubmed-7520963
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-75209632020-09-30 A systematic comparison of chloroplast genome assembly tools Freudenthal, Jan A. Pfaff, Simon Terhoeven, Niklas Korte, Arthur Ankenbrand, Markus J. Förster, Frank Genome Biol Research BACKGROUND: Chloroplasts are intracellular organelles that enable plants to conduct photosynthesis. They arose through the symbiotic integration of a prokaryotic cell into an eukaryotic host cell and still contain their own genomes with distinct genomic information. Plastid genomes accommodate essential genes and are regularly utilized in biotechnology or phylogenetics. Different assemblers that are able to assess the plastid genome have been developed. These assemblers often use data of whole genome sequencing experiments, which usually contain reads from the complete chloroplast genome. RESULTS: The performance of different assembly tools has never been systematically compared. Here, we present a benchmark of seven chloroplast assembly tools, capable of succeeding in more than 60% of known real data sets. Our results show significant differences between the tested assemblers in terms of generating whole chloroplast genome sequences and computational requirements. The examination of 105 data sets from species with unknown plastid genomes leads to the assembly of 20 novel chloroplast genomes. CONCLUSIONS: We create docker images for each tested tool that are freely available for the scientific community and ensure reproducibility of the analyses. These containers allow the analysis and screening of data sets for chloroplast genomes using standard computational infrastructure. Thus, large scale screening for chloroplasts within genomic sequencing data is feasible. BioMed Central 2020-09-28 /pmc/articles/PMC7520963/ /pubmed/32988404 http://dx.doi.org/10.1186/s13059-020-02153-6 Text en © The Author(s) 2020 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Freudenthal, Jan A.
Pfaff, Simon
Terhoeven, Niklas
Korte, Arthur
Ankenbrand, Markus J.
Förster, Frank
A systematic comparison of chloroplast genome assembly tools
title A systematic comparison of chloroplast genome assembly tools
title_full A systematic comparison of chloroplast genome assembly tools
title_fullStr A systematic comparison of chloroplast genome assembly tools
title_full_unstemmed A systematic comparison of chloroplast genome assembly tools
title_short A systematic comparison of chloroplast genome assembly tools
title_sort systematic comparison of chloroplast genome assembly tools
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7520963/
https://www.ncbi.nlm.nih.gov/pubmed/32988404
http://dx.doi.org/10.1186/s13059-020-02153-6
work_keys_str_mv AT freudenthaljana asystematiccomparisonofchloroplastgenomeassemblytools
AT pfaffsimon asystematiccomparisonofchloroplastgenomeassemblytools
AT terhoevenniklas asystematiccomparisonofchloroplastgenomeassemblytools
AT kortearthur asystematiccomparisonofchloroplastgenomeassemblytools
AT ankenbrandmarkusj asystematiccomparisonofchloroplastgenomeassemblytools
AT forsterfrank asystematiccomparisonofchloroplastgenomeassemblytools
AT freudenthaljana systematiccomparisonofchloroplastgenomeassemblytools
AT pfaffsimon systematiccomparisonofchloroplastgenomeassemblytools
AT terhoevenniklas systematiccomparisonofchloroplastgenomeassemblytools
AT kortearthur systematiccomparisonofchloroplastgenomeassemblytools
AT ankenbrandmarkusj systematiccomparisonofchloroplastgenomeassemblytools
AT forsterfrank systematiccomparisonofchloroplastgenomeassemblytools