
A chromosome-level sequence assembly reveals the structure of the Arabidopsis thaliana Nd-1 genome and its gene set

In addition to the BAC-based reference sequence of the accession Columbia-0 from the year 2000, several short-read assemblies of the plant model organism Arabidopsis thaliana have been published in recent years. A SMRT-based assembly of Landsberg erecta has also been generated that identified tran...


Bibliographic Details
Main authors: Pucker, Boas; Holtgräwe, Daniela; Stadermann, Kai Bernd; Frey, Katharina; Huettel, Bruno; Reinhardt, Richard; Weisshaar, Bernd
Format: Online Article Text
Language: English
Published: Public Library of Science 2019
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6529160/
https://www.ncbi.nlm.nih.gov/pubmed/31112551
http://dx.doi.org/10.1371/journal.pone.0216233
author Pucker, Boas
Holtgräwe, Daniela
Stadermann, Kai Bernd
Frey, Katharina
Huettel, Bruno
Reinhardt, Richard
Weisshaar, Bernd
collection PubMed
description In addition to the BAC-based reference sequence of the accession Columbia-0 from the year 2000, several short-read assemblies of the plant model organism Arabidopsis thaliana have been published in recent years. A SMRT-based assembly of Landsberg erecta has also been generated, which identified translocation and inversion polymorphisms between two genotypes of the species. Here we provide a chromosome-arm level assembly of the A. thaliana accession Niederzenz-1 (AthNd-1_v2c) based on SMRT sequencing data. The best assembly comprises 69 nucleome sequences and displays a contig length of up to 16 Mbp. Compared to an earlier Illumina short-read-based NGS assembly (AthNd-1_v1), a 75-fold increase in contiguity was observed for AthNd-1_v2c. To assign contig locations independently of the Col-0 gold standard reference sequence, we used genetic anchoring to generate a de novo assembly. In addition, we assembled the chondrome and plastome sequences. Detailed analyses of AthNd-1_v2c allowed reliable identification of large genomic rearrangements between A. thaliana accessions that contribute to differences in the gene sets distinguishing the genotypes. One of the detected differences revealed a gene that is lacking from the Col-0 gold standard sequence. This de novo assembly extends the known proportion of the A. thaliana pan-genome.
format Online
Article
Text
id pubmed-6529160
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-6529160 2019-05-31 PLoS One Research Article
Public Library of Science 2019-05-21 /pmc/articles/PMC6529160/ /pubmed/31112551 http://dx.doi.org/10.1371/journal.pone.0216233 Text en © 2019 Pucker et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
title A chromosome-level sequence assembly reveals the structure of the Arabidopsis thaliana Nd-1 genome and its gene set
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6529160/
https://www.ncbi.nlm.nih.gov/pubmed/31112551
http://dx.doi.org/10.1371/journal.pone.0216233