Cargando…
A chromosome-level sequence assembly reveals the structure of the Arabidopsis thaliana Nd-1 genome and its gene set
In addition to the BAC-based reference sequence of the accession Columbia-0 from the year 2000, several short read assemblies of THE plant model organism Arabidopsis thaliana were published during the last years. Also, a SMRT-based assembly of Landsberg erecta has been generated that identified tran...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6529160/ https://www.ncbi.nlm.nih.gov/pubmed/31112551 http://dx.doi.org/10.1371/journal.pone.0216233 |
_version_ | 1783420341601697792 |
---|---|
author | Pucker, Boas Holtgräwe, Daniela Stadermann, Kai Bernd Frey, Katharina Huettel, Bruno Reinhardt, Richard Weisshaar, Bernd |
author_facet | Pucker, Boas Holtgräwe, Daniela Stadermann, Kai Bernd Frey, Katharina Huettel, Bruno Reinhardt, Richard Weisshaar, Bernd |
author_sort | Pucker, Boas |
collection | PubMed |
description | In addition to the BAC-based reference sequence of the accession Columbia-0 from the year 2000, several short read assemblies of THE plant model organism Arabidopsis thaliana were published during the last years. Also, a SMRT-based assembly of Landsberg erecta has been generated that identified translocation and inversion polymorphisms between two genotypes of the species. Here we provide a chromosome-arm level assembly of the A. thaliana accession Niederzenz-1 (AthNd-1_v2c) based on SMRT sequencing data. The best assembly comprises 69 nucleome sequences and displays a contig length of up to 16 Mbp. Compared to an earlier Illumina short read-based NGS assembly (AthNd-1_v1), a 75 fold increase in contiguity was observed for AthNd-1_v2c. To assign contig locations independent from the Col-0 gold standard reference sequence, we used genetic anchoring to generate a de novo assembly. In addition, we assembled the chondrome and plastome sequences. Detailed analyses of AthNd-1_v2c allowed reliable identification of large genomic rearrangements between A. thaliana accessions contributing to differences in the gene sets that distinguish the genotypes. One of the differences detected identified a gene that is lacking from the Col-0 gold standard sequence. This de novo assembly extends the known proportion of the A. thaliana pan-genome. |
format | Online Article Text |
id | pubmed-6529160 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-65291602019-05-31 A chromosome-level sequence assembly reveals the structure of the Arabidopsis thaliana Nd-1 genome and its gene set Pucker, Boas Holtgräwe, Daniela Stadermann, Kai Bernd Frey, Katharina Huettel, Bruno Reinhardt, Richard Weisshaar, Bernd PLoS One Research Article In addition to the BAC-based reference sequence of the accession Columbia-0 from the year 2000, several short read assemblies of THE plant model organism Arabidopsis thaliana were published during the last years. Also, a SMRT-based assembly of Landsberg erecta has been generated that identified translocation and inversion polymorphisms between two genotypes of the species. Here we provide a chromosome-arm level assembly of the A. thaliana accession Niederzenz-1 (AthNd-1_v2c) based on SMRT sequencing data. The best assembly comprises 69 nucleome sequences and displays a contig length of up to 16 Mbp. Compared to an earlier Illumina short read-based NGS assembly (AthNd-1_v1), a 75 fold increase in contiguity was observed for AthNd-1_v2c. To assign contig locations independent from the Col-0 gold standard reference sequence, we used genetic anchoring to generate a de novo assembly. In addition, we assembled the chondrome and plastome sequences. Detailed analyses of AthNd-1_v2c allowed reliable identification of large genomic rearrangements between A. thaliana accessions contributing to differences in the gene sets that distinguish the genotypes. One of the differences detected identified a gene that is lacking from the Col-0 gold standard sequence. This de novo assembly extends the known proportion of the A. thaliana pan-genome. Public Library of Science 2019-05-21 /pmc/articles/PMC6529160/ /pubmed/31112551 http://dx.doi.org/10.1371/journal.pone.0216233 Text en © 2019 Pucker et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Pucker, Boas Holtgräwe, Daniela Stadermann, Kai Bernd Frey, Katharina Huettel, Bruno Reinhardt, Richard Weisshaar, Bernd A chromosome-level sequence assembly reveals the structure of the Arabidopsis thaliana Nd-1 genome and its gene set |
title | A chromosome-level sequence assembly reveals the structure of the Arabidopsis thaliana Nd-1 genome and its gene set |
title_full | A chromosome-level sequence assembly reveals the structure of the Arabidopsis thaliana Nd-1 genome and its gene set |
title_fullStr | A chromosome-level sequence assembly reveals the structure of the Arabidopsis thaliana Nd-1 genome and its gene set |
title_full_unstemmed | A chromosome-level sequence assembly reveals the structure of the Arabidopsis thaliana Nd-1 genome and its gene set |
title_short | A chromosome-level sequence assembly reveals the structure of the Arabidopsis thaliana Nd-1 genome and its gene set |
title_sort | chromosome-level sequence assembly reveals the structure of the arabidopsis thaliana nd-1 genome and its gene set |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6529160/ https://www.ncbi.nlm.nih.gov/pubmed/31112551 http://dx.doi.org/10.1371/journal.pone.0216233 |
work_keys_str_mv | AT puckerboas achromosomelevelsequenceassemblyrevealsthestructureofthearabidopsisthalianand1genomeanditsgeneset AT holtgrawedaniela achromosomelevelsequenceassemblyrevealsthestructureofthearabidopsisthalianand1genomeanditsgeneset AT stadermannkaibernd achromosomelevelsequenceassemblyrevealsthestructureofthearabidopsisthalianand1genomeanditsgeneset AT freykatharina achromosomelevelsequenceassemblyrevealsthestructureofthearabidopsisthalianand1genomeanditsgeneset AT huettelbruno achromosomelevelsequenceassemblyrevealsthestructureofthearabidopsisthalianand1genomeanditsgeneset AT reinhardtrichard achromosomelevelsequenceassemblyrevealsthestructureofthearabidopsisthalianand1genomeanditsgeneset AT weisshaarbernd achromosomelevelsequenceassemblyrevealsthestructureofthearabidopsisthalianand1genomeanditsgeneset AT puckerboas chromosomelevelsequenceassemblyrevealsthestructureofthearabidopsisthalianand1genomeanditsgeneset AT holtgrawedaniela chromosomelevelsequenceassemblyrevealsthestructureofthearabidopsisthalianand1genomeanditsgeneset AT stadermannkaibernd chromosomelevelsequenceassemblyrevealsthestructureofthearabidopsisthalianand1genomeanditsgeneset AT freykatharina chromosomelevelsequenceassemblyrevealsthestructureofthearabidopsisthalianand1genomeanditsgeneset AT huettelbruno chromosomelevelsequenceassemblyrevealsthestructureofthearabidopsisthalianand1genomeanditsgeneset AT reinhardtrichard chromosomelevelsequenceassemblyrevealsthestructureofthearabidopsisthalianand1genomeanditsgeneset AT weisshaarbernd chromosomelevelsequenceassemblyrevealsthestructureofthearabidopsisthalianand1genomeanditsgeneset |