Cargando…

Assembly and comparison of two closely related Brassica napus genomes

As an increasing number of plant genome sequences become available, it is clear that gene content varies between individuals, and the challenge arises to predict the gene content of a species. However, genome comparison is often confounded by variation in assembly and annotation. Differentiating bet...

Descripción completa

Detalles Bibliográficos
Autores principales: Bayer, Philipp E., Hurgobin, Bhavna, Golicz, Agnieszka A., Chan, Chon‐Kit Kenneth, Yuan, Yuxuan, Lee, HueyTyng, Renton, Michael, Meng, Jinling, Li, Ruiyuan, Long, Yan, Zou, Jun, Bancroft, Ian, Chalhoub, Boulos, King, Graham J., Batley, Jacqueline, Edwards, David
Formato: Online Artículo Texto
Lenguaje:English
Publicado: John Wiley and Sons Inc. 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5698052/
https://www.ncbi.nlm.nih.gov/pubmed/28403535
http://dx.doi.org/10.1111/pbi.12742
_version_ 1783280714007969792
author Bayer, Philipp E.
Hurgobin, Bhavna
Golicz, Agnieszka A.
Chan, Chon‐Kit Kenneth
Yuan, Yuxuan
Lee, HueyTyng
Renton, Michael
Meng, Jinling
Li, Ruiyuan
Long, Yan
Zou, Jun
Bancroft, Ian
Chalhoub, Boulos
King, Graham J.
Batley, Jacqueline
Edwards, David
author_facet Bayer, Philipp E.
Hurgobin, Bhavna
Golicz, Agnieszka A.
Chan, Chon‐Kit Kenneth
Yuan, Yuxuan
Lee, HueyTyng
Renton, Michael
Meng, Jinling
Li, Ruiyuan
Long, Yan
Zou, Jun
Bancroft, Ian
Chalhoub, Boulos
King, Graham J.
Batley, Jacqueline
Edwards, David
author_sort Bayer, Philipp E.
collection PubMed
description As an increasing number of plant genome sequences become available, it is clear that gene content varies between individuals, and the challenge arises to predict the gene content of a species. However, genome comparison is often confounded by variation in assembly and annotation. Differentiating between true gene absence and variation in assembly or annotation is essential for the accurate identification of conserved and variable genes in a species. Here, we present the de novo assembly of the B. napus cultivar Tapidor and comparison with an improved assembly of the Brassica napus cultivar Darmor‐bzh. Both cultivars were annotated using the same method to allow comparison of gene content. We identified genes unique to each cultivar and differentiate these from artefacts due to variation in the assembly and annotation. We demonstrate that using a common annotation pipeline can result in different gene predictions, even for closely related cultivars, and repeat regions which collapse during assembly impact whole genome comparison. After accounting for differences in assembly and annotation, we demonstrate that the genome of Darmor‐bzh contains a greater number of genes than the genome of Tapidor. Our results are the first step towards comparison of the true differences between B. napus genomes and highlight the potential sources of error in future production of a B. napus pangenome.
format Online
Article
Text
id pubmed-5698052
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher John Wiley and Sons Inc.
record_format MEDLINE/PubMed
spelling pubmed-56980522017-11-28 Assembly and comparison of two closely related Brassica napus genomes Bayer, Philipp E. Hurgobin, Bhavna Golicz, Agnieszka A. Chan, Chon‐Kit Kenneth Yuan, Yuxuan Lee, HueyTyng Renton, Michael Meng, Jinling Li, Ruiyuan Long, Yan Zou, Jun Bancroft, Ian Chalhoub, Boulos King, Graham J. Batley, Jacqueline Edwards, David Plant Biotechnol J Research Articles As an increasing number of plant genome sequences become available, it is clear that gene content varies between individuals, and the challenge arises to predict the gene content of a species. However, genome comparison is often confounded by variation in assembly and annotation. Differentiating between true gene absence and variation in assembly or annotation is essential for the accurate identification of conserved and variable genes in a species. Here, we present the de novo assembly of the B. napus cultivar Tapidor and comparison with an improved assembly of the Brassica napus cultivar Darmor‐bzh. Both cultivars were annotated using the same method to allow comparison of gene content. We identified genes unique to each cultivar and differentiate these from artefacts due to variation in the assembly and annotation. We demonstrate that using a common annotation pipeline can result in different gene predictions, even for closely related cultivars, and repeat regions which collapse during assembly impact whole genome comparison. After accounting for differences in assembly and annotation, we demonstrate that the genome of Darmor‐bzh contains a greater number of genes than the genome of Tapidor. Our results are the first step towards comparison of the true differences between B. napus genomes and highlight the potential sources of error in future production of a B. napus pangenome. John Wiley and Sons Inc. 2017-06-14 2017-12 /pmc/articles/PMC5698052/ /pubmed/28403535 http://dx.doi.org/10.1111/pbi.12742 Text en © 2017 The Authors. Plant Biotechnology Journal published by Society for Experimental Biology and The Association of Applied Biologists and John Wiley & Sons Ltd. This is an open access article under the terms of the Creative Commons Attribution (http://creativecommons.org/licenses/by/4.0/) License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Articles
Bayer, Philipp E.
Hurgobin, Bhavna
Golicz, Agnieszka A.
Chan, Chon‐Kit Kenneth
Yuan, Yuxuan
Lee, HueyTyng
Renton, Michael
Meng, Jinling
Li, Ruiyuan
Long, Yan
Zou, Jun
Bancroft, Ian
Chalhoub, Boulos
King, Graham J.
Batley, Jacqueline
Edwards, David
Assembly and comparison of two closely related Brassica napus genomes
title Assembly and comparison of two closely related Brassica napus genomes
title_full Assembly and comparison of two closely related Brassica napus genomes
title_fullStr Assembly and comparison of two closely related Brassica napus genomes
title_full_unstemmed Assembly and comparison of two closely related Brassica napus genomes
title_short Assembly and comparison of two closely related Brassica napus genomes
title_sort assembly and comparison of two closely related brassica napus genomes
topic Research Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5698052/
https://www.ncbi.nlm.nih.gov/pubmed/28403535
http://dx.doi.org/10.1111/pbi.12742
work_keys_str_mv AT bayerphilippe assemblyandcomparisonoftwocloselyrelatedbrassicanapusgenomes
AT hurgobinbhavna assemblyandcomparisonoftwocloselyrelatedbrassicanapusgenomes
AT goliczagnieszkaa assemblyandcomparisonoftwocloselyrelatedbrassicanapusgenomes
AT chanchonkitkenneth assemblyandcomparisonoftwocloselyrelatedbrassicanapusgenomes
AT yuanyuxuan assemblyandcomparisonoftwocloselyrelatedbrassicanapusgenomes
AT leehueytyng assemblyandcomparisonoftwocloselyrelatedbrassicanapusgenomes
AT rentonmichael assemblyandcomparisonoftwocloselyrelatedbrassicanapusgenomes
AT mengjinling assemblyandcomparisonoftwocloselyrelatedbrassicanapusgenomes
AT liruiyuan assemblyandcomparisonoftwocloselyrelatedbrassicanapusgenomes
AT longyan assemblyandcomparisonoftwocloselyrelatedbrassicanapusgenomes
AT zoujun assemblyandcomparisonoftwocloselyrelatedbrassicanapusgenomes
AT bancroftian assemblyandcomparisonoftwocloselyrelatedbrassicanapusgenomes
AT chalhoubboulos assemblyandcomparisonoftwocloselyrelatedbrassicanapusgenomes
AT kinggrahamj assemblyandcomparisonoftwocloselyrelatedbrassicanapusgenomes
AT batleyjacqueline assemblyandcomparisonoftwocloselyrelatedbrassicanapusgenomes
AT edwardsdavid assemblyandcomparisonoftwocloselyrelatedbrassicanapusgenomes