Cargando…

A high quality assembly of the Nile Tilapia (Oreochromis niloticus) genome reveals the structure of two sex determination regions

BACKGROUND: Tilapias are the second most farmed fishes in the world and a sustainable source of food. Like many other fish, tilapias are sexually dimorphic and sex is a commercially important trait in these fish. In this study, we developed a significantly improved assembly of the tilapia genome usi...

Descripción completa

Detalles Bibliográficos
Autores principales: Conte, Matthew A., Gammerdinger, William J., Bartie, Kerry L., Penman, David J., Kocher, Thomas D.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5414186/
https://www.ncbi.nlm.nih.gov/pubmed/28464822
http://dx.doi.org/10.1186/s12864-017-3723-5
_version_ 1783233316680368128
author Conte, Matthew A.
Gammerdinger, William J.
Bartie, Kerry L.
Penman, David J.
Kocher, Thomas D.
author_facet Conte, Matthew A.
Gammerdinger, William J.
Bartie, Kerry L.
Penman, David J.
Kocher, Thomas D.
author_sort Conte, Matthew A.
collection PubMed
description BACKGROUND: Tilapias are the second most farmed fishes in the world and a sustainable source of food. Like many other fish, tilapias are sexually dimorphic and sex is a commercially important trait in these fish. In this study, we developed a significantly improved assembly of the tilapia genome using the latest genome sequencing methods and show how it improves the characterization of two sex determination regions in two tilapia species. RESULTS: A homozygous clonal XX female Nile tilapia (Oreochromis niloticus) was sequenced to 44X coverage using Pacific Biosciences (PacBio) SMRT sequencing. Dozens of candidate de novo assemblies were generated and an optimal assembly (contig NG50 of 3.3Mbp) was selected using principal component analysis of likelihood scores calculated from several paired-end sequencing libraries. Comparison of the new assembly to the previous O. niloticus genome assembly reveals that recently duplicated portions of the genome are now well represented. The overall number of genes in the new assembly increased by 27.3%, including a 67% increase in pseudogenes. The new tilapia genome assembly correctly represents two recent vasa gene duplication events that have been verified with BAC sequencing. At total of 146Mbp of additional transposable element sequence are now assembled, a large proportion of which are recent insertions. Large centromeric satellite repeats are assembled and annotated in cichlid fish for the first time. Finally, the new assembly identifies the long-range structure of both a ~9Mbp XY sex determination region on LG1 in O. niloticus, and a ~50Mbp WZ sex determination region on LG3 in the related species O. aureus. CONCLUSIONS: This study highlights the use of long read sequencing to correctly assemble recent duplications and to characterize repeat-filled regions of the genome. The study serves as an example of the need for high quality genome assemblies and provides a framework for identifying sex determining genes in tilapia and related fish species. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-017-3723-5) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-5414186
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-54141862017-05-03 A high quality assembly of the Nile Tilapia (Oreochromis niloticus) genome reveals the structure of two sex determination regions Conte, Matthew A. Gammerdinger, William J. Bartie, Kerry L. Penman, David J. Kocher, Thomas D. BMC Genomics Research Article BACKGROUND: Tilapias are the second most farmed fishes in the world and a sustainable source of food. Like many other fish, tilapias are sexually dimorphic and sex is a commercially important trait in these fish. In this study, we developed a significantly improved assembly of the tilapia genome using the latest genome sequencing methods and show how it improves the characterization of two sex determination regions in two tilapia species. RESULTS: A homozygous clonal XX female Nile tilapia (Oreochromis niloticus) was sequenced to 44X coverage using Pacific Biosciences (PacBio) SMRT sequencing. Dozens of candidate de novo assemblies were generated and an optimal assembly (contig NG50 of 3.3Mbp) was selected using principal component analysis of likelihood scores calculated from several paired-end sequencing libraries. Comparison of the new assembly to the previous O. niloticus genome assembly reveals that recently duplicated portions of the genome are now well represented. The overall number of genes in the new assembly increased by 27.3%, including a 67% increase in pseudogenes. The new tilapia genome assembly correctly represents two recent vasa gene duplication events that have been verified with BAC sequencing. At total of 146Mbp of additional transposable element sequence are now assembled, a large proportion of which are recent insertions. Large centromeric satellite repeats are assembled and annotated in cichlid fish for the first time. Finally, the new assembly identifies the long-range structure of both a ~9Mbp XY sex determination region on LG1 in O. niloticus, and a ~50Mbp WZ sex determination region on LG3 in the related species O. aureus. CONCLUSIONS: This study highlights the use of long read sequencing to correctly assemble recent duplications and to characterize repeat-filled regions of the genome. The study serves as an example of the need for high quality genome assemblies and provides a framework for identifying sex determining genes in tilapia and related fish species. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-017-3723-5) contains supplementary material, which is available to authorized users. BioMed Central 2017-05-02 /pmc/articles/PMC5414186/ /pubmed/28464822 http://dx.doi.org/10.1186/s12864-017-3723-5 Text en © The Author(s). 2017 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Conte, Matthew A.
Gammerdinger, William J.
Bartie, Kerry L.
Penman, David J.
Kocher, Thomas D.
A high quality assembly of the Nile Tilapia (Oreochromis niloticus) genome reveals the structure of two sex determination regions
title A high quality assembly of the Nile Tilapia (Oreochromis niloticus) genome reveals the structure of two sex determination regions
title_full A high quality assembly of the Nile Tilapia (Oreochromis niloticus) genome reveals the structure of two sex determination regions
title_fullStr A high quality assembly of the Nile Tilapia (Oreochromis niloticus) genome reveals the structure of two sex determination regions
title_full_unstemmed A high quality assembly of the Nile Tilapia (Oreochromis niloticus) genome reveals the structure of two sex determination regions
title_short A high quality assembly of the Nile Tilapia (Oreochromis niloticus) genome reveals the structure of two sex determination regions
title_sort high quality assembly of the nile tilapia (oreochromis niloticus) genome reveals the structure of two sex determination regions
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5414186/
https://www.ncbi.nlm.nih.gov/pubmed/28464822
http://dx.doi.org/10.1186/s12864-017-3723-5
work_keys_str_mv AT contematthewa ahighqualityassemblyoftheniletilapiaoreochromisniloticusgenomerevealsthestructureoftwosexdeterminationregions
AT gammerdingerwilliamj ahighqualityassemblyoftheniletilapiaoreochromisniloticusgenomerevealsthestructureoftwosexdeterminationregions
AT bartiekerryl ahighqualityassemblyoftheniletilapiaoreochromisniloticusgenomerevealsthestructureoftwosexdeterminationregions
AT penmandavidj ahighqualityassemblyoftheniletilapiaoreochromisniloticusgenomerevealsthestructureoftwosexdeterminationregions
AT kocherthomasd ahighqualityassemblyoftheniletilapiaoreochromisniloticusgenomerevealsthestructureoftwosexdeterminationregions
AT contematthewa highqualityassemblyoftheniletilapiaoreochromisniloticusgenomerevealsthestructureoftwosexdeterminationregions
AT gammerdingerwilliamj highqualityassemblyoftheniletilapiaoreochromisniloticusgenomerevealsthestructureoftwosexdeterminationregions
AT bartiekerryl highqualityassemblyoftheniletilapiaoreochromisniloticusgenomerevealsthestructureoftwosexdeterminationregions
AT penmandavidj highqualityassemblyoftheniletilapiaoreochromisniloticusgenomerevealsthestructureoftwosexdeterminationregions
AT kocherthomasd highqualityassemblyoftheniletilapiaoreochromisniloticusgenomerevealsthestructureoftwosexdeterminationregions