Cargando…
A high quality assembly of the Nile Tilapia (Oreochromis niloticus) genome reveals the structure of two sex determination regions
BACKGROUND: Tilapias are the second most farmed fishes in the world and a sustainable source of food. Like many other fish, tilapias are sexually dimorphic and sex is a commercially important trait in these fish. In this study, we developed a significantly improved assembly of the tilapia genome usi...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5414186/ https://www.ncbi.nlm.nih.gov/pubmed/28464822 http://dx.doi.org/10.1186/s12864-017-3723-5 |
_version_ | 1783233316680368128 |
---|---|
author | Conte, Matthew A. Gammerdinger, William J. Bartie, Kerry L. Penman, David J. Kocher, Thomas D. |
author_facet | Conte, Matthew A. Gammerdinger, William J. Bartie, Kerry L. Penman, David J. Kocher, Thomas D. |
author_sort | Conte, Matthew A. |
collection | PubMed |
description | BACKGROUND: Tilapias are the second most farmed fishes in the world and a sustainable source of food. Like many other fish, tilapias are sexually dimorphic and sex is a commercially important trait in these fish. In this study, we developed a significantly improved assembly of the tilapia genome using the latest genome sequencing methods and show how it improves the characterization of two sex determination regions in two tilapia species. RESULTS: A homozygous clonal XX female Nile tilapia (Oreochromis niloticus) was sequenced to 44X coverage using Pacific Biosciences (PacBio) SMRT sequencing. Dozens of candidate de novo assemblies were generated and an optimal assembly (contig NG50 of 3.3Mbp) was selected using principal component analysis of likelihood scores calculated from several paired-end sequencing libraries. Comparison of the new assembly to the previous O. niloticus genome assembly reveals that recently duplicated portions of the genome are now well represented. The overall number of genes in the new assembly increased by 27.3%, including a 67% increase in pseudogenes. The new tilapia genome assembly correctly represents two recent vasa gene duplication events that have been verified with BAC sequencing. At total of 146Mbp of additional transposable element sequence are now assembled, a large proportion of which are recent insertions. Large centromeric satellite repeats are assembled and annotated in cichlid fish for the first time. Finally, the new assembly identifies the long-range structure of both a ~9Mbp XY sex determination region on LG1 in O. niloticus, and a ~50Mbp WZ sex determination region on LG3 in the related species O. aureus. CONCLUSIONS: This study highlights the use of long read sequencing to correctly assemble recent duplications and to characterize repeat-filled regions of the genome. The study serves as an example of the need for high quality genome assemblies and provides a framework for identifying sex determining genes in tilapia and related fish species. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-017-3723-5) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-5414186 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-54141862017-05-03 A high quality assembly of the Nile Tilapia (Oreochromis niloticus) genome reveals the structure of two sex determination regions Conte, Matthew A. Gammerdinger, William J. Bartie, Kerry L. Penman, David J. Kocher, Thomas D. BMC Genomics Research Article BACKGROUND: Tilapias are the second most farmed fishes in the world and a sustainable source of food. Like many other fish, tilapias are sexually dimorphic and sex is a commercially important trait in these fish. In this study, we developed a significantly improved assembly of the tilapia genome using the latest genome sequencing methods and show how it improves the characterization of two sex determination regions in two tilapia species. RESULTS: A homozygous clonal XX female Nile tilapia (Oreochromis niloticus) was sequenced to 44X coverage using Pacific Biosciences (PacBio) SMRT sequencing. Dozens of candidate de novo assemblies were generated and an optimal assembly (contig NG50 of 3.3Mbp) was selected using principal component analysis of likelihood scores calculated from several paired-end sequencing libraries. Comparison of the new assembly to the previous O. niloticus genome assembly reveals that recently duplicated portions of the genome are now well represented. The overall number of genes in the new assembly increased by 27.3%, including a 67% increase in pseudogenes. The new tilapia genome assembly correctly represents two recent vasa gene duplication events that have been verified with BAC sequencing. At total of 146Mbp of additional transposable element sequence are now assembled, a large proportion of which are recent insertions. Large centromeric satellite repeats are assembled and annotated in cichlid fish for the first time. Finally, the new assembly identifies the long-range structure of both a ~9Mbp XY sex determination region on LG1 in O. niloticus, and a ~50Mbp WZ sex determination region on LG3 in the related species O. aureus. CONCLUSIONS: This study highlights the use of long read sequencing to correctly assemble recent duplications and to characterize repeat-filled regions of the genome. The study serves as an example of the need for high quality genome assemblies and provides a framework for identifying sex determining genes in tilapia and related fish species. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-017-3723-5) contains supplementary material, which is available to authorized users. BioMed Central 2017-05-02 /pmc/articles/PMC5414186/ /pubmed/28464822 http://dx.doi.org/10.1186/s12864-017-3723-5 Text en © The Author(s). 2017 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Article Conte, Matthew A. Gammerdinger, William J. Bartie, Kerry L. Penman, David J. Kocher, Thomas D. A high quality assembly of the Nile Tilapia (Oreochromis niloticus) genome reveals the structure of two sex determination regions |
title | A high quality assembly of the Nile Tilapia (Oreochromis niloticus) genome reveals the structure of two sex determination regions |
title_full | A high quality assembly of the Nile Tilapia (Oreochromis niloticus) genome reveals the structure of two sex determination regions |
title_fullStr | A high quality assembly of the Nile Tilapia (Oreochromis niloticus) genome reveals the structure of two sex determination regions |
title_full_unstemmed | A high quality assembly of the Nile Tilapia (Oreochromis niloticus) genome reveals the structure of two sex determination regions |
title_short | A high quality assembly of the Nile Tilapia (Oreochromis niloticus) genome reveals the structure of two sex determination regions |
title_sort | high quality assembly of the nile tilapia (oreochromis niloticus) genome reveals the structure of two sex determination regions |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5414186/ https://www.ncbi.nlm.nih.gov/pubmed/28464822 http://dx.doi.org/10.1186/s12864-017-3723-5 |
work_keys_str_mv | AT contematthewa ahighqualityassemblyoftheniletilapiaoreochromisniloticusgenomerevealsthestructureoftwosexdeterminationregions AT gammerdingerwilliamj ahighqualityassemblyoftheniletilapiaoreochromisniloticusgenomerevealsthestructureoftwosexdeterminationregions AT bartiekerryl ahighqualityassemblyoftheniletilapiaoreochromisniloticusgenomerevealsthestructureoftwosexdeterminationregions AT penmandavidj ahighqualityassemblyoftheniletilapiaoreochromisniloticusgenomerevealsthestructureoftwosexdeterminationregions AT kocherthomasd ahighqualityassemblyoftheniletilapiaoreochromisniloticusgenomerevealsthestructureoftwosexdeterminationregions AT contematthewa highqualityassemblyoftheniletilapiaoreochromisniloticusgenomerevealsthestructureoftwosexdeterminationregions AT gammerdingerwilliamj highqualityassemblyoftheniletilapiaoreochromisniloticusgenomerevealsthestructureoftwosexdeterminationregions AT bartiekerryl highqualityassemblyoftheniletilapiaoreochromisniloticusgenomerevealsthestructureoftwosexdeterminationregions AT penmandavidj highqualityassemblyoftheniletilapiaoreochromisniloticusgenomerevealsthestructureoftwosexdeterminationregions AT kocherthomasd highqualityassemblyoftheniletilapiaoreochromisniloticusgenomerevealsthestructureoftwosexdeterminationregions |