Cargando…
Nanopore Sequencing and Hi-C Based De Novo Assembly of Trachidermus fasciatus Genome
Trachidermus fasciatus is a roughskin sculpin fish widespread across the coastal areas of East Asia. Due to environmental destruction and overfishing, the population of this species is under threat. In order to protect this endangered species, it is important to have the genome sequenced. Reference...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8148166/ https://www.ncbi.nlm.nih.gov/pubmed/34066304 http://dx.doi.org/10.3390/genes12050692 |
_version_ | 1783697793107361792 |
---|---|
author | Xie, Gangcai Zhang, Xu Lv, Feng Sang, Mengmeng Hu, Hairong Wang, Jinqiu Liu, Dong |
author_facet | Xie, Gangcai Zhang, Xu Lv, Feng Sang, Mengmeng Hu, Hairong Wang, Jinqiu Liu, Dong |
author_sort | Xie, Gangcai |
collection | PubMed |
description | Trachidermus fasciatus is a roughskin sculpin fish widespread across the coastal areas of East Asia. Due to environmental destruction and overfishing, the population of this species is under threat. In order to protect this endangered species, it is important to have the genome sequenced. Reference genomes are essential for studying population genetics, domestic farming, and genetic resource protection. However, currently, no reference genome is available for Trachidermus fasciatus, and this has greatly hindered the research on this species. In this study, we integrated nanopore long-read sequencing, Illumina short-read sequencing, and Hi-C methods to thoroughly assemble the Trachidermus fasciatus genome. Our results provided a chromosome-level high-quality genome assembly with a predicted genome size of 542.6 Mbp (2n = 40) and a scaffold N50 of 24.9 Mbp. The BUSCO value for genome assembly completeness was higher than 96%, and the single-base accuracy was 99.997%. Based on EVM-StringTie genome annotation, a total of 19,147 protein-coding genes were identified, including 35,093 mRNA transcripts. In addition, a novel gene-finding strategy named RNR was introduced, and in total, 51 (82) novel genes (transcripts) were identified. Lastly, we present here the first reference genome for Trachidermus fasciatus; this sequence is expected to greatly facilitate future research on this species. |
format | Online Article Text |
id | pubmed-8148166 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-81481662021-05-26 Nanopore Sequencing and Hi-C Based De Novo Assembly of Trachidermus fasciatus Genome Xie, Gangcai Zhang, Xu Lv, Feng Sang, Mengmeng Hu, Hairong Wang, Jinqiu Liu, Dong Genes (Basel) Article Trachidermus fasciatus is a roughskin sculpin fish widespread across the coastal areas of East Asia. Due to environmental destruction and overfishing, the population of this species is under threat. In order to protect this endangered species, it is important to have the genome sequenced. Reference genomes are essential for studying population genetics, domestic farming, and genetic resource protection. However, currently, no reference genome is available for Trachidermus fasciatus, and this has greatly hindered the research on this species. In this study, we integrated nanopore long-read sequencing, Illumina short-read sequencing, and Hi-C methods to thoroughly assemble the Trachidermus fasciatus genome. Our results provided a chromosome-level high-quality genome assembly with a predicted genome size of 542.6 Mbp (2n = 40) and a scaffold N50 of 24.9 Mbp. The BUSCO value for genome assembly completeness was higher than 96%, and the single-base accuracy was 99.997%. Based on EVM-StringTie genome annotation, a total of 19,147 protein-coding genes were identified, including 35,093 mRNA transcripts. In addition, a novel gene-finding strategy named RNR was introduced, and in total, 51 (82) novel genes (transcripts) were identified. Lastly, we present here the first reference genome for Trachidermus fasciatus; this sequence is expected to greatly facilitate future research on this species. MDPI 2021-05-06 /pmc/articles/PMC8148166/ /pubmed/34066304 http://dx.doi.org/10.3390/genes12050692 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Xie, Gangcai Zhang, Xu Lv, Feng Sang, Mengmeng Hu, Hairong Wang, Jinqiu Liu, Dong Nanopore Sequencing and Hi-C Based De Novo Assembly of Trachidermus fasciatus Genome |
title | Nanopore Sequencing and Hi-C Based De Novo Assembly of Trachidermus fasciatus Genome |
title_full | Nanopore Sequencing and Hi-C Based De Novo Assembly of Trachidermus fasciatus Genome |
title_fullStr | Nanopore Sequencing and Hi-C Based De Novo Assembly of Trachidermus fasciatus Genome |
title_full_unstemmed | Nanopore Sequencing and Hi-C Based De Novo Assembly of Trachidermus fasciatus Genome |
title_short | Nanopore Sequencing and Hi-C Based De Novo Assembly of Trachidermus fasciatus Genome |
title_sort | nanopore sequencing and hi-c based de novo assembly of trachidermus fasciatus genome |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8148166/ https://www.ncbi.nlm.nih.gov/pubmed/34066304 http://dx.doi.org/10.3390/genes12050692 |
work_keys_str_mv | AT xiegangcai nanoporesequencingandhicbaseddenovoassemblyoftrachidermusfasciatusgenome AT zhangxu nanoporesequencingandhicbaseddenovoassemblyoftrachidermusfasciatusgenome AT lvfeng nanoporesequencingandhicbaseddenovoassemblyoftrachidermusfasciatusgenome AT sangmengmeng nanoporesequencingandhicbaseddenovoassemblyoftrachidermusfasciatusgenome AT huhairong nanoporesequencingandhicbaseddenovoassemblyoftrachidermusfasciatusgenome AT wangjinqiu nanoporesequencingandhicbaseddenovoassemblyoftrachidermusfasciatusgenome AT liudong nanoporesequencingandhicbaseddenovoassemblyoftrachidermusfasciatusgenome |