Nanopore Sequencing and Hi-C Based De Novo Assembly of Trachidermus fasciatus Genome

Trachidermus fasciatus is a roughskin sculpin fish widespread across the coastal areas of East Asia. Due to environmental destruction and overfishing, the population of this species is under threat. In order to protect this endangered species, it is important to have the genome sequenced. Reference...

Bibliographic Details
Main authors: Xie, Gangcai; Zhang, Xu; Lv, Feng; Sang, Mengmeng; Hu, Hairong; Wang, Jinqiu; Liu, Dong
Format: Online Article Text
Language: English
Published: MDPI 2021
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8148166/
https://www.ncbi.nlm.nih.gov/pubmed/34066304
http://dx.doi.org/10.3390/genes12050692
_version_ 1783697793107361792
author Xie, Gangcai
Zhang, Xu
Lv, Feng
Sang, Mengmeng
Hu, Hairong
Wang, Jinqiu
Liu, Dong
author_sort Xie, Gangcai
collection PubMed
description Trachidermus fasciatus is a roughskin sculpin fish widespread across the coastal areas of East Asia. Due to environmental destruction and overfishing, the population of this species is under threat. In order to protect this endangered species, it is important to have the genome sequenced. Reference genomes are essential for studying population genetics, domestic farming, and genetic resource protection. However, currently, no reference genome is available for Trachidermus fasciatus, and this has greatly hindered the research on this species. In this study, we integrated nanopore long-read sequencing, Illumina short-read sequencing, and Hi-C methods to thoroughly assemble the Trachidermus fasciatus genome. Our results provided a chromosome-level high-quality genome assembly with a predicted genome size of 542.6 Mbp (2n = 40) and a scaffold N50 of 24.9 Mbp. The BUSCO value for genome assembly completeness was higher than 96%, and the single-base accuracy was 99.997%. Based on EVM-StringTie genome annotation, a total of 19,147 protein-coding genes were identified, including 35,093 mRNA transcripts. In addition, a novel gene-finding strategy named RNR was introduced, and in total, 51 (82) novel genes (transcripts) were identified. Lastly, we present here the first reference genome for Trachidermus fasciatus; this sequence is expected to greatly facilitate future research on this species.
format Online
Article
Text
id pubmed-8148166
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-8148166 2021-05-26 Nanopore Sequencing and Hi-C Based De Novo Assembly of Trachidermus fasciatus Genome Xie, Gangcai Zhang, Xu Lv, Feng Sang, Mengmeng Hu, Hairong Wang, Jinqiu Liu, Dong Genes (Basel) Article Trachidermus fasciatus is a roughskin sculpin fish widespread across the coastal areas of East Asia. Due to environmental destruction and overfishing, the population of this species is under threat. In order to protect this endangered species, it is important to have the genome sequenced. Reference genomes are essential for studying population genetics, domestic farming, and genetic resource protection. However, currently, no reference genome is available for Trachidermus fasciatus, and this has greatly hindered the research on this species. In this study, we integrated nanopore long-read sequencing, Illumina short-read sequencing, and Hi-C methods to thoroughly assemble the Trachidermus fasciatus genome. Our results provided a chromosome-level high-quality genome assembly with a predicted genome size of 542.6 Mbp (2n = 40) and a scaffold N50 of 24.9 Mbp. The BUSCO value for genome assembly completeness was higher than 96%, and the single-base accuracy was 99.997%. Based on EVM-StringTie genome annotation, a total of 19,147 protein-coding genes were identified, including 35,093 mRNA transcripts. In addition, a novel gene-finding strategy named RNR was introduced, and in total, 51 (82) novel genes (transcripts) were identified. Lastly, we present here the first reference genome for Trachidermus fasciatus; this sequence is expected to greatly facilitate future research on this species. MDPI 2021-05-06 /pmc/articles/PMC8148166/ /pubmed/34066304 http://dx.doi.org/10.3390/genes12050692 Text en © 2021 by the authors. Licensee MDPI, Basel, Switzerland. https://creativecommons.org/licenses/by/4.0/
This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
title Nanopore Sequencing and Hi-C Based De Novo Assembly of Trachidermus fasciatus Genome
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8148166/
https://www.ncbi.nlm.nih.gov/pubmed/34066304
http://dx.doi.org/10.3390/genes12050692