The first transcriptome sequencing and data analysis of the Javan mahseer (Tor tambra)

The Javan mahseer (Tor tambra) is one of the most valuable freshwater fish in the genus Tor. To date, other than mitogenomic data (BioProject: PRJNA422829), genomic and transcriptomic resources for this species are still lacking, although they are crucial for understanding the molecular mechanisms associated...

Bibliographic Details
Main Authors: Lau, Melinda Mei Lin; Lim, Leonard Whye Kit; Chung, Hung Hui; Gan, Han Ming
Format: Online Article Text
Language: English
Published: Elsevier 2021
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8529094/
https://www.ncbi.nlm.nih.gov/pubmed/34712757
http://dx.doi.org/10.1016/j.dib.2021.107481
_version_ 1784586393724387328
author Lau, Melinda Mei Lin
Lim, Leonard Whye Kit
Chung, Hung Hui
Gan, Han Ming
author_facet Lau, Melinda Mei Lin
Lim, Leonard Whye Kit
Chung, Hung Hui
Gan, Han Ming
author_sort Lau, Melinda Mei Lin
collection PubMed
description The Javan mahseer (Tor tambra) is one of the most valuable freshwater fish in the genus Tor. To date, other than mitogenomic data (BioProject: PRJNA422829), genomic and transcriptomic resources for this species are still lacking, although they are crucial for understanding the molecular mechanisms associated with important traits such as growth, immune response, reproduction and sex determination. For the first time, we sequenced the transcriptome of a whole juvenile fish using the Illumina NovaSeq 6000 platform, generating raw paired-end reads. De novo transcriptome assembly generated a draft transcriptome (BUSCO5 completeness of 91.2% [Actinopterygii_odb10 database]) consisting of 259,403 putative transcripts with a total length of 333,881,215 bp and an N50 length of 2,283 bp. A total of 77,503 non-redundant protein-coding sequences were predicted from the transcripts and used for functional annotation. We mapped the predicted proteins to 304 known KEGG pathways, with the signal transduction cluster having the highest representation, followed by the immune system and endocrine system clusters. In addition, transcripts exhibiting significant similarity to previously published growth- and immune-related genes were identified, which will facilitate future molecular breeding of Tor tambra.
format Online
Article
Text
id pubmed-8529094
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-8529094 2021-10-27 The first transcriptome sequencing and data analysis of the Javan mahseer (Tor tambra) Lau, Melinda Mei Lin Lim, Leonard Whye Kit Chung, Hung Hui Gan, Han Ming Data Brief Data Article The Javan mahseer (Tor tambra) is one of the most valuable freshwater fish in the genus Tor. To date, other than mitogenomic data (BioProject: PRJNA422829), genomic and transcriptomic resources for this species are still lacking, although they are crucial for understanding the molecular mechanisms associated with important traits such as growth, immune response, reproduction and sex determination. For the first time, we sequenced the transcriptome of a whole juvenile fish using the Illumina NovaSeq 6000 platform, generating raw paired-end reads. De novo transcriptome assembly generated a draft transcriptome (BUSCO5 completeness of 91.2% [Actinopterygii_odb10 database]) consisting of 259,403 putative transcripts with a total length of 333,881,215 bp and an N50 length of 2,283 bp. A total of 77,503 non-redundant protein-coding sequences were predicted from the transcripts and used for functional annotation. We mapped the predicted proteins to 304 known KEGG pathways, with the signal transduction cluster having the highest representation, followed by the immune system and endocrine system clusters. In addition, transcripts exhibiting significant similarity to previously published growth- and immune-related genes were identified, which will facilitate future molecular breeding of Tor tambra. Elsevier 2021-10-14 /pmc/articles/PMC8529094/ /pubmed/34712757 http://dx.doi.org/10.1016/j.dib.2021.107481 Text en © 2021 The Author(s). Published by Elsevier Inc. https://creativecommons.org/licenses/by/4.0/ This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Data Article
Lau, Melinda Mei Lin
Lim, Leonard Whye Kit
Chung, Hung Hui
Gan, Han Ming
The first transcriptome sequencing and data analysis of the Javan mahseer (Tor tambra)
title The first transcriptome sequencing and data analysis of the Javan mahseer (Tor tambra)
title_full The first transcriptome sequencing and data analysis of the Javan mahseer (Tor tambra)
title_fullStr The first transcriptome sequencing and data analysis of the Javan mahseer (Tor tambra)
title_full_unstemmed The first transcriptome sequencing and data analysis of the Javan mahseer (Tor tambra)
title_short The first transcriptome sequencing and data analysis of the Javan mahseer (Tor tambra)
title_sort first transcriptome sequencing and data analysis of the javan mahseer (tor tambra)
topic Data Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8529094/
https://www.ncbi.nlm.nih.gov/pubmed/34712757
http://dx.doi.org/10.1016/j.dib.2021.107481
work_keys_str_mv AT laumelindameilin thefirsttranscriptomesequencinganddataanalysisofthejavanmahseertortambra
AT limleonardwhyekit thefirsttranscriptomesequencinganddataanalysisofthejavanmahseertortambra
AT chunghunghui thefirsttranscriptomesequencinganddataanalysisofthejavanmahseertortambra
AT ganhanming thefirsttranscriptomesequencinganddataanalysisofthejavanmahseertortambra
AT laumelindameilin firsttranscriptomesequencinganddataanalysisofthejavanmahseertortambra
AT limleonardwhyekit firsttranscriptomesequencinganddataanalysisofthejavanmahseertortambra
AT chunghunghui firsttranscriptomesequencinganddataanalysisofthejavanmahseertortambra
AT ganhanming firsttranscriptomesequencinganddataanalysisofthejavanmahseertortambra