Cargando…

Transcriptome Analysis of the Mud Crab (Scylla paramamosain) by 454 Deep Sequencing: Assembly, Annotation, and Marker Discovery

In this study, we reported the characterization of the first transcriptome of the mud crab (Scylla paramamosain). Pooled cDNAs of four tissue types from twelve wild individuals were sequenced using the Roche 454 FLX platform. Analysis performed included de novo assembly of transcriptome sequences, f...

Descripción completa

Detalles Bibliográficos
Autores principales: Ma, Hongyu, Ma, Chunyan, Li, Shujuan, Jiang, Wei, Li, Xincang, Liu, Yuexing, Ma, Lingbo
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4108364/
https://www.ncbi.nlm.nih.gov/pubmed/25054331
http://dx.doi.org/10.1371/journal.pone.0102668
_version_ 1782327743971590144
author Ma, Hongyu
Ma, Chunyan
Li, Shujuan
Jiang, Wei
Li, Xincang
Liu, Yuexing
Ma, Lingbo
author_facet Ma, Hongyu
Ma, Chunyan
Li, Shujuan
Jiang, Wei
Li, Xincang
Liu, Yuexing
Ma, Lingbo
author_sort Ma, Hongyu
collection PubMed
description In this study, we reported the characterization of the first transcriptome of the mud crab (Scylla paramamosain). Pooled cDNAs of four tissue types from twelve wild individuals were sequenced using the Roche 454 FLX platform. Analysis performed included de novo assembly of transcriptome sequences, functional annotation, and molecular marker discovery. A total of 1,314,101 high quality reads with an average length of 411 bp were generated by 454 sequencing on a mixed cDNA library. De novo assembly of these 1,314,101 reads produced 76,778 contigs (consisting of 818,154 reads) with 5.4-fold average sequencing coverage. The remaining 495,947 reads were singletons. A total of 78,268 unigenes were identified based on sequence similarity with known proteins (E≤0.00001) in UniProt and non-redundant protein databases. Meanwhile, 44,433 sequences were identified (E≤0.00001) using a BLASTN search against the NCBI nucleotide database. Gene Ontology (GO) analysis indicated that biosynthetic process, cell part, and ion binding were the most abundant terms in biological process, cellular component, and molecular function categories, respectively. Kyoto Encyclopedia of Genes and Genome (KEGG) pathway analysis revealed that 4,878 unigenes distributed in 281 different pathways. In addition, 19,011 microsatellites and 37,063 potential single nucleotide polymorphisms were detected from the transcriptome of S. paramamosain. Finally, thirty polymorphic microsatellite markers were developed and used to assess genetic diversity of a wild population of S. paramamosain. So far, existing sequence resources for S. paramamosain are extremely limited. The present study provides a characterization of transcriptome from multiple tissues and individuals, as well as an assessment of genetic diversity of a wild population. These sequence resources will facilitate the investigation of population genetic diversity, the development of genetic maps, and the conduct of molecular marker-assisted breeding in S. paramamosain and related crab species.
format Online
Article
Text
id pubmed-4108364
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-41083642014-07-24 Transcriptome Analysis of the Mud Crab (Scylla paramamosain) by 454 Deep Sequencing: Assembly, Annotation, and Marker Discovery Ma, Hongyu Ma, Chunyan Li, Shujuan Jiang, Wei Li, Xincang Liu, Yuexing Ma, Lingbo PLoS One Research Article In this study, we reported the characterization of the first transcriptome of the mud crab (Scylla paramamosain). Pooled cDNAs of four tissue types from twelve wild individuals were sequenced using the Roche 454 FLX platform. Analysis performed included de novo assembly of transcriptome sequences, functional annotation, and molecular marker discovery. A total of 1,314,101 high quality reads with an average length of 411 bp were generated by 454 sequencing on a mixed cDNA library. De novo assembly of these 1,314,101 reads produced 76,778 contigs (consisting of 818,154 reads) with 5.4-fold average sequencing coverage. The remaining 495,947 reads were singletons. A total of 78,268 unigenes were identified based on sequence similarity with known proteins (E≤0.00001) in UniProt and non-redundant protein databases. Meanwhile, 44,433 sequences were identified (E≤0.00001) using a BLASTN search against the NCBI nucleotide database. Gene Ontology (GO) analysis indicated that biosynthetic process, cell part, and ion binding were the most abundant terms in biological process, cellular component, and molecular function categories, respectively. Kyoto Encyclopedia of Genes and Genome (KEGG) pathway analysis revealed that 4,878 unigenes distributed in 281 different pathways. In addition, 19,011 microsatellites and 37,063 potential single nucleotide polymorphisms were detected from the transcriptome of S. paramamosain. Finally, thirty polymorphic microsatellite markers were developed and used to assess genetic diversity of a wild population of S. paramamosain. So far, existing sequence resources for S. paramamosain are extremely limited. The present study provides a characterization of transcriptome from multiple tissues and individuals, as well as an assessment of genetic diversity of a wild population. These sequence resources will facilitate the investigation of population genetic diversity, the development of genetic maps, and the conduct of molecular marker-assisted breeding in S. paramamosain and related crab species. Public Library of Science 2014-07-23 /pmc/articles/PMC4108364/ /pubmed/25054331 http://dx.doi.org/10.1371/journal.pone.0102668 Text en © 2014 Ma et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Ma, Hongyu
Ma, Chunyan
Li, Shujuan
Jiang, Wei
Li, Xincang
Liu, Yuexing
Ma, Lingbo
Transcriptome Analysis of the Mud Crab (Scylla paramamosain) by 454 Deep Sequencing: Assembly, Annotation, and Marker Discovery
title Transcriptome Analysis of the Mud Crab (Scylla paramamosain) by 454 Deep Sequencing: Assembly, Annotation, and Marker Discovery
title_full Transcriptome Analysis of the Mud Crab (Scylla paramamosain) by 454 Deep Sequencing: Assembly, Annotation, and Marker Discovery
title_fullStr Transcriptome Analysis of the Mud Crab (Scylla paramamosain) by 454 Deep Sequencing: Assembly, Annotation, and Marker Discovery
title_full_unstemmed Transcriptome Analysis of the Mud Crab (Scylla paramamosain) by 454 Deep Sequencing: Assembly, Annotation, and Marker Discovery
title_short Transcriptome Analysis of the Mud Crab (Scylla paramamosain) by 454 Deep Sequencing: Assembly, Annotation, and Marker Discovery
title_sort transcriptome analysis of the mud crab (scylla paramamosain) by 454 deep sequencing: assembly, annotation, and marker discovery
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4108364/
https://www.ncbi.nlm.nih.gov/pubmed/25054331
http://dx.doi.org/10.1371/journal.pone.0102668
work_keys_str_mv AT mahongyu transcriptomeanalysisofthemudcrabscyllaparamamosainby454deepsequencingassemblyannotationandmarkerdiscovery
AT machunyan transcriptomeanalysisofthemudcrabscyllaparamamosainby454deepsequencingassemblyannotationandmarkerdiscovery
AT lishujuan transcriptomeanalysisofthemudcrabscyllaparamamosainby454deepsequencingassemblyannotationandmarkerdiscovery
AT jiangwei transcriptomeanalysisofthemudcrabscyllaparamamosainby454deepsequencingassemblyannotationandmarkerdiscovery
AT lixincang transcriptomeanalysisofthemudcrabscyllaparamamosainby454deepsequencingassemblyannotationandmarkerdiscovery
AT liuyuexing transcriptomeanalysisofthemudcrabscyllaparamamosainby454deepsequencingassemblyannotationandmarkerdiscovery
AT malingbo transcriptomeanalysisofthemudcrabscyllaparamamosainby454deepsequencingassemblyannotationandmarkerdiscovery