
Deep sequencing for de novo construction of a marine fish (Sparus aurata) transcriptome database with a large coverage of protein-coding transcripts


Bibliographic Details
Main Authors: Calduch-Giner, Josep A; Bermejo-Nogales, Azucena; Benedito-Palos, Laura; Estensoro, Itziar; Ballester-Lozano, Gabriel; Sitjà-Bobadilla, Ariadna; Pérez-Sánchez, Jaume
Format: Online Article Text
Language: English
Published: BioMed Central 2013
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3606596/
https://www.ncbi.nlm.nih.gov/pubmed/23497320
http://dx.doi.org/10.1186/1471-2164-14-178
author Calduch-Giner, Josep A
Bermejo-Nogales, Azucena
Benedito-Palos, Laura
Estensoro, Itziar
Ballester-Lozano, Gabriel
Sitjà-Bobadilla, Ariadna
Pérez-Sánchez, Jaume
collection PubMed
description BACKGROUND: The gilthead sea bream (Sparus aurata) is the main fish species cultured in the Mediterranean area and constitutes an interesting research model. Nevertheless, transcriptomic and genomic data are still scarce for this highly valuable species. A transcriptome database was constructed by de novo assembly of gilthead sea bream sequences derived from public repositories of mRNA and collections of expressed sequence tags together with new high-quality reads from five cDNA 454 normalized libraries of skeletal muscle (1), intestine (1), head kidney (2) and blood (1). RESULTS: Sequencing of the new 454 normalized libraries produced 2,945,914 high-quality reads and the de novo global assembly yielded 125,263 unique sequences with an average length of 727 nt. Blast analysis directed to protein and nucleotide databases annotated 63,880 sequences encoding 21,384 gene descriptions, which were curated for redundancies and frameshifting at the homopolymer regions of open reading frames, and hosted at http://www.nutrigroup-iats.org/seabreamdb. Among the annotated gene descriptions, 16,177 were mapped in the Ingenuity Pathway Analysis (IPA) database, and 10,899 were eligible for functional analysis with a representation in 341 out of 372 IPA canonical pathways. The high representation of randomly selected stickleback transcripts by Blast search in the nucleotide gilthead sea bream database demonstrated its high coverage of protein-coding transcripts. CONCLUSIONS: The newly assembled gilthead sea bream transcriptome represents progress in genomic resources for this species, as it probably contains more than 75% of actively transcribed genes, constituting a valuable tool to assist studies on functional genomics and future genome projects.
format Online
Article
Text
id pubmed-3606596
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-3606596 2013-03-25 BMC Genomics Research Article BioMed Central 2013-03-15 /pmc/articles/PMC3606596/ /pubmed/23497320 http://dx.doi.org/10.1186/1471-2164-14-178 Text en Copyright ©2013 Calduch-Giner et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
title Deep sequencing for de novo construction of a marine fish (Sparus aurata) transcriptome database with a large coverage of protein-coding transcripts
topic Research Article