EuroPineDB: a high-coverage web database for maritime pine transcriptome

BACKGROUND: Pinus pinaster is an economically and ecologically important species that is becoming a woody gymnosperm model. Its enormous genome size makes whole-genome sequencing approaches hard to apply. Therefore, the expressed portion of the genome has to be characterised and the results and...

Bibliographic Details
Main authors: Fernández-Pozo, Noé, Canales, Javier, Guerrero-Fernández, Darío, Villalobos, David P, Díaz-Moreno, Sara M, Bautista, Rocío, Flores-Monterroso, Arantxa, Guevara, M Ángeles, Perdiguero, Pedro, Collada, Carmen, Cervera, M Teresa, Soto, Álvaro, Ordás, Ricardo, Cantón, Francisco R, Avila, Concepción, Cánovas, Francisco M, Claros, M Gonzalo
Format: Online Article Text
Language: English
Published: BioMed Central 2011
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3152544/
https://www.ncbi.nlm.nih.gov/pubmed/21762488
http://dx.doi.org/10.1186/1471-2164-12-366
author Fernández-Pozo, Noé
Canales, Javier
Guerrero-Fernández, Darío
Villalobos, David P
Díaz-Moreno, Sara M
Bautista, Rocío
Flores-Monterroso, Arantxa
Guevara, M Ángeles
Perdiguero, Pedro
Collada, Carmen
Cervera, M Teresa
Soto, Álvaro
Ordás, Ricardo
Cantón, Francisco R
Avila, Concepción
Cánovas, Francisco M
Claros, M Gonzalo
author_sort Fernández-Pozo, Noé
collection PubMed
description BACKGROUND: Pinus pinaster is an economically and ecologically important species that is becoming a woody gymnosperm model. Its enormous genome size makes whole-genome sequencing approaches hard to apply. Therefore, the expressed portion of the genome has to be characterised and the results and annotations have to be stored in dedicated databases. DESCRIPTION: EuroPineDB is the largest sequence collection available for a single pine species, Pinus pinaster (maritime pine), since it comprises 951 641 raw sequence reads obtained from non-normalised cDNA libraries and high-throughput sequencing from adult (xylem, phloem, roots, stem, needles, cones, strobili) and embryonic (germinated embryos, buds, callus) maritime pine tissues. Using open-source tools, sequences were optimally pre-processed, assembled, and extensively annotated (GO, EC and KEGG terms, descriptions, SNPs, SSRs, ORFs and InterPro codes). As a result, a 10.5× P. pinaster genome was covered and assembled in 55 322 UniGenes. A total of 32 919 (59.5%) of P. pinaster UniGenes were annotated with at least one description, revealing at least 18 466 different genes. The complete database, which is designed to be scalable, maintainable, and expandable, is freely available at: http://www.scbi.uma.es/pindb/. It can be retrieved by gene libraries, pine species, annotations, UniGenes and microarrays (i.e., the sequences are distributed in two-colour microarrays; this is the only conifer database that provides this information) and will be periodically updated. Small assemblies can be viewed using a dedicated visualisation tool that connects them with SNPs. Any sequence or annotation set shown on-screen can be downloaded. Retrieval mechanisms for sequences and gene annotations are provided.
CONCLUSIONS: The EuroPineDB with its integrated information can be used to reveal new knowledge, offers an easy-to-use collection of information to directly support experimental work (including microarray hybridisation), and provides deeper knowledge on the maritime pine transcriptome.
format Online
Article
Text
id pubmed-3152544
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-31525442011-08-09 EuroPineDB: a high-coverage web database for maritime pine transcriptome Fernández-Pozo, Noé Canales, Javier Guerrero-Fernández, Darío Villalobos, David P Díaz-Moreno, Sara M Bautista, Rocío Flores-Monterroso, Arantxa Guevara, M Ángeles Perdiguero, Pedro Collada, Carmen Cervera, M Teresa Soto, Álvaro Ordás, Ricardo Cantón, Francisco R Avila, Concepción Cánovas, Francisco M Claros, M Gonzalo BMC Genomics Database BACKGROUND: Pinus pinaster is an economically and ecologically important species that is becoming a woody gymnosperm model. Its enormous genome size makes whole-genome sequencing approaches hard to apply. Therefore, the expressed portion of the genome has to be characterised and the results and annotations have to be stored in dedicated databases. DESCRIPTION: EuroPineDB is the largest sequence collection available for a single pine species, Pinus pinaster (maritime pine), since it comprises 951 641 raw sequence reads obtained from non-normalised cDNA libraries and high-throughput sequencing from adult (xylem, phloem, roots, stem, needles, cones, strobili) and embryonic (germinated embryos, buds, callus) maritime pine tissues. Using open-source tools, sequences were optimally pre-processed, assembled, and extensively annotated (GO, EC and KEGG terms, descriptions, SNPs, SSRs, ORFs and InterPro codes). As a result, a 10.5× P. pinaster genome was covered and assembled in 55 322 UniGenes. A total of 32 919 (59.5%) of P. pinaster UniGenes were annotated with at least one description, revealing at least 18 466 different genes. The complete database, which is designed to be scalable, maintainable, and expandable, is freely available at: http://www.scbi.uma.es/pindb/. It can be retrieved by gene libraries, pine species, annotations, UniGenes and microarrays (i.e., the sequences are distributed in two-colour microarrays; this is the only conifer database that provides this information) and will be periodically updated.
Small assemblies can be viewed using a dedicated visualisation tool that connects them with SNPs. Any sequence or annotation set shown on-screen can be downloaded. Retrieval mechanisms for sequences and gene annotations are provided. CONCLUSIONS: The EuroPineDB with its integrated information can be used to reveal new knowledge, offers an easy-to-use collection of information to directly support experimental work (including microarray hybridisation), and provides deeper knowledge on the maritime pine transcriptome. BioMed Central 2011-07-15 /pmc/articles/PMC3152544/ /pubmed/21762488 http://dx.doi.org/10.1186/1471-2164-12-366 Text en Copyright ©2011 Fernández-Pozo et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
title EuroPineDB: a high-coverage web database for maritime pine transcriptome
topic Database
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3152544/
https://www.ncbi.nlm.nih.gov/pubmed/21762488
http://dx.doi.org/10.1186/1471-2164-12-366