Cargando…

Establishing gene models from the Pinus pinaster genome using gene capture and BAC sequencing

BACKGROUND: In the era of DNA throughput sequencing, assembling and understanding gymnosperm mega-genomes remains a challenge. Although drafts of three conifer genomes have recently been published, this number is too low to understand the full complexity of conifer genomes. Using techniques focused...

Descripción completa

Detalles Bibliográficos
Autores principales: Seoane-Zonjic, Pedro, Cañas, Rafael A., Bautista, Rocío, Gómez-Maldonado, Josefa, Arrillaga, Isabel, Fernández-Pozo, Noé, Claros, M. Gonzalo, Cánovas, Francisco M., Ávila, Concepción
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4769843/
https://www.ncbi.nlm.nih.gov/pubmed/26922242
http://dx.doi.org/10.1186/s12864-016-2490-z
_version_ 1782418167598940160
author Seoane-Zonjic, Pedro
Cañas, Rafael A.
Bautista, Rocío
Gómez-Maldonado, Josefa
Arrillaga, Isabel
Fernández-Pozo, Noé
Claros, M. Gonzalo
Cánovas, Francisco M.
Ávila, Concepción
author_facet Seoane-Zonjic, Pedro
Cañas, Rafael A.
Bautista, Rocío
Gómez-Maldonado, Josefa
Arrillaga, Isabel
Fernández-Pozo, Noé
Claros, M. Gonzalo
Cánovas, Francisco M.
Ávila, Concepción
author_sort Seoane-Zonjic, Pedro
collection PubMed
description BACKGROUND: In the era of DNA throughput sequencing, assembling and understanding gymnosperm mega-genomes remains a challenge. Although drafts of three conifer genomes have recently been published, this number is too low to understand the full complexity of conifer genomes. Using techniques focused on specific genes, gene models can be established that can aid in the assembly of gene-rich regions, and this information can be used to compare genomes and understand functional evolution. RESULTS: In this study, gene capture technology combined with BAC isolation and sequencing was used as an experimental approach to establish de novo gene structures without a reference genome. Probes were designed for 866 maritime pine transcripts to sequence genes captured from genomic DNA. The gene models were constructed using GeneAssembler, a new bioinformatic pipeline, which reconstructed over 82 % of the gene structures, and a high proportion (85 %) of the captured gene models contained sequences from the promoter regulatory region. In a parallel experiment, the P. pinaster BAC library was screened to isolate clones containing genes whose cDNA sequence were already available. BAC clones containing the asparagine synthetase, sucrose synthase and xyloglucan endotransglycosylase gene sequences were isolated and used in this study. The gene models derived from the gene capture approach were compared with the genomic sequences derived from the BAC clones. This combined approach is a particularly efficient way to capture the genomic structures of gene families with a small number of members. CONCLUSIONS: The experimental approach used in this study is a valuable combined technique to study genomic gene structures in species for which a reference genome is unavailable. It can be used to establish exon/intron boundaries in unknown gene structures, to reconstruct incomplete genes and to obtain promoter sequences that can be used for transcriptional studies. A bioinformatics algorithm (GeneAssembler) is also provided as a Ruby gem for this class of analyses. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-016-2490-z) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4769843
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-47698432016-02-29 Establishing gene models from the Pinus pinaster genome using gene capture and BAC sequencing Seoane-Zonjic, Pedro Cañas, Rafael A. Bautista, Rocío Gómez-Maldonado, Josefa Arrillaga, Isabel Fernández-Pozo, Noé Claros, M. Gonzalo Cánovas, Francisco M. Ávila, Concepción BMC Genomics Research Article BACKGROUND: In the era of DNA throughput sequencing, assembling and understanding gymnosperm mega-genomes remains a challenge. Although drafts of three conifer genomes have recently been published, this number is too low to understand the full complexity of conifer genomes. Using techniques focused on specific genes, gene models can be established that can aid in the assembly of gene-rich regions, and this information can be used to compare genomes and understand functional evolution. RESULTS: In this study, gene capture technology combined with BAC isolation and sequencing was used as an experimental approach to establish de novo gene structures without a reference genome. Probes were designed for 866 maritime pine transcripts to sequence genes captured from genomic DNA. The gene models were constructed using GeneAssembler, a new bioinformatic pipeline, which reconstructed over 82 % of the gene structures, and a high proportion (85 %) of the captured gene models contained sequences from the promoter regulatory region. In a parallel experiment, the P. pinaster BAC library was screened to isolate clones containing genes whose cDNA sequence were already available. BAC clones containing the asparagine synthetase, sucrose synthase and xyloglucan endotransglycosylase gene sequences were isolated and used in this study. The gene models derived from the gene capture approach were compared with the genomic sequences derived from the BAC clones. This combined approach is a particularly efficient way to capture the genomic structures of gene families with a small number of members. CONCLUSIONS: The experimental approach used in this study is a valuable combined technique to study genomic gene structures in species for which a reference genome is unavailable. It can be used to establish exon/intron boundaries in unknown gene structures, to reconstruct incomplete genes and to obtain promoter sequences that can be used for transcriptional studies. A bioinformatics algorithm (GeneAssembler) is also provided as a Ruby gem for this class of analyses. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-016-2490-z) contains supplementary material, which is available to authorized users. BioMed Central 2016-02-27 /pmc/articles/PMC4769843/ /pubmed/26922242 http://dx.doi.org/10.1186/s12864-016-2490-z Text en © Seoane-Zonjic et al. 2016 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Seoane-Zonjic, Pedro
Cañas, Rafael A.
Bautista, Rocío
Gómez-Maldonado, Josefa
Arrillaga, Isabel
Fernández-Pozo, Noé
Claros, M. Gonzalo
Cánovas, Francisco M.
Ávila, Concepción
Establishing gene models from the Pinus pinaster genome using gene capture and BAC sequencing
title Establishing gene models from the Pinus pinaster genome using gene capture and BAC sequencing
title_full Establishing gene models from the Pinus pinaster genome using gene capture and BAC sequencing
title_fullStr Establishing gene models from the Pinus pinaster genome using gene capture and BAC sequencing
title_full_unstemmed Establishing gene models from the Pinus pinaster genome using gene capture and BAC sequencing
title_short Establishing gene models from the Pinus pinaster genome using gene capture and BAC sequencing
title_sort establishing gene models from the pinus pinaster genome using gene capture and bac sequencing
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4769843/
https://www.ncbi.nlm.nih.gov/pubmed/26922242
http://dx.doi.org/10.1186/s12864-016-2490-z
work_keys_str_mv AT seoanezonjicpedro establishinggenemodelsfromthepinuspinastergenomeusinggenecaptureandbacsequencing
AT canasrafaela establishinggenemodelsfromthepinuspinastergenomeusinggenecaptureandbacsequencing
AT bautistarocio establishinggenemodelsfromthepinuspinastergenomeusinggenecaptureandbacsequencing
AT gomezmaldonadojosefa establishinggenemodelsfromthepinuspinastergenomeusinggenecaptureandbacsequencing
AT arrillagaisabel establishinggenemodelsfromthepinuspinastergenomeusinggenecaptureandbacsequencing
AT fernandezpozonoe establishinggenemodelsfromthepinuspinastergenomeusinggenecaptureandbacsequencing
AT clarosmgonzalo establishinggenemodelsfromthepinuspinastergenomeusinggenecaptureandbacsequencing
AT canovasfranciscom establishinggenemodelsfromthepinuspinastergenomeusinggenecaptureandbacsequencing
AT avilaconcepcion establishinggenemodelsfromthepinuspinastergenomeusinggenecaptureandbacsequencing