Cargando…
Establishing gene models from the Pinus pinaster genome using gene capture and BAC sequencing
BACKGROUND: In the era of DNA throughput sequencing, assembling and understanding gymnosperm mega-genomes remains a challenge. Although drafts of three conifer genomes have recently been published, this number is too low to understand the full complexity of conifer genomes. Using techniques focused...
Autores principales: | , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4769843/ https://www.ncbi.nlm.nih.gov/pubmed/26922242 http://dx.doi.org/10.1186/s12864-016-2490-z |
_version_ | 1782418167598940160 |
---|---|
author | Seoane-Zonjic, Pedro Cañas, Rafael A. Bautista, Rocío Gómez-Maldonado, Josefa Arrillaga, Isabel Fernández-Pozo, Noé Claros, M. Gonzalo Cánovas, Francisco M. Ávila, Concepción |
author_facet | Seoane-Zonjic, Pedro Cañas, Rafael A. Bautista, Rocío Gómez-Maldonado, Josefa Arrillaga, Isabel Fernández-Pozo, Noé Claros, M. Gonzalo Cánovas, Francisco M. Ávila, Concepción |
author_sort | Seoane-Zonjic, Pedro |
collection | PubMed |
description | BACKGROUND: In the era of DNA throughput sequencing, assembling and understanding gymnosperm mega-genomes remains a challenge. Although drafts of three conifer genomes have recently been published, this number is too low to understand the full complexity of conifer genomes. Using techniques focused on specific genes, gene models can be established that can aid in the assembly of gene-rich regions, and this information can be used to compare genomes and understand functional evolution. RESULTS: In this study, gene capture technology combined with BAC isolation and sequencing was used as an experimental approach to establish de novo gene structures without a reference genome. Probes were designed for 866 maritime pine transcripts to sequence genes captured from genomic DNA. The gene models were constructed using GeneAssembler, a new bioinformatic pipeline, which reconstructed over 82 % of the gene structures, and a high proportion (85 %) of the captured gene models contained sequences from the promoter regulatory region. In a parallel experiment, the P. pinaster BAC library was screened to isolate clones containing genes whose cDNA sequence were already available. BAC clones containing the asparagine synthetase, sucrose synthase and xyloglucan endotransglycosylase gene sequences were isolated and used in this study. The gene models derived from the gene capture approach were compared with the genomic sequences derived from the BAC clones. This combined approach is a particularly efficient way to capture the genomic structures of gene families with a small number of members. CONCLUSIONS: The experimental approach used in this study is a valuable combined technique to study genomic gene structures in species for which a reference genome is unavailable. It can be used to establish exon/intron boundaries in unknown gene structures, to reconstruct incomplete genes and to obtain promoter sequences that can be used for transcriptional studies. A bioinformatics algorithm (GeneAssembler) is also provided as a Ruby gem for this class of analyses. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-016-2490-z) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-4769843 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-47698432016-02-29 Establishing gene models from the Pinus pinaster genome using gene capture and BAC sequencing Seoane-Zonjic, Pedro Cañas, Rafael A. Bautista, Rocío Gómez-Maldonado, Josefa Arrillaga, Isabel Fernández-Pozo, Noé Claros, M. Gonzalo Cánovas, Francisco M. Ávila, Concepción BMC Genomics Research Article BACKGROUND: In the era of DNA throughput sequencing, assembling and understanding gymnosperm mega-genomes remains a challenge. Although drafts of three conifer genomes have recently been published, this number is too low to understand the full complexity of conifer genomes. Using techniques focused on specific genes, gene models can be established that can aid in the assembly of gene-rich regions, and this information can be used to compare genomes and understand functional evolution. RESULTS: In this study, gene capture technology combined with BAC isolation and sequencing was used as an experimental approach to establish de novo gene structures without a reference genome. Probes were designed for 866 maritime pine transcripts to sequence genes captured from genomic DNA. The gene models were constructed using GeneAssembler, a new bioinformatic pipeline, which reconstructed over 82 % of the gene structures, and a high proportion (85 %) of the captured gene models contained sequences from the promoter regulatory region. In a parallel experiment, the P. pinaster BAC library was screened to isolate clones containing genes whose cDNA sequence were already available. BAC clones containing the asparagine synthetase, sucrose synthase and xyloglucan endotransglycosylase gene sequences were isolated and used in this study. The gene models derived from the gene capture approach were compared with the genomic sequences derived from the BAC clones. This combined approach is a particularly efficient way to capture the genomic structures of gene families with a small number of members. CONCLUSIONS: The experimental approach used in this study is a valuable combined technique to study genomic gene structures in species for which a reference genome is unavailable. It can be used to establish exon/intron boundaries in unknown gene structures, to reconstruct incomplete genes and to obtain promoter sequences that can be used for transcriptional studies. A bioinformatics algorithm (GeneAssembler) is also provided as a Ruby gem for this class of analyses. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-016-2490-z) contains supplementary material, which is available to authorized users. BioMed Central 2016-02-27 /pmc/articles/PMC4769843/ /pubmed/26922242 http://dx.doi.org/10.1186/s12864-016-2490-z Text en © Seoane-Zonjic et al. 2016 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Article Seoane-Zonjic, Pedro Cañas, Rafael A. Bautista, Rocío Gómez-Maldonado, Josefa Arrillaga, Isabel Fernández-Pozo, Noé Claros, M. Gonzalo Cánovas, Francisco M. Ávila, Concepción Establishing gene models from the Pinus pinaster genome using gene capture and BAC sequencing |
title | Establishing gene models from the Pinus pinaster genome using gene capture and BAC sequencing |
title_full | Establishing gene models from the Pinus pinaster genome using gene capture and BAC sequencing |
title_fullStr | Establishing gene models from the Pinus pinaster genome using gene capture and BAC sequencing |
title_full_unstemmed | Establishing gene models from the Pinus pinaster genome using gene capture and BAC sequencing |
title_short | Establishing gene models from the Pinus pinaster genome using gene capture and BAC sequencing |
title_sort | establishing gene models from the pinus pinaster genome using gene capture and bac sequencing |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4769843/ https://www.ncbi.nlm.nih.gov/pubmed/26922242 http://dx.doi.org/10.1186/s12864-016-2490-z |
work_keys_str_mv | AT seoanezonjicpedro establishinggenemodelsfromthepinuspinastergenomeusinggenecaptureandbacsequencing AT canasrafaela establishinggenemodelsfromthepinuspinastergenomeusinggenecaptureandbacsequencing AT bautistarocio establishinggenemodelsfromthepinuspinastergenomeusinggenecaptureandbacsequencing AT gomezmaldonadojosefa establishinggenemodelsfromthepinuspinastergenomeusinggenecaptureandbacsequencing AT arrillagaisabel establishinggenemodelsfromthepinuspinastergenomeusinggenecaptureandbacsequencing AT fernandezpozonoe establishinggenemodelsfromthepinuspinastergenomeusinggenecaptureandbacsequencing AT clarosmgonzalo establishinggenemodelsfromthepinuspinastergenomeusinggenecaptureandbacsequencing AT canovasfranciscom establishinggenemodelsfromthepinuspinastergenomeusinggenecaptureandbacsequencing AT avilaconcepcion establishinggenemodelsfromthepinuspinastergenomeusinggenecaptureandbacsequencing |