Cargando…

Insights into the Loblolly Pine Genome: Characterization of BAC and Fosmid Sequences

Despite their prevalence and importance, the genome sequences of loblolly pine, Norway spruce, and white spruce, three ecologically and economically important conifer species, are just becoming available to the research community. Following the completion of these large assemblies, annotation effort...

Descripción completa

Detalles Bibliográficos
Autores principales: Wegrzyn, Jill L., Lin, Brian Y., Zieve, Jacob J., Dougherty, William M., Martínez-García, Pedro J., Koriabine, Maxim, Holtz-Morris, Ann, deJong, Pieter, Crepeau, Marc, Langley, Charles H., Puiu, Daniela, Salzberg, Steven L., Neale, David B., Stevens, Kristian A.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3762812/
https://www.ncbi.nlm.nih.gov/pubmed/24023741
http://dx.doi.org/10.1371/journal.pone.0072439
_version_ 1782282938757414912
author Wegrzyn, Jill L.
Lin, Brian Y.
Zieve, Jacob J.
Dougherty, William M.
Martínez-García, Pedro J.
Koriabine, Maxim
Holtz-Morris, Ann
deJong, Pieter
Crepeau, Marc
Langley, Charles H.
Puiu, Daniela
Salzberg, Steven L.
Neale, David B.
Stevens, Kristian A.
author_facet Wegrzyn, Jill L.
Lin, Brian Y.
Zieve, Jacob J.
Dougherty, William M.
Martínez-García, Pedro J.
Koriabine, Maxim
Holtz-Morris, Ann
deJong, Pieter
Crepeau, Marc
Langley, Charles H.
Puiu, Daniela
Salzberg, Steven L.
Neale, David B.
Stevens, Kristian A.
author_sort Wegrzyn, Jill L.
collection PubMed
description Despite their prevalence and importance, the genome sequences of loblolly pine, Norway spruce, and white spruce, three ecologically and economically important conifer species, are just becoming available to the research community. Following the completion of these large assemblies, annotation efforts will be undertaken to characterize the reference sequences. Accurate annotation of these ancient genomes would be aided by a comprehensive repeat library; however, few studies have generated enough sequence to fully evaluate and catalog their non-genic content. In this paper, two sets of loblolly pine genomic sequence, 103 previously assembled BACs and 90,954 newly sequenced and assembled fosmid scaffolds, were analyzed. Together, this sequence represents 280 Mbp (roughly 1% of the loblolly pine genome) and one of the most comprehensive studies of repetitive elements and genes in a gymnosperm species. A combination of homology and de novo methodologies were applied to identify both conserved and novel repeats. Similarity analysis estimated a repetitive content of 27% that included both full and partial elements. When combined with the de novo investigation, the estimate increased to almost 86%. Over 60% of the repetitive sequence consists of full or partial LTR (long terminal repeat) retrotransposons. Through de novo approaches, 6,270 novel, full-length transposable element families and 9,415 sub-families were identified. Among those 6,270 families, 82% were annotated as single-copy. Several of the novel, high-copy families are described here, with the largest, PtPiedmont, comprising 133 full-length copies. In addition to repeats, analysis of the coding region reported 23 full-length eukaryotic orthologous proteins (KOGS) and another 29 novel or orthologous genes. These discoveries, along with other genomic resources, will be used to annotate conifer genomes and address long-standing questions about gymnosperm evolution.
format Online
Article
Text
id pubmed-3762812
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-37628122013-09-10 Insights into the Loblolly Pine Genome: Characterization of BAC and Fosmid Sequences Wegrzyn, Jill L. Lin, Brian Y. Zieve, Jacob J. Dougherty, William M. Martínez-García, Pedro J. Koriabine, Maxim Holtz-Morris, Ann deJong, Pieter Crepeau, Marc Langley, Charles H. Puiu, Daniela Salzberg, Steven L. Neale, David B. Stevens, Kristian A. PLoS One Research Article Despite their prevalence and importance, the genome sequences of loblolly pine, Norway spruce, and white spruce, three ecologically and economically important conifer species, are just becoming available to the research community. Following the completion of these large assemblies, annotation efforts will be undertaken to characterize the reference sequences. Accurate annotation of these ancient genomes would be aided by a comprehensive repeat library; however, few studies have generated enough sequence to fully evaluate and catalog their non-genic content. In this paper, two sets of loblolly pine genomic sequence, 103 previously assembled BACs and 90,954 newly sequenced and assembled fosmid scaffolds, were analyzed. Together, this sequence represents 280 Mbp (roughly 1% of the loblolly pine genome) and one of the most comprehensive studies of repetitive elements and genes in a gymnosperm species. A combination of homology and de novo methodologies were applied to identify both conserved and novel repeats. Similarity analysis estimated a repetitive content of 27% that included both full and partial elements. When combined with the de novo investigation, the estimate increased to almost 86%. Over 60% of the repetitive sequence consists of full or partial LTR (long terminal repeat) retrotransposons. Through de novo approaches, 6,270 novel, full-length transposable element families and 9,415 sub-families were identified. Among those 6,270 families, 82% were annotated as single-copy. Several of the novel, high-copy families are described here, with the largest, PtPiedmont, comprising 133 full-length copies. In addition to repeats, analysis of the coding region reported 23 full-length eukaryotic orthologous proteins (KOGS) and another 29 novel or orthologous genes. These discoveries, along with other genomic resources, will be used to annotate conifer genomes and address long-standing questions about gymnosperm evolution. Public Library of Science 2013-09-04 /pmc/articles/PMC3762812/ /pubmed/24023741 http://dx.doi.org/10.1371/journal.pone.0072439 Text en © 2013 Wegrzyn et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Wegrzyn, Jill L.
Lin, Brian Y.
Zieve, Jacob J.
Dougherty, William M.
Martínez-García, Pedro J.
Koriabine, Maxim
Holtz-Morris, Ann
deJong, Pieter
Crepeau, Marc
Langley, Charles H.
Puiu, Daniela
Salzberg, Steven L.
Neale, David B.
Stevens, Kristian A.
Insights into the Loblolly Pine Genome: Characterization of BAC and Fosmid Sequences
title Insights into the Loblolly Pine Genome: Characterization of BAC and Fosmid Sequences
title_full Insights into the Loblolly Pine Genome: Characterization of BAC and Fosmid Sequences
title_fullStr Insights into the Loblolly Pine Genome: Characterization of BAC and Fosmid Sequences
title_full_unstemmed Insights into the Loblolly Pine Genome: Characterization of BAC and Fosmid Sequences
title_short Insights into the Loblolly Pine Genome: Characterization of BAC and Fosmid Sequences
title_sort insights into the loblolly pine genome: characterization of bac and fosmid sequences
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3762812/
https://www.ncbi.nlm.nih.gov/pubmed/24023741
http://dx.doi.org/10.1371/journal.pone.0072439
work_keys_str_mv AT wegrzynjilll insightsintotheloblollypinegenomecharacterizationofbacandfosmidsequences
AT linbriany insightsintotheloblollypinegenomecharacterizationofbacandfosmidsequences
AT zievejacobj insightsintotheloblollypinegenomecharacterizationofbacandfosmidsequences
AT doughertywilliamm insightsintotheloblollypinegenomecharacterizationofbacandfosmidsequences
AT martinezgarciapedroj insightsintotheloblollypinegenomecharacterizationofbacandfosmidsequences
AT koriabinemaxim insightsintotheloblollypinegenomecharacterizationofbacandfosmidsequences
AT holtzmorrisann insightsintotheloblollypinegenomecharacterizationofbacandfosmidsequences
AT dejongpieter insightsintotheloblollypinegenomecharacterizationofbacandfosmidsequences
AT crepeaumarc insightsintotheloblollypinegenomecharacterizationofbacandfosmidsequences
AT langleycharlesh insightsintotheloblollypinegenomecharacterizationofbacandfosmidsequences
AT puiudaniela insightsintotheloblollypinegenomecharacterizationofbacandfosmidsequences
AT salzbergstevenl insightsintotheloblollypinegenomecharacterizationofbacandfosmidsequences
AT nealedavidb insightsintotheloblollypinegenomecharacterizationofbacandfosmidsequences
AT stevenskristiana insightsintotheloblollypinegenomecharacterizationofbacandfosmidsequences