
Unique Features of the Loblolly Pine (Pinus taeda L.) Megagenome Revealed Through Sequence Annotation

The largest genus in the conifer family Pinaceae is Pinus, with over 100 species. The size and complexity of their genomes (∼20–40 Gb, 2n = 24) have delayed the arrival of a well-annotated reference sequence. In this study, we present the annotation of the first whole-genome shotgun assembly of lobl...


Bibliographic Details
Main authors: Wegrzyn, Jill L., Liechty, John D., Stevens, Kristian A., Wu, Le-Shin, Loopstra, Carol A., Vasquez-Gross, Hans A., Dougherty, William M., Lin, Brian Y., Zieve, Jacob J., Martínez-García, Pedro J., Holt, Carson, Yandell, Mark, Zimin, Aleksey V., Yorke, James A., Crepeau, Marc W., Puiu, Daniela, Salzberg, Steven L., de Jong, Pieter J., Mockaitis, Keithanne, Main, Doreen, Langley, Charles H., Neale, David B.
Format: Online Article Text
Language: English
Published: Genetics Society of America 2014
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3948814/
https://www.ncbi.nlm.nih.gov/pubmed/24653211
http://dx.doi.org/10.1534/genetics.113.159996
collection PubMed
description The largest genus in the conifer family Pinaceae is Pinus, with over 100 species. The size and complexity of their genomes (∼20–40 Gb, 2n = 24) have delayed the arrival of a well-annotated reference sequence. In this study, we present the annotation of the first whole-genome shotgun assembly of loblolly pine (Pinus taeda L.), which comprises 20.1 Gb of sequence. The MAKER-P annotation pipeline combined evidence-based alignments and ab initio predictions to generate 50,172 gene models, of which 15,653 are classified as high confidence. Clustering these gene models with 13 other plant species resulted in 20,646 gene families, of which 1554 are predicted to be unique to conifers. Among the conifer gene families, 159 are composed exclusively of loblolly pine members. The gene models for loblolly pine have the highest median and mean intron lengths of 24 fully sequenced plant genomes. Conifer genomes are full of repetitive DNA, with the most significant contributions from long-terminal-repeat retrotransposons. In depth analysis of the tandem and interspersed repetitive content yielded a combined estimate of 82%.
format Online
Article
Text
id pubmed-3948814
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Genetics Society of America
record_format MEDLINE/PubMed
spelling pubmed-3948814 2015-03-01 Genetics Investigations Genetics Society of America 2014-03 Text en Copyright © 2014 by the Genetics Society of America. Available freely online through the author-supported open access option.
title Unique Features of the Loblolly Pine (Pinus taeda L.) Megagenome Revealed Through Sequence Annotation
topic Investigations