Cargando…

Genome re-annotation of the wild strawberry Fragaria vesca using extensive Illumina- and SMRT-based RNA-seq datasets

The genome of the wild diploid strawberry species Fragaria vesca, an ideal model system of cultivated strawberry (Fragaria × ananassa, octoploid) and other Rosaceae family crops, was first published in 2011 and followed by a new assembly (Fvb). However, the annotation for Fvb mainly relied on ab ini...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Yongping, Wei, Wei, Feng, Jia, Luo, Huifeng, Pi, Mengting, Liu, Zhongchi, Kang, Chunying
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5824900/
https://www.ncbi.nlm.nih.gov/pubmed/29036429
http://dx.doi.org/10.1093/dnares/dsx038
_version_ 1783302103778721792
author Li, Yongping
Wei, Wei
Feng, Jia
Luo, Huifeng
Pi, Mengting
Liu, Zhongchi
Kang, Chunying
author_facet Li, Yongping
Wei, Wei
Feng, Jia
Luo, Huifeng
Pi, Mengting
Liu, Zhongchi
Kang, Chunying
author_sort Li, Yongping
collection PubMed
description The genome of the wild diploid strawberry species Fragaria vesca, an ideal model system of cultivated strawberry (Fragaria × ananassa, octoploid) and other Rosaceae family crops, was first published in 2011 and followed by a new assembly (Fvb). However, the annotation for Fvb mainly relied on ab initio predictions and included only predicted coding sequences, therefore an improved annotation is highly desirable. Here, a new annotation version named v2.0.a2 was created for the Fvb genome by a pipeline utilizing one PacBio library, 90 Illumina RNA-seq libraries, and 9 small RNA-seq libraries. Altogether, 18,641 genes (55.6% out of 33,538 genes) were augmented with information on the 5′ and/or 3′ UTRs, 13,168 (39.3%) protein-coding genes were modified or newly identified, and 7,370 genes were found to possess alternative isoforms. In addition, 1,938 long non-coding RNAs, 171 miRNAs, and 51,714 small RNA clusters were integrated into the annotation. This new annotation of F. vesca is substantially improved in both accuracy and integrity of gene predictions, beneficial to the gene functional studies in strawberry and to the comparative genomic analysis of other horticultural crops in Rosaceae family.
format Online
Article
Text
id pubmed-5824900
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-58249002018-02-28 Genome re-annotation of the wild strawberry Fragaria vesca using extensive Illumina- and SMRT-based RNA-seq datasets Li, Yongping Wei, Wei Feng, Jia Luo, Huifeng Pi, Mengting Liu, Zhongchi Kang, Chunying DNA Res Full Papers The genome of the wild diploid strawberry species Fragaria vesca, an ideal model system of cultivated strawberry (Fragaria × ananassa, octoploid) and other Rosaceae family crops, was first published in 2011 and followed by a new assembly (Fvb). However, the annotation for Fvb mainly relied on ab initio predictions and included only predicted coding sequences, therefore an improved annotation is highly desirable. Here, a new annotation version named v2.0.a2 was created for the Fvb genome by a pipeline utilizing one PacBio library, 90 Illumina RNA-seq libraries, and 9 small RNA-seq libraries. Altogether, 18,641 genes (55.6% out of 33,538 genes) were augmented with information on the 5′ and/or 3′ UTRs, 13,168 (39.3%) protein-coding genes were modified or newly identified, and 7,370 genes were found to possess alternative isoforms. In addition, 1,938 long non-coding RNAs, 171 miRNAs, and 51,714 small RNA clusters were integrated into the annotation. This new annotation of F. vesca is substantially improved in both accuracy and integrity of gene predictions, beneficial to the gene functional studies in strawberry and to the comparative genomic analysis of other horticultural crops in Rosaceae family. Oxford University Press 2018-02 2017-09-23 /pmc/articles/PMC5824900/ /pubmed/29036429 http://dx.doi.org/10.1093/dnares/dsx038 Text en © The Author 2017. Published by Oxford University Press on behalf of Kazusa DNA Research Institute. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Full Papers
Li, Yongping
Wei, Wei
Feng, Jia
Luo, Huifeng
Pi, Mengting
Liu, Zhongchi
Kang, Chunying
Genome re-annotation of the wild strawberry Fragaria vesca using extensive Illumina- and SMRT-based RNA-seq datasets
title Genome re-annotation of the wild strawberry Fragaria vesca using extensive Illumina- and SMRT-based RNA-seq datasets
title_full Genome re-annotation of the wild strawberry Fragaria vesca using extensive Illumina- and SMRT-based RNA-seq datasets
title_fullStr Genome re-annotation of the wild strawberry Fragaria vesca using extensive Illumina- and SMRT-based RNA-seq datasets
title_full_unstemmed Genome re-annotation of the wild strawberry Fragaria vesca using extensive Illumina- and SMRT-based RNA-seq datasets
title_short Genome re-annotation of the wild strawberry Fragaria vesca using extensive Illumina- and SMRT-based RNA-seq datasets
title_sort genome re-annotation of the wild strawberry fragaria vesca using extensive illumina- and smrt-based rna-seq datasets
topic Full Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5824900/
https://www.ncbi.nlm.nih.gov/pubmed/29036429
http://dx.doi.org/10.1093/dnares/dsx038
work_keys_str_mv AT liyongping genomereannotationofthewildstrawberryfragariavescausingextensiveilluminaandsmrtbasedrnaseqdatasets
AT weiwei genomereannotationofthewildstrawberryfragariavescausingextensiveilluminaandsmrtbasedrnaseqdatasets
AT fengjia genomereannotationofthewildstrawberryfragariavescausingextensiveilluminaandsmrtbasedrnaseqdatasets
AT luohuifeng genomereannotationofthewildstrawberryfragariavescausingextensiveilluminaandsmrtbasedrnaseqdatasets
AT pimengting genomereannotationofthewildstrawberryfragariavescausingextensiveilluminaandsmrtbasedrnaseqdatasets
AT liuzhongchi genomereannotationofthewildstrawberryfragariavescausingextensiveilluminaandsmrtbasedrnaseqdatasets
AT kangchunying genomereannotationofthewildstrawberryfragariavescausingextensiveilluminaandsmrtbasedrnaseqdatasets