Cargando…
Genome re-annotation of the wild strawberry Fragaria vesca using extensive Illumina- and SMRT-based RNA-seq datasets
The genome of the wild diploid strawberry species Fragaria vesca, an ideal model system of cultivated strawberry (Fragaria × ananassa, octoploid) and other Rosaceae family crops, was first published in 2011 and followed by a new assembly (Fvb). However, the annotation for Fvb mainly relied on ab ini...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5824900/ https://www.ncbi.nlm.nih.gov/pubmed/29036429 http://dx.doi.org/10.1093/dnares/dsx038 |
_version_ | 1783302103778721792 |
---|---|
author | Li, Yongping Wei, Wei Feng, Jia Luo, Huifeng Pi, Mengting Liu, Zhongchi Kang, Chunying |
author_facet | Li, Yongping Wei, Wei Feng, Jia Luo, Huifeng Pi, Mengting Liu, Zhongchi Kang, Chunying |
author_sort | Li, Yongping |
collection | PubMed |
description | The genome of the wild diploid strawberry species Fragaria vesca, an ideal model system of cultivated strawberry (Fragaria × ananassa, octoploid) and other Rosaceae family crops, was first published in 2011 and followed by a new assembly (Fvb). However, the annotation for Fvb mainly relied on ab initio predictions and included only predicted coding sequences, therefore an improved annotation is highly desirable. Here, a new annotation version named v2.0.a2 was created for the Fvb genome by a pipeline utilizing one PacBio library, 90 Illumina RNA-seq libraries, and 9 small RNA-seq libraries. Altogether, 18,641 genes (55.6% out of 33,538 genes) were augmented with information on the 5′ and/or 3′ UTRs, 13,168 (39.3%) protein-coding genes were modified or newly identified, and 7,370 genes were found to possess alternative isoforms. In addition, 1,938 long non-coding RNAs, 171 miRNAs, and 51,714 small RNA clusters were integrated into the annotation. This new annotation of F. vesca is substantially improved in both accuracy and integrity of gene predictions, beneficial to the gene functional studies in strawberry and to the comparative genomic analysis of other horticultural crops in Rosaceae family. |
format | Online Article Text |
id | pubmed-5824900 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-58249002018-02-28 Genome re-annotation of the wild strawberry Fragaria vesca using extensive Illumina- and SMRT-based RNA-seq datasets Li, Yongping Wei, Wei Feng, Jia Luo, Huifeng Pi, Mengting Liu, Zhongchi Kang, Chunying DNA Res Full Papers The genome of the wild diploid strawberry species Fragaria vesca, an ideal model system of cultivated strawberry (Fragaria × ananassa, octoploid) and other Rosaceae family crops, was first published in 2011 and followed by a new assembly (Fvb). However, the annotation for Fvb mainly relied on ab initio predictions and included only predicted coding sequences, therefore an improved annotation is highly desirable. Here, a new annotation version named v2.0.a2 was created for the Fvb genome by a pipeline utilizing one PacBio library, 90 Illumina RNA-seq libraries, and 9 small RNA-seq libraries. Altogether, 18,641 genes (55.6% out of 33,538 genes) were augmented with information on the 5′ and/or 3′ UTRs, 13,168 (39.3%) protein-coding genes were modified or newly identified, and 7,370 genes were found to possess alternative isoforms. In addition, 1,938 long non-coding RNAs, 171 miRNAs, and 51,714 small RNA clusters were integrated into the annotation. This new annotation of F. vesca is substantially improved in both accuracy and integrity of gene predictions, beneficial to the gene functional studies in strawberry and to the comparative genomic analysis of other horticultural crops in Rosaceae family. Oxford University Press 2018-02 2017-09-23 /pmc/articles/PMC5824900/ /pubmed/29036429 http://dx.doi.org/10.1093/dnares/dsx038 Text en © The Author 2017. Published by Oxford University Press on behalf of Kazusa DNA Research Institute. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Full Papers Li, Yongping Wei, Wei Feng, Jia Luo, Huifeng Pi, Mengting Liu, Zhongchi Kang, Chunying Genome re-annotation of the wild strawberry Fragaria vesca using extensive Illumina- and SMRT-based RNA-seq datasets |
title | Genome re-annotation of the wild strawberry Fragaria vesca using extensive Illumina- and SMRT-based RNA-seq datasets |
title_full | Genome re-annotation of the wild strawberry Fragaria vesca using extensive Illumina- and SMRT-based RNA-seq datasets |
title_fullStr | Genome re-annotation of the wild strawberry Fragaria vesca using extensive Illumina- and SMRT-based RNA-seq datasets |
title_full_unstemmed | Genome re-annotation of the wild strawberry Fragaria vesca using extensive Illumina- and SMRT-based RNA-seq datasets |
title_short | Genome re-annotation of the wild strawberry Fragaria vesca using extensive Illumina- and SMRT-based RNA-seq datasets |
title_sort | genome re-annotation of the wild strawberry fragaria vesca using extensive illumina- and smrt-based rna-seq datasets |
topic | Full Papers |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5824900/ https://www.ncbi.nlm.nih.gov/pubmed/29036429 http://dx.doi.org/10.1093/dnares/dsx038 |
work_keys_str_mv | AT liyongping genomereannotationofthewildstrawberryfragariavescausingextensiveilluminaandsmrtbasedrnaseqdatasets AT weiwei genomereannotationofthewildstrawberryfragariavescausingextensiveilluminaandsmrtbasedrnaseqdatasets AT fengjia genomereannotationofthewildstrawberryfragariavescausingextensiveilluminaandsmrtbasedrnaseqdatasets AT luohuifeng genomereannotationofthewildstrawberryfragariavescausingextensiveilluminaandsmrtbasedrnaseqdatasets AT pimengting genomereannotationofthewildstrawberryfragariavescausingextensiveilluminaandsmrtbasedrnaseqdatasets AT liuzhongchi genomereannotationofthewildstrawberryfragariavescausingextensiveilluminaandsmrtbasedrnaseqdatasets AT kangchunying genomereannotationofthewildstrawberryfragariavescausingextensiveilluminaandsmrtbasedrnaseqdatasets |