Single-molecule sequencing and optical mapping yields an improved genome of woodland strawberry (Fragaria vesca) with chromosome-scale contiguity
BACKGROUND: Although draft genomes are available for most agronomically important plant species, the majority are incomplete, highly fragmented, and often riddled with assembly and scaffolding errors. These assembly issues hinder advances in tool development for functional genomics and systems biolo...
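Main authors: | Edger, Patrick P; VanBuren, Robert; Colle, Marivi; Poorten, Thomas J; Wai, Ching Man; Niederhuth, Chad E; Alger, Elizabeth I; Ou, Shujun; Acharya, Charlotte B; Wang, Jie; Callow, Pete; McKain, Michael R; Shi, Jinghua; Collier, Chad; Xiong, Zhiyong; Mower, Jeffrey P; Slovin, Janet P; Hytönen, Timo; Jiang, Ning; Childs, Kevin L; Knapp, Steven J |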
---|---|
Format: | Online Article Text |
Language: | English |
Published: | Oxford University Press 2017 |
Subjects: | Data Note |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5801600/ https://www.ncbi.nlm.nih.gov/pubmed/29253147 http://dx.doi.org/10.1093/gigascience/gix124 |
_version_ | 1783298377021128704 |
---|---|
author | Edger, Patrick P VanBuren, Robert Colle, Marivi Poorten, Thomas J Wai, Ching Man Niederhuth, Chad E Alger, Elizabeth I Ou, Shujun Acharya, Charlotte B Wang, Jie Callow, Pete McKain, Michael R Shi, Jinghua Collier, Chad Xiong, Zhiyong Mower, Jeffrey P Slovin, Janet P Hytönen, Timo Jiang, Ning Childs, Kevin L Knapp, Steven J |
author_facet | Edger, Patrick P VanBuren, Robert Colle, Marivi Poorten, Thomas J Wai, Ching Man Niederhuth, Chad E Alger, Elizabeth I Ou, Shujun Acharya, Charlotte B Wang, Jie Callow, Pete McKain, Michael R Shi, Jinghua Collier, Chad Xiong, Zhiyong Mower, Jeffrey P Slovin, Janet P Hytönen, Timo Jiang, Ning Childs, Kevin L Knapp, Steven J |
author_sort | Edger, Patrick P |
collection | PubMed |
description | BACKGROUND: Although draft genomes are available for most agronomically important plant species, the majority are incomplete, highly fragmented, and often riddled with assembly and scaffolding errors. These assembly issues hinder advances in tool development for functional genomics and systems biology. FINDINGS: Here we utilized a robust, cost-effective approach to produce high-quality reference genomes. We report a near-complete genome of diploid woodland strawberry (Fragaria vesca) using single-molecule real-time sequencing from Pacific Biosciences (PacBio). This assembly has a contig N50 length of ∼7.9 million base pairs (Mb), representing a ∼300-fold improvement of the previous version. The vast majority (>99.8%) of the assembly was anchored to 7 pseudomolecules using 2 sets of optical maps from Bionano Genomics. We obtained ∼24.96 Mb of sequence not present in the previous version of the F. vesca genome and produced an improved annotation that includes 1496 new genes. Comparative syntenic analyses uncovered numerous, large-scale scaffolding errors present in each chromosome in the previously published version of the F. vesca genome. CONCLUSIONS: Our results highlight the need to improve existing short-read based reference genomes. Furthermore, we demonstrate how genome quality impacts commonly used analyses for addressing both fundamental and applied biological questions. |
format | Online Article Text |
id | pubmed-5801600 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-5801600 2018-02-23 Single-molecule sequencing and optical mapping yields an improved genome of woodland strawberry (Fragaria vesca) with chromosome-scale contiguity Edger, Patrick P VanBuren, Robert Colle, Marivi Poorten, Thomas J Wai, Ching Man Niederhuth, Chad E Alger, Elizabeth I Ou, Shujun Acharya, Charlotte B Wang, Jie Callow, Pete McKain, Michael R Shi, Jinghua Collier, Chad Xiong, Zhiyong Mower, Jeffrey P Slovin, Janet P Hytönen, Timo Jiang, Ning Childs, Kevin L Knapp, Steven J Gigascience Data Note BACKGROUND: Although draft genomes are available for most agronomically important plant species, the majority are incomplete, highly fragmented, and often riddled with assembly and scaffolding errors. These assembly issues hinder advances in tool development for functional genomics and systems biology. FINDINGS: Here we utilized a robust, cost-effective approach to produce high-quality reference genomes. We report a near-complete genome of diploid woodland strawberry (Fragaria vesca) using single-molecule real-time sequencing from Pacific Biosciences (PacBio). This assembly has a contig N50 length of ∼7.9 million base pairs (Mb), representing a ∼300-fold improvement of the previous version. The vast majority (>99.8%) of the assembly was anchored to 7 pseudomolecules using 2 sets of optical maps from Bionano Genomics. We obtained ∼24.96 Mb of sequence not present in the previous version of the F. vesca genome and produced an improved annotation that includes 1496 new genes. Comparative syntenic analyses uncovered numerous, large-scale scaffolding errors present in each chromosome in the previously published version of the F. vesca genome. CONCLUSIONS: Our results highlight the need to improve existing short-read based reference genomes. Furthermore, we demonstrate how genome quality impacts commonly used analyses for addressing both fundamental and applied biological questions.
Oxford University Press 2017-12-13 /pmc/articles/PMC5801600/ /pubmed/29253147 http://dx.doi.org/10.1093/gigascience/gix124 Text en © The Authors 2017. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Data Note Edger, Patrick P VanBuren, Robert Colle, Marivi Poorten, Thomas J Wai, Ching Man Niederhuth, Chad E Alger, Elizabeth I Ou, Shujun Acharya, Charlotte B Wang, Jie Callow, Pete McKain, Michael R Shi, Jinghua Collier, Chad Xiong, Zhiyong Mower, Jeffrey P Slovin, Janet P Hytönen, Timo Jiang, Ning Childs, Kevin L Knapp, Steven J Single-molecule sequencing and optical mapping yields an improved genome of woodland strawberry (Fragaria vesca) with chromosome-scale contiguity |
title | Single-molecule sequencing and optical mapping yields an improved genome of woodland strawberry (Fragaria vesca) with chromosome-scale contiguity |
title_full | Single-molecule sequencing and optical mapping yields an improved genome of woodland strawberry (Fragaria vesca) with chromosome-scale contiguity |
title_fullStr | Single-molecule sequencing and optical mapping yields an improved genome of woodland strawberry (Fragaria vesca) with chromosome-scale contiguity |
title_full_unstemmed | Single-molecule sequencing and optical mapping yields an improved genome of woodland strawberry (Fragaria vesca) with chromosome-scale contiguity |
title_short | Single-molecule sequencing and optical mapping yields an improved genome of woodland strawberry (Fragaria vesca) with chromosome-scale contiguity |
title_sort | single-molecule sequencing and optical mapping yields an improved genome of woodland strawberry (fragaria vesca) with chromosome-scale contiguity |
topic | Data Note |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5801600/ https://www.ncbi.nlm.nih.gov/pubmed/29253147 http://dx.doi.org/10.1093/gigascience/gix124 |