
Single-molecule sequencing and optical mapping yields an improved genome of woodland strawberry (Fragaria vesca) with chromosome-scale contiguity

Bibliographic Details
Main authors: Edger, Patrick P; VanBuren, Robert; Colle, Marivi; Poorten, Thomas J; Wai, Ching Man; Niederhuth, Chad E; Alger, Elizabeth I; Ou, Shujun; Acharya, Charlotte B; Wang, Jie; Callow, Pete; McKain, Michael R; Shi, Jinghua; Collier, Chad; Xiong, Zhiyong; Mower, Jeffrey P; Slovin, Janet P; Hytönen, Timo; Jiang, Ning; Childs, Kevin L; Knapp, Steven J
Format: Online Article Text
Language: English
Published: Oxford University Press 2017
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5801600/
https://www.ncbi.nlm.nih.gov/pubmed/29253147
http://dx.doi.org/10.1093/gigascience/gix124
_version_ 1783298377021128704
author Edger, Patrick P
VanBuren, Robert
Colle, Marivi
Poorten, Thomas J
Wai, Ching Man
Niederhuth, Chad E
Alger, Elizabeth I
Ou, Shujun
Acharya, Charlotte B
Wang, Jie
Callow, Pete
McKain, Michael R
Shi, Jinghua
Collier, Chad
Xiong, Zhiyong
Mower, Jeffrey P
Slovin, Janet P
Hytönen, Timo
Jiang, Ning
Childs, Kevin L
Knapp, Steven J
author_sort Edger, Patrick P
collection PubMed
description BACKGROUND: Although draft genomes are available for most agronomically important plant species, the majority are incomplete, highly fragmented, and often riddled with assembly and scaffolding errors. These assembly issues hinder advances in tool development for functional genomics and systems biology. FINDINGS: Here we utilized a robust, cost-effective approach to produce high-quality reference genomes. We report a near-complete genome of diploid woodland strawberry (Fragaria vesca) using single-molecule real-time sequencing from Pacific Biosciences (PacBio). This assembly has a contig N50 length of ∼7.9 million base pairs (Mb), representing a ∼300-fold improvement of the previous version. The vast majority (>99.8%) of the assembly was anchored to 7 pseudomolecules using 2 sets of optical maps from Bionano Genomics. We obtained ∼24.96 Mb of sequence not present in the previous version of the F. vesca genome and produced an improved annotation that includes 1496 new genes. Comparative syntenic analyses uncovered numerous, large-scale scaffolding errors present in each chromosome in the previously published version of the F. vesca genome. CONCLUSIONS: Our results highlight the need to improve existing short-read based reference genomes. Furthermore, we demonstrate how genome quality impacts commonly used analyses for addressing both fundamental and applied biological questions.
format Online
Article
Text
id pubmed-5801600
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-5801600 2018-02-23 Gigascience Data Note
Oxford University Press 2017-12-13 /pmc/articles/PMC5801600/ /pubmed/29253147 http://dx.doi.org/10.1093/gigascience/gix124 Text en © The Authors 2017. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
title Single-molecule sequencing and optical mapping yields an improved genome of woodland strawberry (Fragaria vesca) with chromosome-scale contiguity
topic Data Note
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5801600/
https://www.ncbi.nlm.nih.gov/pubmed/29253147
http://dx.doi.org/10.1093/gigascience/gix124