Cargando…

A critical comparison of technologies for a plant genome sequencing project

BACKGROUND: A high-quality genome sequence of any model organism is an essential starting point for genetic and other studies. Older clone-based methods are slow and expensive, whereas faster, cheaper short-read–only assemblies can be incomplete and highly fragmented, which minimizes their usefulnes...

Descripción completa

Detalles Bibliográficos
Autores principales: Paajanen, Pirita, Kettleborough, George, López-Girona, Elena, Giolai, Michael, Heavens, Darren, Baker, David, Lister, Ashleigh, Cugliandolo, Fiorella, Wilde, Gail, Hein, Ingo, Macaulay, Iain, Bryan, Glenn J, Clark, Matthew D
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6423373/
https://www.ncbi.nlm.nih.gov/pubmed/30624602
http://dx.doi.org/10.1093/gigascience/giy163
_version_ 1783404522488463360
author Paajanen, Pirita
Kettleborough, George
López-Girona, Elena
Giolai, Michael
Heavens, Darren
Baker, David
Lister, Ashleigh
Cugliandolo, Fiorella
Wilde, Gail
Hein, Ingo
Macaulay, Iain
Bryan, Glenn J
Clark, Matthew D
author_facet Paajanen, Pirita
Kettleborough, George
López-Girona, Elena
Giolai, Michael
Heavens, Darren
Baker, David
Lister, Ashleigh
Cugliandolo, Fiorella
Wilde, Gail
Hein, Ingo
Macaulay, Iain
Bryan, Glenn J
Clark, Matthew D
author_sort Paajanen, Pirita
collection PubMed
description BACKGROUND: A high-quality genome sequence of any model organism is an essential starting point for genetic and other studies. Older clone-based methods are slow and expensive, whereas faster, cheaper short-read–only assemblies can be incomplete and highly fragmented, which minimizes their usefulness. The last few years have seen the introduction of many new technologies for genome assembly. These new technologies and associated new algorithms are typically benchmarked on microbial genomes or, if they scale appropriately, on larger (e.g., human) genomes. However, plant genomes can be much more repetitive and larger than the human genome, and plant biochemistry often makes obtaining high-quality DNA that is free from contaminants difficult. Reflecting their challenging nature, we observe that plant genome assembly statistics are typically poorer than for vertebrates. RESULTS: Here, we compare Illumina short read, Pacific Biosciences long read, 10x Genomics linked reads, Dovetail Hi-C, and BioNano Genomics optical maps, singly and combined, in producing high-quality long-range genome assemblies of the potato species Solanum verrucosum. We benchmark the assemblies for completeness and accuracy, as well as DNA compute requirements and sequencing costs. CONCLUSIONS: The field of genome sequencing and assembly is reaching maturity, and the differences we observe between assemblies are surprisingly small. We expect that our results will be helpful to other genome projects, and that these datasets will be used in benchmarking by assembly algorithm developers.
format Online
Article
Text
id pubmed-6423373
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-64233732019-03-22 A critical comparison of technologies for a plant genome sequencing project Paajanen, Pirita Kettleborough, George López-Girona, Elena Giolai, Michael Heavens, Darren Baker, David Lister, Ashleigh Cugliandolo, Fiorella Wilde, Gail Hein, Ingo Macaulay, Iain Bryan, Glenn J Clark, Matthew D Gigascience Research BACKGROUND: A high-quality genome sequence of any model organism is an essential starting point for genetic and other studies. Older clone-based methods are slow and expensive, whereas faster, cheaper short-read–only assemblies can be incomplete and highly fragmented, which minimizes their usefulness. The last few years have seen the introduction of many new technologies for genome assembly. These new technologies and associated new algorithms are typically benchmarked on microbial genomes or, if they scale appropriately, on larger (e.g., human) genomes. However, plant genomes can be much more repetitive and larger than the human genome, and plant biochemistry often makes obtaining high-quality DNA that is free from contaminants difficult. Reflecting their challenging nature, we observe that plant genome assembly statistics are typically poorer than for vertebrates. RESULTS: Here, we compare Illumina short read, Pacific Biosciences long read, 10x Genomics linked reads, Dovetail Hi-C, and BioNano Genomics optical maps, singly and combined, in producing high-quality long-range genome assemblies of the potato species Solanum verrucosum. We benchmark the assemblies for completeness and accuracy, as well as DNA compute requirements and sequencing costs. CONCLUSIONS: The field of genome sequencing and assembly is reaching maturity, and the differences we observe between assemblies are surprisingly small. We expect that our results will be helpful to other genome projects, and that these datasets will be used in benchmarking by assembly algorithm developers. Oxford University Press 2019-01-09 /pmc/articles/PMC6423373/ /pubmed/30624602 http://dx.doi.org/10.1093/gigascience/giy163 Text en © The Author(s) 2019. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research
Paajanen, Pirita
Kettleborough, George
López-Girona, Elena
Giolai, Michael
Heavens, Darren
Baker, David
Lister, Ashleigh
Cugliandolo, Fiorella
Wilde, Gail
Hein, Ingo
Macaulay, Iain
Bryan, Glenn J
Clark, Matthew D
A critical comparison of technologies for a plant genome sequencing project
title A critical comparison of technologies for a plant genome sequencing project
title_full A critical comparison of technologies for a plant genome sequencing project
title_fullStr A critical comparison of technologies for a plant genome sequencing project
title_full_unstemmed A critical comparison of technologies for a plant genome sequencing project
title_short A critical comparison of technologies for a plant genome sequencing project
title_sort critical comparison of technologies for a plant genome sequencing project
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6423373/
https://www.ncbi.nlm.nih.gov/pubmed/30624602
http://dx.doi.org/10.1093/gigascience/giy163
work_keys_str_mv AT paajanenpirita acriticalcomparisonoftechnologiesforaplantgenomesequencingproject
AT kettleboroughgeorge acriticalcomparisonoftechnologiesforaplantgenomesequencingproject
AT lopezgironaelena acriticalcomparisonoftechnologiesforaplantgenomesequencingproject
AT giolaimichael acriticalcomparisonoftechnologiesforaplantgenomesequencingproject
AT heavensdarren acriticalcomparisonoftechnologiesforaplantgenomesequencingproject
AT bakerdavid acriticalcomparisonoftechnologiesforaplantgenomesequencingproject
AT listerashleigh acriticalcomparisonoftechnologiesforaplantgenomesequencingproject
AT cugliandolofiorella acriticalcomparisonoftechnologiesforaplantgenomesequencingproject
AT wildegail acriticalcomparisonoftechnologiesforaplantgenomesequencingproject
AT heiningo acriticalcomparisonoftechnologiesforaplantgenomesequencingproject
AT macaulayiain acriticalcomparisonoftechnologiesforaplantgenomesequencingproject
AT bryanglennj acriticalcomparisonoftechnologiesforaplantgenomesequencingproject
AT clarkmatthewd acriticalcomparisonoftechnologiesforaplantgenomesequencingproject
AT paajanenpirita criticalcomparisonoftechnologiesforaplantgenomesequencingproject
AT kettleboroughgeorge criticalcomparisonoftechnologiesforaplantgenomesequencingproject
AT lopezgironaelena criticalcomparisonoftechnologiesforaplantgenomesequencingproject
AT giolaimichael criticalcomparisonoftechnologiesforaplantgenomesequencingproject
AT heavensdarren criticalcomparisonoftechnologiesforaplantgenomesequencingproject
AT bakerdavid criticalcomparisonoftechnologiesforaplantgenomesequencingproject
AT listerashleigh criticalcomparisonoftechnologiesforaplantgenomesequencingproject
AT cugliandolofiorella criticalcomparisonoftechnologiesforaplantgenomesequencingproject
AT wildegail criticalcomparisonoftechnologiesforaplantgenomesequencingproject
AT heiningo criticalcomparisonoftechnologiesforaplantgenomesequencingproject
AT macaulayiain criticalcomparisonoftechnologiesforaplantgenomesequencingproject
AT bryanglennj criticalcomparisonoftechnologiesforaplantgenomesequencingproject
AT clarkmatthewd criticalcomparisonoftechnologiesforaplantgenomesequencingproject