Cargando…

Chromosome-level de novo genome assemblies of over 100 plant species

Genome sequence analysis in higher plants began with the whole-genome sequencing of Arabidopsis thaliana. Owing to the great advances in sequencing technologies, also known as next-generation sequencing (NGS) technologies, genomes of more than 400 plant species have been sequenced to date. Long-read...

Descripción completa

Detalles Bibliográficos
Autores principales: Shirasawa, Kenta, Harada, Daijiro, Hirakawa, Hideki, Isobe, Sachiko, Kole, Chittaranjan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Japanese Society of Breeding 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8329882/
https://www.ncbi.nlm.nih.gov/pubmed/34377059
http://dx.doi.org/10.1270/jsbbs.20146
_version_ 1783732587052662784
author Shirasawa, Kenta
Harada, Daijiro
Hirakawa, Hideki
Isobe, Sachiko
Kole, Chittaranjan
author_facet Shirasawa, Kenta
Harada, Daijiro
Hirakawa, Hideki
Isobe, Sachiko
Kole, Chittaranjan
author_sort Shirasawa, Kenta
collection PubMed
description Genome sequence analysis in higher plants began with the whole-genome sequencing of Arabidopsis thaliana. Owing to the great advances in sequencing technologies, also known as next-generation sequencing (NGS) technologies, genomes of more than 400 plant species have been sequenced to date. Long-read sequencing technologies, together with sequence scaffolding methods, have enabled the synthesis of chromosome-level de novo genome sequence assemblies, which has further allowed comparative analysis of the structural features of multiple plant genomes, thus elucidating the evolutionary history of plants. However, the quality of the assembled chromosome-level sequences varies among plant species. In this review, we summarize the status of chromosome-level assemblies of 114 plant species, with genome sizes ranging from 125 Mb to 16.9 Gb. While the average genome coverage of the assembled sequences reached up to 89.1%, the average coverage of chromosome-level pseudomolecules was 73.3%. Thus, further improvements in sequencing technologies and scaffolding, and data analysis methods, are required to establish gap-free telomere-to-telomere genome sequence assemblies. With the forthcoming new technologies, we are going to enter into a new genomics era where pan-genomics and the >1,000 or >1 million genomes’ project will be routine in higher plants.
format Online
Article
Text
id pubmed-8329882
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Japanese Society of Breeding
record_format MEDLINE/PubMed
spelling pubmed-83298822021-08-09 Chromosome-level de novo genome assemblies of over 100 plant species Shirasawa, Kenta Harada, Daijiro Hirakawa, Hideki Isobe, Sachiko Kole, Chittaranjan Breed Sci Invited Review Genome sequence analysis in higher plants began with the whole-genome sequencing of Arabidopsis thaliana. Owing to the great advances in sequencing technologies, also known as next-generation sequencing (NGS) technologies, genomes of more than 400 plant species have been sequenced to date. Long-read sequencing technologies, together with sequence scaffolding methods, have enabled the synthesis of chromosome-level de novo genome sequence assemblies, which has further allowed comparative analysis of the structural features of multiple plant genomes, thus elucidating the evolutionary history of plants. However, the quality of the assembled chromosome-level sequences varies among plant species. In this review, we summarize the status of chromosome-level assemblies of 114 plant species, with genome sizes ranging from 125 Mb to 16.9 Gb. While the average genome coverage of the assembled sequences reached up to 89.1%, the average coverage of chromosome-level pseudomolecules was 73.3%. Thus, further improvements in sequencing technologies and scaffolding, and data analysis methods, are required to establish gap-free telomere-to-telomere genome sequence assemblies. With the forthcoming new technologies, we are going to enter into a new genomics era where pan-genomics and the >1,000 or >1 million genomes’ project will be routine in higher plants. Japanese Society of Breeding 2021-04 2021-04-06 /pmc/articles/PMC8329882/ /pubmed/34377059 http://dx.doi.org/10.1270/jsbbs.20146 Text en Copyright © 2021 by JAPANESE SOCIETY OF BREEDING https://creativecommons.org/licenses/by-nc-nd/3.0/This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Invited Review
Shirasawa, Kenta
Harada, Daijiro
Hirakawa, Hideki
Isobe, Sachiko
Kole, Chittaranjan
Chromosome-level de novo genome assemblies of over 100 plant species
title Chromosome-level de novo genome assemblies of over 100 plant species
title_full Chromosome-level de novo genome assemblies of over 100 plant species
title_fullStr Chromosome-level de novo genome assemblies of over 100 plant species
title_full_unstemmed Chromosome-level de novo genome assemblies of over 100 plant species
title_short Chromosome-level de novo genome assemblies of over 100 plant species
title_sort chromosome-level de novo genome assemblies of over 100 plant species
topic Invited Review
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8329882/
https://www.ncbi.nlm.nih.gov/pubmed/34377059
http://dx.doi.org/10.1270/jsbbs.20146
work_keys_str_mv AT shirasawakenta chromosomeleveldenovogenomeassembliesofover100plantspecies
AT haradadaijiro chromosomeleveldenovogenomeassembliesofover100plantspecies
AT hirakawahideki chromosomeleveldenovogenomeassembliesofover100plantspecies
AT isobesachiko chromosomeleveldenovogenomeassembliesofover100plantspecies
AT kolechittaranjan chromosomeleveldenovogenomeassembliesofover100plantspecies