Cargando…

Draft Genome of the Rice Coral Montipora capitata Obtained from Linked-Read Sequencing

The rice coral, Montipora capitata, is widely distributed throughout the Indo-Pacific and comprises one of the most important reef-building species in the Hawaiian Islands. Here, we describe a de novo assembly of its genome based on a linked-read sequencing approach developed by 10x Genomics. The fi...

Descripción completa

Detalles Bibliográficos
Autores principales: Helmkampf, Martin, Bellinger, M Renee, Geib, Scott M, Sim, Sheina B, Takabayashi, Misaki
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6668484/
https://www.ncbi.nlm.nih.gov/pubmed/31243452
http://dx.doi.org/10.1093/gbe/evz135
_version_ 1783440227852877824
author Helmkampf, Martin
Bellinger, M Renee
Geib, Scott M
Sim, Sheina B
Takabayashi, Misaki
author_facet Helmkampf, Martin
Bellinger, M Renee
Geib, Scott M
Sim, Sheina B
Takabayashi, Misaki
author_sort Helmkampf, Martin
collection PubMed
description The rice coral, Montipora capitata, is widely distributed throughout the Indo-Pacific and comprises one of the most important reef-building species in the Hawaiian Islands. Here, we describe a de novo assembly of its genome based on a linked-read sequencing approach developed by 10x Genomics. The final draft assembly consisted of 27,870 scaffolds with a N50 size of 186 kb and contained a fairly complete set (81%) of metazoan benchmarking (BUSCO) genes. Based on haploid assembly size (615 Mb) and read k-mer profiles, we estimated the genome size to fall between 600 and 700 Mb, although the high fraction of repetitive sequence introduced considerable uncertainty. Repeat analysis indicated that 42% of the assembly consisted of interspersed, mostly unclassified repeats, and almost 3% tandem repeats. We also identified 36,691 protein-coding genes with a median coding sequence length of 807 bp, together spanning 7% of the assembly. The high repeat content and heterozygosity of the genome proved a challenging scenario for assembly, requiring additional steps to merge haplotypes and resulting in a higher than expected fragmentation at the scaffold level. Despite these challenges, the assembly turned out to be comparable in most quality measures to that of other available coral genomes while being considerably more cost-effective, especially with respect to long-read sequencing methods. Provided high-molecular-weight DNA is available, linked-read technology may thus serve as a valuable alternative capable of providing quality genome assemblies of nonmodel organisms.
format Online
Article
Text
id pubmed-6668484
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-66684842019-08-05 Draft Genome of the Rice Coral Montipora capitata Obtained from Linked-Read Sequencing Helmkampf, Martin Bellinger, M Renee Geib, Scott M Sim, Sheina B Takabayashi, Misaki Genome Biol Evol Genome Report The rice coral, Montipora capitata, is widely distributed throughout the Indo-Pacific and comprises one of the most important reef-building species in the Hawaiian Islands. Here, we describe a de novo assembly of its genome based on a linked-read sequencing approach developed by 10x Genomics. The final draft assembly consisted of 27,870 scaffolds with a N50 size of 186 kb and contained a fairly complete set (81%) of metazoan benchmarking (BUSCO) genes. Based on haploid assembly size (615 Mb) and read k-mer profiles, we estimated the genome size to fall between 600 and 700 Mb, although the high fraction of repetitive sequence introduced considerable uncertainty. Repeat analysis indicated that 42% of the assembly consisted of interspersed, mostly unclassified repeats, and almost 3% tandem repeats. We also identified 36,691 protein-coding genes with a median coding sequence length of 807 bp, together spanning 7% of the assembly. The high repeat content and heterozygosity of the genome proved a challenging scenario for assembly, requiring additional steps to merge haplotypes and resulting in a higher than expected fragmentation at the scaffold level. Despite these challenges, the assembly turned out to be comparable in most quality measures to that of other available coral genomes while being considerably more cost-effective, especially with respect to long-read sequencing methods. Provided high-molecular-weight DNA is available, linked-read technology may thus serve as a valuable alternative capable of providing quality genome assemblies of nonmodel organisms. Oxford University Press 2019-06-27 /pmc/articles/PMC6668484/ /pubmed/31243452 http://dx.doi.org/10.1093/gbe/evz135 Text en © The Author(s) 2019. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Genome Report
Helmkampf, Martin
Bellinger, M Renee
Geib, Scott M
Sim, Sheina B
Takabayashi, Misaki
Draft Genome of the Rice Coral Montipora capitata Obtained from Linked-Read Sequencing
title Draft Genome of the Rice Coral Montipora capitata Obtained from Linked-Read Sequencing
title_full Draft Genome of the Rice Coral Montipora capitata Obtained from Linked-Read Sequencing
title_fullStr Draft Genome of the Rice Coral Montipora capitata Obtained from Linked-Read Sequencing
title_full_unstemmed Draft Genome of the Rice Coral Montipora capitata Obtained from Linked-Read Sequencing
title_short Draft Genome of the Rice Coral Montipora capitata Obtained from Linked-Read Sequencing
title_sort draft genome of the rice coral montipora capitata obtained from linked-read sequencing
topic Genome Report
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6668484/
https://www.ncbi.nlm.nih.gov/pubmed/31243452
http://dx.doi.org/10.1093/gbe/evz135
work_keys_str_mv AT helmkampfmartin draftgenomeofthericecoralmontiporacapitataobtainedfromlinkedreadsequencing
AT bellingermrenee draftgenomeofthericecoralmontiporacapitataobtainedfromlinkedreadsequencing
AT geibscottm draftgenomeofthericecoralmontiporacapitataobtainedfromlinkedreadsequencing
AT simsheinab draftgenomeofthericecoralmontiporacapitataobtainedfromlinkedreadsequencing
AT takabayashimisaki draftgenomeofthericecoralmontiporacapitataobtainedfromlinkedreadsequencing