Cargando…
Draft Genome of the Rice Coral Montipora capitata Obtained from Linked-Read Sequencing
The rice coral, Montipora capitata, is widely distributed throughout the Indo-Pacific and comprises one of the most important reef-building species in the Hawaiian Islands. Here, we describe a de novo assembly of its genome based on a linked-read sequencing approach developed by 10x Genomics. The fi...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6668484/ https://www.ncbi.nlm.nih.gov/pubmed/31243452 http://dx.doi.org/10.1093/gbe/evz135 |
_version_ | 1783440227852877824 |
---|---|
author | Helmkampf, Martin Bellinger, M Renee Geib, Scott M Sim, Sheina B Takabayashi, Misaki |
author_facet | Helmkampf, Martin Bellinger, M Renee Geib, Scott M Sim, Sheina B Takabayashi, Misaki |
author_sort | Helmkampf, Martin |
collection | PubMed |
description | The rice coral, Montipora capitata, is widely distributed throughout the Indo-Pacific and comprises one of the most important reef-building species in the Hawaiian Islands. Here, we describe a de novo assembly of its genome based on a linked-read sequencing approach developed by 10x Genomics. The final draft assembly consisted of 27,870 scaffolds with a N50 size of 186 kb and contained a fairly complete set (81%) of metazoan benchmarking (BUSCO) genes. Based on haploid assembly size (615 Mb) and read k-mer profiles, we estimated the genome size to fall between 600 and 700 Mb, although the high fraction of repetitive sequence introduced considerable uncertainty. Repeat analysis indicated that 42% of the assembly consisted of interspersed, mostly unclassified repeats, and almost 3% tandem repeats. We also identified 36,691 protein-coding genes with a median coding sequence length of 807 bp, together spanning 7% of the assembly. The high repeat content and heterozygosity of the genome proved a challenging scenario for assembly, requiring additional steps to merge haplotypes and resulting in a higher than expected fragmentation at the scaffold level. Despite these challenges, the assembly turned out to be comparable in most quality measures to that of other available coral genomes while being considerably more cost-effective, especially with respect to long-read sequencing methods. Provided high-molecular-weight DNA is available, linked-read technology may thus serve as a valuable alternative capable of providing quality genome assemblies of nonmodel organisms. |
format | Online Article Text |
id | pubmed-6668484 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-66684842019-08-05 Draft Genome of the Rice Coral Montipora capitata Obtained from Linked-Read Sequencing Helmkampf, Martin Bellinger, M Renee Geib, Scott M Sim, Sheina B Takabayashi, Misaki Genome Biol Evol Genome Report The rice coral, Montipora capitata, is widely distributed throughout the Indo-Pacific and comprises one of the most important reef-building species in the Hawaiian Islands. Here, we describe a de novo assembly of its genome based on a linked-read sequencing approach developed by 10x Genomics. The final draft assembly consisted of 27,870 scaffolds with a N50 size of 186 kb and contained a fairly complete set (81%) of metazoan benchmarking (BUSCO) genes. Based on haploid assembly size (615 Mb) and read k-mer profiles, we estimated the genome size to fall between 600 and 700 Mb, although the high fraction of repetitive sequence introduced considerable uncertainty. Repeat analysis indicated that 42% of the assembly consisted of interspersed, mostly unclassified repeats, and almost 3% tandem repeats. We also identified 36,691 protein-coding genes with a median coding sequence length of 807 bp, together spanning 7% of the assembly. The high repeat content and heterozygosity of the genome proved a challenging scenario for assembly, requiring additional steps to merge haplotypes and resulting in a higher than expected fragmentation at the scaffold level. Despite these challenges, the assembly turned out to be comparable in most quality measures to that of other available coral genomes while being considerably more cost-effective, especially with respect to long-read sequencing methods. Provided high-molecular-weight DNA is available, linked-read technology may thus serve as a valuable alternative capable of providing quality genome assemblies of nonmodel organisms. Oxford University Press 2019-06-27 /pmc/articles/PMC6668484/ /pubmed/31243452 http://dx.doi.org/10.1093/gbe/evz135 Text en © The Author(s) 2019. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Genome Report Helmkampf, Martin Bellinger, M Renee Geib, Scott M Sim, Sheina B Takabayashi, Misaki Draft Genome of the Rice Coral Montipora capitata Obtained from Linked-Read Sequencing |
title | Draft Genome of the Rice Coral Montipora capitata Obtained from Linked-Read Sequencing |
title_full | Draft Genome of the Rice Coral Montipora capitata Obtained from Linked-Read Sequencing |
title_fullStr | Draft Genome of the Rice Coral Montipora capitata Obtained from Linked-Read Sequencing |
title_full_unstemmed | Draft Genome of the Rice Coral Montipora capitata Obtained from Linked-Read Sequencing |
title_short | Draft Genome of the Rice Coral Montipora capitata Obtained from Linked-Read Sequencing |
title_sort | draft genome of the rice coral montipora capitata obtained from linked-read sequencing |
topic | Genome Report |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6668484/ https://www.ncbi.nlm.nih.gov/pubmed/31243452 http://dx.doi.org/10.1093/gbe/evz135 |
work_keys_str_mv | AT helmkampfmartin draftgenomeofthericecoralmontiporacapitataobtainedfromlinkedreadsequencing AT bellingermrenee draftgenomeofthericecoralmontiporacapitataobtainedfromlinkedreadsequencing AT geibscottm draftgenomeofthericecoralmontiporacapitataobtainedfromlinkedreadsequencing AT simsheinab draftgenomeofthericecoralmontiporacapitataobtainedfromlinkedreadsequencing AT takabayashimisaki draftgenomeofthericecoralmontiporacapitataobtainedfromlinkedreadsequencing |