Chromosomal-level assembly of the blood clam, Scapharca (Anadara) broughtonii, using long sequence reads and Hi-C

BACKGROUND: The blood clam, Scapharca (Anadara) broughtonii, is an economically and ecologically important marine bivalve of the family Arcidae. Efforts to study their population genetics, breeding, cultivation, and stock enrichment have been somewhat hindered by the lack of a reference genome. Here...

Bibliographic Details
Main authors: Bai, Chang-Ming, Xin, Lu-Sheng, Rosani, Umberto, Wu, Biao, Wang, Qing-Chen, Duan, Xiao-Ke, Liu, Zhi-Hong, Wang, Chong-Ming
Format: Online Article Text
Language: English
Published: Oxford University Press 2019
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6615981/
https://www.ncbi.nlm.nih.gov/pubmed/31289832
http://dx.doi.org/10.1093/gigascience/giz067
author Bai, Chang-Ming
Xin, Lu-Sheng
Rosani, Umberto
Wu, Biao
Wang, Qing-Chen
Duan, Xiao-Ke
Liu, Zhi-Hong
Wang, Chong-Ming
author_sort Bai, Chang-Ming
collection PubMed
description BACKGROUND: The blood clam, Scapharca (Anadara) broughtonii, is an economically and ecologically important marine bivalve of the family Arcidae. Efforts to study their population genetics, breeding, cultivation, and stock enrichment have been somewhat hindered by the lack of a reference genome. Herein, we report the complete genome sequence of S. broughtonii, a first reference genome of the family Arcidae. FINDINGS: A total of 75.79 Gb clean data were generated with the Pacific Biosciences and Oxford Nanopore platforms, which represented approximately 86× coverage of the S. broughtonii genome. De novo assembly of these long reads resulted in an 884.5-Mb genome, with a contig N50 of 1.80 Mb and scaffold N50 of 45.00 Mb. Genome Hi-C scaffolding resulted in 19 chromosomes containing 99.35% of bases in the assembled genome. Genome annotation revealed that nearly half of the genome (46.1%) is composed of repeated sequences, while 24,045 protein-coding genes were predicted and 84.7% of them were annotated. CONCLUSIONS: We report here a chromosomal-level assembly of the S. broughtonii genome based on long-read sequencing and Hi-C scaffolding. The genomic data can serve as a reference for the family Arcidae and will provide a valuable resource for the scientific community and aquaculture sector.
format Online
Article
Text
id pubmed-6615981
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-6615981 2019-07-15 Gigascience Data Note Oxford University Press 2019-07-09 /pmc/articles/PMC6615981/ /pubmed/31289832 http://dx.doi.org/10.1093/gigascience/giz067 Text en © The Author(s) 2019. Published by Oxford University Press.
http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
title Chromosomal-level assembly of the blood clam, Scapharca (Anadara) broughtonii, using long sequence reads and Hi-C
topic Data Note