Recompleting the Caenorhabditis elegans genome

Caenorhabditis elegans was the first multicellular eukaryotic genome sequenced to apparent completion. Although this assembly employed a standard C. elegans strain (N2), it used sequence data from several laboratories, with DNA propagated in bacteria and yeast. Thus, the N2 assembly has many differe...

Bibliographic Details
Main Authors: Yoshimura, Jun; Ichikawa, Kazuki; Shoura, Massa J.; Artiles, Karen L.; Gabdank, Idan; Wahba, Lamia; Smith, Cheryl L.; Edgley, Mark L.; Rougvie, Ann E.; Fire, Andrew Z.; Morishita, Shinichi; Schwarz, Erich M.
Format: Online Article Text
Language: English
Published: Cold Spring Harbor Laboratory Press 2019
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6581061/
https://www.ncbi.nlm.nih.gov/pubmed/31123080
http://dx.doi.org/10.1101/gr.244830.118
_version_ 1783428126155472896
author Yoshimura, Jun
Ichikawa, Kazuki
Shoura, Massa J.
Artiles, Karen L.
Gabdank, Idan
Wahba, Lamia
Smith, Cheryl L.
Edgley, Mark L.
Rougvie, Ann E.
Fire, Andrew Z.
Morishita, Shinichi
Schwarz, Erich M.
author_sort Yoshimura, Jun
collection PubMed
description Caenorhabditis elegans was the first multicellular eukaryotic genome sequenced to apparent completion. Although this assembly employed a standard C. elegans strain (N2), it used sequence data from several laboratories, with DNA propagated in bacteria and yeast. Thus, the N2 assembly has many differences from any C. elegans available today. To provide a more accurate C. elegans genome, we performed long-read assembly of VC2010, a modern strain derived from N2. Our VC2010 assembly has 99.98% identity to N2 but with an additional 1.8 Mb including tandem repeat expansions and genome duplications. For 116 structural discrepancies between N2 and VC2010, 97 structures matching VC2010 (84%) were also found in two outgroup strains, implying deficiencies in N2. Over 98% of N2 genes encoded unchanged products in VC2010; moreover, we predicted ≥53 new genes in VC2010. The recompleted genome of C. elegans should be a valuable resource for genetics, genomics, and systems biology.
format Online
Article
Text
id pubmed-6581061
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Cold Spring Harbor Laboratory Press
record_format MEDLINE/PubMed
spelling pubmed-6581061 2019-07-02
Recompleting the Caenorhabditis elegans genome
Yoshimura, Jun; Ichikawa, Kazuki; Shoura, Massa J.; Artiles, Karen L.; Gabdank, Idan; Wahba, Lamia; Smith, Cheryl L.; Edgley, Mark L.; Rougvie, Ann E.; Fire, Andrew Z.; Morishita, Shinichi; Schwarz, Erich M.
Genome Res
Resource
Caenorhabditis elegans was the first multicellular eukaryotic genome sequenced to apparent completion. Although this assembly employed a standard C. elegans strain (N2), it used sequence data from several laboratories, with DNA propagated in bacteria and yeast. Thus, the N2 assembly has many differences from any C. elegans available today. To provide a more accurate C. elegans genome, we performed long-read assembly of VC2010, a modern strain derived from N2. Our VC2010 assembly has 99.98% identity to N2 but with an additional 1.8 Mb including tandem repeat expansions and genome duplications. For 116 structural discrepancies between N2 and VC2010, 97 structures matching VC2010 (84%) were also found in two outgroup strains, implying deficiencies in N2. Over 98% of N2 genes encoded unchanged products in VC2010; moreover, we predicted ≥53 new genes in VC2010. The recompleted genome of C. elegans should be a valuable resource for genetics, genomics, and systems biology.
Cold Spring Harbor Laboratory Press
2019-06
/pmc/articles/PMC6581061/
/pubmed/31123080
http://dx.doi.org/10.1101/gr.244830.118
Text
en
© 2019 Yoshimura et al.; Published by Cold Spring Harbor Laboratory Press
http://creativecommons.org/licenses/by-nc/4.0/
This article, published in Genome Research, is available under a Creative Commons License (Attribution-NonCommercial 4.0 International), as described at http://creativecommons.org/licenses/by-nc/4.0/.
title Recompleting the Caenorhabditis elegans genome
topic Resource
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6581061/
https://www.ncbi.nlm.nih.gov/pubmed/31123080
http://dx.doi.org/10.1101/gr.244830.118