Cargando…

Nanopore sequencing reads improve assembly and gene annotation of the Parochlus steinenii genome

Parochlus steinenii is a winged midge from King George Island. It is cold-tolerant and endures the harsh Antarctic winter. Previously, we reported the genome of this midge, but the genome assembly with short reads had limited contig contiguity, which reduced the completeness of the genome assembly a...

Descripción completa

Detalles Bibliográficos
Autores principales: Shin, Seung Chul, Kim, Hyun, Lee, Jun Hyuck, Kim, Han-Woo, Park, Joonho, Choi, Beom-Soon, Lee, Sang-Choon, Kim, Ji Hee, Lee, Hyoungseok, Kim, Sanghee
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6434015/
https://www.ncbi.nlm.nih.gov/pubmed/30911035
http://dx.doi.org/10.1038/s41598-019-41549-8
_version_ 1783406394282606592
author Shin, Seung Chul
Kim, Hyun
Lee, Jun Hyuck
Kim, Han-Woo
Park, Joonho
Choi, Beom-Soon
Lee, Sang-Choon
Kim, Ji Hee
Lee, Hyoungseok
Kim, Sanghee
author_facet Shin, Seung Chul
Kim, Hyun
Lee, Jun Hyuck
Kim, Han-Woo
Park, Joonho
Choi, Beom-Soon
Lee, Sang-Choon
Kim, Ji Hee
Lee, Hyoungseok
Kim, Sanghee
author_sort Shin, Seung Chul
collection PubMed
description Parochlus steinenii is a winged midge from King George Island. It is cold-tolerant and endures the harsh Antarctic winter. Previously, we reported the genome of this midge, but the genome assembly with short reads had limited contig contiguity, which reduced the completeness of the genome assembly and the annotated gene sets. Recently, assembly contiguity has been increased using nanopore technology. A number of methods for enhancing the low base quality of the assembly have been reported, including long-read (e.g. Nanopolish) or short-read (e.g. Pilon) based methods. Based on these advances, we used nanopore technologies to upgrade the draft genome sequence of P. steinenii. The final assembled genome was 145,366,448 bases in length. The contig number decreased from 9,132 to 162, and the N50 contig size increased from 36,946 to 1,989,550 bases. The BUSCO completeness of the assembly increased from 87.8 to 98.7%. Improved assembly statistics helped predict more genes from the draft genome of P. steinenii. The completeness of the predicted gene model increased from 79.5 to 92.1%, but the numbers and types of the predicted repeats were similar to those observed in the short read assembly, with the exception of long interspersed nuclear elements. In the present study, we markedly improved the P. steinenii genome assembly statistics using nanopore sequencing, but found that genome polishing with high-quality reads was essential for improving genome annotation. The number of genes predicted and the lengths of the genes were greater than before, and nanopore technology readily improved genome information.
format Online
Article
Text
id pubmed-6434015
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-64340152019-04-02 Nanopore sequencing reads improve assembly and gene annotation of the Parochlus steinenii genome Shin, Seung Chul Kim, Hyun Lee, Jun Hyuck Kim, Han-Woo Park, Joonho Choi, Beom-Soon Lee, Sang-Choon Kim, Ji Hee Lee, Hyoungseok Kim, Sanghee Sci Rep Article Parochlus steinenii is a winged midge from King George Island. It is cold-tolerant and endures the harsh Antarctic winter. Previously, we reported the genome of this midge, but the genome assembly with short reads had limited contig contiguity, which reduced the completeness of the genome assembly and the annotated gene sets. Recently, assembly contiguity has been increased using nanopore technology. A number of methods for enhancing the low base quality of the assembly have been reported, including long-read (e.g. Nanopolish) or short-read (e.g. Pilon) based methods. Based on these advances, we used nanopore technologies to upgrade the draft genome sequence of P. steinenii. The final assembled genome was 145,366,448 bases in length. The contig number decreased from 9,132 to 162, and the N50 contig size increased from 36,946 to 1,989,550 bases. The BUSCO completeness of the assembly increased from 87.8 to 98.7%. Improved assembly statistics helped predict more genes from the draft genome of P. steinenii. The completeness of the predicted gene model increased from 79.5 to 92.1%, but the numbers and types of the predicted repeats were similar to those observed in the short read assembly, with the exception of long interspersed nuclear elements. In the present study, we markedly improved the P. steinenii genome assembly statistics using nanopore sequencing, but found that genome polishing with high-quality reads was essential for improving genome annotation. The number of genes predicted and the lengths of the genes were greater than before, and nanopore technology readily improved genome information. Nature Publishing Group UK 2019-03-25 /pmc/articles/PMC6434015/ /pubmed/30911035 http://dx.doi.org/10.1038/s41598-019-41549-8 Text en © The Author(s) 2019 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Shin, Seung Chul
Kim, Hyun
Lee, Jun Hyuck
Kim, Han-Woo
Park, Joonho
Choi, Beom-Soon
Lee, Sang-Choon
Kim, Ji Hee
Lee, Hyoungseok
Kim, Sanghee
Nanopore sequencing reads improve assembly and gene annotation of the Parochlus steinenii genome
title Nanopore sequencing reads improve assembly and gene annotation of the Parochlus steinenii genome
title_full Nanopore sequencing reads improve assembly and gene annotation of the Parochlus steinenii genome
title_fullStr Nanopore sequencing reads improve assembly and gene annotation of the Parochlus steinenii genome
title_full_unstemmed Nanopore sequencing reads improve assembly and gene annotation of the Parochlus steinenii genome
title_short Nanopore sequencing reads improve assembly and gene annotation of the Parochlus steinenii genome
title_sort nanopore sequencing reads improve assembly and gene annotation of the parochlus steinenii genome
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6434015/
https://www.ncbi.nlm.nih.gov/pubmed/30911035
http://dx.doi.org/10.1038/s41598-019-41549-8
work_keys_str_mv AT shinseungchul nanoporesequencingreadsimproveassemblyandgeneannotationoftheparochlussteineniigenome
AT kimhyun nanoporesequencingreadsimproveassemblyandgeneannotationoftheparochlussteineniigenome
AT leejunhyuck nanoporesequencingreadsimproveassemblyandgeneannotationoftheparochlussteineniigenome
AT kimhanwoo nanoporesequencingreadsimproveassemblyandgeneannotationoftheparochlussteineniigenome
AT parkjoonho nanoporesequencingreadsimproveassemblyandgeneannotationoftheparochlussteineniigenome
AT choibeomsoon nanoporesequencingreadsimproveassemblyandgeneannotationoftheparochlussteineniigenome
AT leesangchoon nanoporesequencingreadsimproveassemblyandgeneannotationoftheparochlussteineniigenome
AT kimjihee nanoporesequencingreadsimproveassemblyandgeneannotationoftheparochlussteineniigenome
AT leehyoungseok nanoporesequencingreadsimproveassemblyandgeneannotationoftheparochlussteineniigenome
AT kimsanghee nanoporesequencingreadsimproveassemblyandgeneannotationoftheparochlussteineniigenome