Cargando…

Whole Genome Assembly of Human Papillomavirus by Nanopore Long-Read Sequencing

Human papillomavirus (HPV) is a causal agent for most cervical cancers. The physical status of the HPV genome in these cancers could be episomal, integrated, or both. HPV integration could serve as a biomarker for clinical diagnosis, treatment, and prognosis. Although whole-genome sequencing by next...

Descripción completa

Detalles Bibliográficos
Autores principales: Yang, Shuaibing, Zhao, Qianqian, Tang, Lihua, Chen, Zejia, Wu, Zhaoting, Li, Kaixin, Lin, Ruoru, Chen, Yang, Ou, Danlin, Zhou, Li, Xu, Jianzhen, Qin, Qingsong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8764290/
https://www.ncbi.nlm.nih.gov/pubmed/35058971
http://dx.doi.org/10.3389/fgene.2021.798608
_version_ 1784634131078971392
author Yang, Shuaibing
Zhao, Qianqian
Tang, Lihua
Chen, Zejia
Wu, Zhaoting
Li, Kaixin
Lin, Ruoru
Chen, Yang
Ou, Danlin
Zhou, Li
Xu, Jianzhen
Qin, Qingsong
author_facet Yang, Shuaibing
Zhao, Qianqian
Tang, Lihua
Chen, Zejia
Wu, Zhaoting
Li, Kaixin
Lin, Ruoru
Chen, Yang
Ou, Danlin
Zhou, Li
Xu, Jianzhen
Qin, Qingsong
author_sort Yang, Shuaibing
collection PubMed
description Human papillomavirus (HPV) is a causal agent for most cervical cancers. The physical status of the HPV genome in these cancers could be episomal, integrated, or both. HPV integration could serve as a biomarker for clinical diagnosis, treatment, and prognosis. Although whole-genome sequencing by next-generation sequencing (NGS) technologies, such as the Illumina sequencing platform, have been used for detecting integrated HPV genome in cervical cancer, it faces challenges of analyzing long repeats and translocated sequences. In contrast, Oxford nanopore sequencing technology can generate ultra-long reads, which could be a very useful tool for determining HPV genome sequence and its physical status in cervical cancer. As a proof of concept, in this study, we completed whole genome sequencing from a cervical cancer tissue and a CaSki cell line with Oxford Nanopore Technologies. From the cervical cancer tissue, a 7,894 bp-long HPV35 genomic sequence was assembled from 678 reads at 97-fold coverage of HPV genome, sharing 99.96% identity with the HPV sequence obtained by Sanger sequencing. A 7904 bp-long HPV16 genomic sequence was assembled from data generated from the CaSki cell line at 3857-fold coverage, sharing 99.99% identity with the reference genome (NCBI: U89348). Intriguingly, long reads generated by nanopore sequencing directly revealed chimeric cellular–viral sequences and concatemeric genomic sequences, leading to the discovery of 448 unique integration breakpoints in the CaSki cell line and 60 breakpoints in the cervical cancer sample. Taken together, nanopore sequencing is a unique tool to identify HPV sequences and would shed light on the physical status of HPV genome in its associated cancers.
format Online
Article
Text
id pubmed-8764290
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-87642902022-01-19 Whole Genome Assembly of Human Papillomavirus by Nanopore Long-Read Sequencing Yang, Shuaibing Zhao, Qianqian Tang, Lihua Chen, Zejia Wu, Zhaoting Li, Kaixin Lin, Ruoru Chen, Yang Ou, Danlin Zhou, Li Xu, Jianzhen Qin, Qingsong Front Genet Genetics Human papillomavirus (HPV) is a causal agent for most cervical cancers. The physical status of the HPV genome in these cancers could be episomal, integrated, or both. HPV integration could serve as a biomarker for clinical diagnosis, treatment, and prognosis. Although whole-genome sequencing by next-generation sequencing (NGS) technologies, such as the Illumina sequencing platform, have been used for detecting integrated HPV genome in cervical cancer, it faces challenges of analyzing long repeats and translocated sequences. In contrast, Oxford nanopore sequencing technology can generate ultra-long reads, which could be a very useful tool for determining HPV genome sequence and its physical status in cervical cancer. As a proof of concept, in this study, we completed whole genome sequencing from a cervical cancer tissue and a CaSki cell line with Oxford Nanopore Technologies. From the cervical cancer tissue, a 7,894 bp-long HPV35 genomic sequence was assembled from 678 reads at 97-fold coverage of HPV genome, sharing 99.96% identity with the HPV sequence obtained by Sanger sequencing. A 7904 bp-long HPV16 genomic sequence was assembled from data generated from the CaSki cell line at 3857-fold coverage, sharing 99.99% identity with the reference genome (NCBI: U89348). Intriguingly, long reads generated by nanopore sequencing directly revealed chimeric cellular–viral sequences and concatemeric genomic sequences, leading to the discovery of 448 unique integration breakpoints in the CaSki cell line and 60 breakpoints in the cervical cancer sample. Taken together, nanopore sequencing is a unique tool to identify HPV sequences and would shed light on the physical status of HPV genome in its associated cancers. Frontiers Media S.A. 2022-01-04 /pmc/articles/PMC8764290/ /pubmed/35058971 http://dx.doi.org/10.3389/fgene.2021.798608 Text en Copyright © 2022 Yang, Zhao, Tang, Chen, Wu, Li, Lin, Chen, Ou, Zhou, Xu and Qin. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Yang, Shuaibing
Zhao, Qianqian
Tang, Lihua
Chen, Zejia
Wu, Zhaoting
Li, Kaixin
Lin, Ruoru
Chen, Yang
Ou, Danlin
Zhou, Li
Xu, Jianzhen
Qin, Qingsong
Whole Genome Assembly of Human Papillomavirus by Nanopore Long-Read Sequencing
title Whole Genome Assembly of Human Papillomavirus by Nanopore Long-Read Sequencing
title_full Whole Genome Assembly of Human Papillomavirus by Nanopore Long-Read Sequencing
title_fullStr Whole Genome Assembly of Human Papillomavirus by Nanopore Long-Read Sequencing
title_full_unstemmed Whole Genome Assembly of Human Papillomavirus by Nanopore Long-Read Sequencing
title_short Whole Genome Assembly of Human Papillomavirus by Nanopore Long-Read Sequencing
title_sort whole genome assembly of human papillomavirus by nanopore long-read sequencing
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8764290/
https://www.ncbi.nlm.nih.gov/pubmed/35058971
http://dx.doi.org/10.3389/fgene.2021.798608
work_keys_str_mv AT yangshuaibing wholegenomeassemblyofhumanpapillomavirusbynanoporelongreadsequencing
AT zhaoqianqian wholegenomeassemblyofhumanpapillomavirusbynanoporelongreadsequencing
AT tanglihua wholegenomeassemblyofhumanpapillomavirusbynanoporelongreadsequencing
AT chenzejia wholegenomeassemblyofhumanpapillomavirusbynanoporelongreadsequencing
AT wuzhaoting wholegenomeassemblyofhumanpapillomavirusbynanoporelongreadsequencing
AT likaixin wholegenomeassemblyofhumanpapillomavirusbynanoporelongreadsequencing
AT linruoru wholegenomeassemblyofhumanpapillomavirusbynanoporelongreadsequencing
AT chenyang wholegenomeassemblyofhumanpapillomavirusbynanoporelongreadsequencing
AT oudanlin wholegenomeassemblyofhumanpapillomavirusbynanoporelongreadsequencing
AT zhouli wholegenomeassemblyofhumanpapillomavirusbynanoporelongreadsequencing
AT xujianzhen wholegenomeassemblyofhumanpapillomavirusbynanoporelongreadsequencing
AT qinqingsong wholegenomeassemblyofhumanpapillomavirusbynanoporelongreadsequencing