Cargando…
Whole Genome Assembly of Human Papillomavirus by Nanopore Long-Read Sequencing
Human papillomavirus (HPV) is a causal agent for most cervical cancers. The physical status of the HPV genome in these cancers could be episomal, integrated, or both. HPV integration could serve as a biomarker for clinical diagnosis, treatment, and prognosis. Although whole-genome sequencing by next...
Autores principales: | , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8764290/ https://www.ncbi.nlm.nih.gov/pubmed/35058971 http://dx.doi.org/10.3389/fgene.2021.798608 |
_version_ | 1784634131078971392 |
---|---|
author | Yang, Shuaibing Zhao, Qianqian Tang, Lihua Chen, Zejia Wu, Zhaoting Li, Kaixin Lin, Ruoru Chen, Yang Ou, Danlin Zhou, Li Xu, Jianzhen Qin, Qingsong |
author_facet | Yang, Shuaibing Zhao, Qianqian Tang, Lihua Chen, Zejia Wu, Zhaoting Li, Kaixin Lin, Ruoru Chen, Yang Ou, Danlin Zhou, Li Xu, Jianzhen Qin, Qingsong |
author_sort | Yang, Shuaibing |
collection | PubMed |
description | Human papillomavirus (HPV) is a causal agent for most cervical cancers. The physical status of the HPV genome in these cancers could be episomal, integrated, or both. HPV integration could serve as a biomarker for clinical diagnosis, treatment, and prognosis. Although whole-genome sequencing by next-generation sequencing (NGS) technologies, such as the Illumina sequencing platform, have been used for detecting integrated HPV genome in cervical cancer, it faces challenges of analyzing long repeats and translocated sequences. In contrast, Oxford nanopore sequencing technology can generate ultra-long reads, which could be a very useful tool for determining HPV genome sequence and its physical status in cervical cancer. As a proof of concept, in this study, we completed whole genome sequencing from a cervical cancer tissue and a CaSki cell line with Oxford Nanopore Technologies. From the cervical cancer tissue, a 7,894 bp-long HPV35 genomic sequence was assembled from 678 reads at 97-fold coverage of HPV genome, sharing 99.96% identity with the HPV sequence obtained by Sanger sequencing. A 7904 bp-long HPV16 genomic sequence was assembled from data generated from the CaSki cell line at 3857-fold coverage, sharing 99.99% identity with the reference genome (NCBI: U89348). Intriguingly, long reads generated by nanopore sequencing directly revealed chimeric cellular–viral sequences and concatemeric genomic sequences, leading to the discovery of 448 unique integration breakpoints in the CaSki cell line and 60 breakpoints in the cervical cancer sample. Taken together, nanopore sequencing is a unique tool to identify HPV sequences and would shed light on the physical status of HPV genome in its associated cancers. |
format | Online Article Text |
id | pubmed-8764290 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-87642902022-01-19 Whole Genome Assembly of Human Papillomavirus by Nanopore Long-Read Sequencing Yang, Shuaibing Zhao, Qianqian Tang, Lihua Chen, Zejia Wu, Zhaoting Li, Kaixin Lin, Ruoru Chen, Yang Ou, Danlin Zhou, Li Xu, Jianzhen Qin, Qingsong Front Genet Genetics Human papillomavirus (HPV) is a causal agent for most cervical cancers. The physical status of the HPV genome in these cancers could be episomal, integrated, or both. HPV integration could serve as a biomarker for clinical diagnosis, treatment, and prognosis. Although whole-genome sequencing by next-generation sequencing (NGS) technologies, such as the Illumina sequencing platform, have been used for detecting integrated HPV genome in cervical cancer, it faces challenges of analyzing long repeats and translocated sequences. In contrast, Oxford nanopore sequencing technology can generate ultra-long reads, which could be a very useful tool for determining HPV genome sequence and its physical status in cervical cancer. As a proof of concept, in this study, we completed whole genome sequencing from a cervical cancer tissue and a CaSki cell line with Oxford Nanopore Technologies. From the cervical cancer tissue, a 7,894 bp-long HPV35 genomic sequence was assembled from 678 reads at 97-fold coverage of HPV genome, sharing 99.96% identity with the HPV sequence obtained by Sanger sequencing. A 7904 bp-long HPV16 genomic sequence was assembled from data generated from the CaSki cell line at 3857-fold coverage, sharing 99.99% identity with the reference genome (NCBI: U89348). Intriguingly, long reads generated by nanopore sequencing directly revealed chimeric cellular–viral sequences and concatemeric genomic sequences, leading to the discovery of 448 unique integration breakpoints in the CaSki cell line and 60 breakpoints in the cervical cancer sample. Taken together, nanopore sequencing is a unique tool to identify HPV sequences and would shed light on the physical status of HPV genome in its associated cancers. Frontiers Media S.A. 2022-01-04 /pmc/articles/PMC8764290/ /pubmed/35058971 http://dx.doi.org/10.3389/fgene.2021.798608 Text en Copyright © 2022 Yang, Zhao, Tang, Chen, Wu, Li, Lin, Chen, Ou, Zhou, Xu and Qin. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Genetics Yang, Shuaibing Zhao, Qianqian Tang, Lihua Chen, Zejia Wu, Zhaoting Li, Kaixin Lin, Ruoru Chen, Yang Ou, Danlin Zhou, Li Xu, Jianzhen Qin, Qingsong Whole Genome Assembly of Human Papillomavirus by Nanopore Long-Read Sequencing |
title | Whole Genome Assembly of Human Papillomavirus by Nanopore Long-Read Sequencing |
title_full | Whole Genome Assembly of Human Papillomavirus by Nanopore Long-Read Sequencing |
title_fullStr | Whole Genome Assembly of Human Papillomavirus by Nanopore Long-Read Sequencing |
title_full_unstemmed | Whole Genome Assembly of Human Papillomavirus by Nanopore Long-Read Sequencing |
title_short | Whole Genome Assembly of Human Papillomavirus by Nanopore Long-Read Sequencing |
title_sort | whole genome assembly of human papillomavirus by nanopore long-read sequencing |
topic | Genetics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8764290/ https://www.ncbi.nlm.nih.gov/pubmed/35058971 http://dx.doi.org/10.3389/fgene.2021.798608 |
work_keys_str_mv | AT yangshuaibing wholegenomeassemblyofhumanpapillomavirusbynanoporelongreadsequencing AT zhaoqianqian wholegenomeassemblyofhumanpapillomavirusbynanoporelongreadsequencing AT tanglihua wholegenomeassemblyofhumanpapillomavirusbynanoporelongreadsequencing AT chenzejia wholegenomeassemblyofhumanpapillomavirusbynanoporelongreadsequencing AT wuzhaoting wholegenomeassemblyofhumanpapillomavirusbynanoporelongreadsequencing AT likaixin wholegenomeassemblyofhumanpapillomavirusbynanoporelongreadsequencing AT linruoru wholegenomeassemblyofhumanpapillomavirusbynanoporelongreadsequencing AT chenyang wholegenomeassemblyofhumanpapillomavirusbynanoporelongreadsequencing AT oudanlin wholegenomeassemblyofhumanpapillomavirusbynanoporelongreadsequencing AT zhouli wholegenomeassemblyofhumanpapillomavirusbynanoporelongreadsequencing AT xujianzhen wholegenomeassemblyofhumanpapillomavirusbynanoporelongreadsequencing AT qinqingsong wholegenomeassemblyofhumanpapillomavirusbynanoporelongreadsequencing |