Cargando…

Sequence Imputation of HPV16 Genomes for Genetic Association Studies

BACKGROUND: Human Papillomavirus type 16 (HPV16) causes over half of all cervical cancer and some HPV16 variants are more oncogenic than others. The genetic basis for the extraordinary oncogenic properties of HPV16 compared to other HPVs is unknown. In addition, we neither know which nucleotides var...

Descripción completa

Detalles Bibliográficos
Autores principales: Smith, Benjamin, Chen, Zigui, Reimers, Laura, van Doorslaer, Koenraad, Schiffman, Mark, DeSalle, Rob, Herrero, Rolando, Yu, Kai, Wacholder, Sholom, Wang, Tao, Burk, Robert D.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3121793/
https://www.ncbi.nlm.nih.gov/pubmed/21731721
http://dx.doi.org/10.1371/journal.pone.0021375
_version_ 1782206872462295040
author Smith, Benjamin
Chen, Zigui
Reimers, Laura
van Doorslaer, Koenraad
Schiffman, Mark
DeSalle, Rob
Herrero, Rolando
Yu, Kai
Wacholder, Sholom
Wang, Tao
Burk, Robert D.
author_facet Smith, Benjamin
Chen, Zigui
Reimers, Laura
van Doorslaer, Koenraad
Schiffman, Mark
DeSalle, Rob
Herrero, Rolando
Yu, Kai
Wacholder, Sholom
Wang, Tao
Burk, Robert D.
author_sort Smith, Benjamin
collection PubMed
description BACKGROUND: Human Papillomavirus type 16 (HPV16) causes over half of all cervical cancer and some HPV16 variants are more oncogenic than others. The genetic basis for the extraordinary oncogenic properties of HPV16 compared to other HPVs is unknown. In addition, we neither know which nucleotides vary across and within HPV types and lineages, nor which of the single nucleotide polymorphisms (SNPs) determine oncogenicity. METHODS: A reference set of 62 HPV16 complete genome sequences was established and used to examine patterns of evolutionary relatedness amongst variants using a pairwise identity heatmap and HPV16 phylogeny. A BLAST-based algorithm was developed to impute complete genome data from partial sequence information using the reference database. To interrogate the oncogenic risk of determined and imputed HPV16 SNPs, odds-ratios for each SNP were calculated in a case-control viral genome-wide association study (VWAS) using biopsy confirmed high-grade cervix neoplasia and self-limited HPV16 infections from Guanacaste, Costa Rica. RESULTS: HPV16 variants display evolutionarily stable lineages that contain conserved diagnostic SNPs. The imputation algorithm indicated that an average of 97.5±1.03% of SNPs could be accurately imputed. The VWAS revealed specific HPV16 viral SNPs associated with variant lineages and elevated odds ratios; however, individual causal SNPs could not be distinguished with certainty due to the nature of HPV evolution. CONCLUSIONS: Conserved and lineage-specific SNPs can be imputed with a high degree of accuracy from limited viral polymorphic data due to the lack of recombination and the stochastic mechanism of variation accumulation in the HPV genome. However, to determine the role of novel variants or non-lineage-specific SNPs by VWAS will require direct sequence analysis. The investigation of patterns of genetic variation and the identification of diagnostic SNPs for lineages of HPV16 variants provides a valuable resource for future studies of HPV16 pathogenicity.
format Online
Article
Text
id pubmed-3121793
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-31217932011-06-30 Sequence Imputation of HPV16 Genomes for Genetic Association Studies Smith, Benjamin Chen, Zigui Reimers, Laura van Doorslaer, Koenraad Schiffman, Mark DeSalle, Rob Herrero, Rolando Yu, Kai Wacholder, Sholom Wang, Tao Burk, Robert D. PLoS One Research Article BACKGROUND: Human Papillomavirus type 16 (HPV16) causes over half of all cervical cancer and some HPV16 variants are more oncogenic than others. The genetic basis for the extraordinary oncogenic properties of HPV16 compared to other HPVs is unknown. In addition, we neither know which nucleotides vary across and within HPV types and lineages, nor which of the single nucleotide polymorphisms (SNPs) determine oncogenicity. METHODS: A reference set of 62 HPV16 complete genome sequences was established and used to examine patterns of evolutionary relatedness amongst variants using a pairwise identity heatmap and HPV16 phylogeny. A BLAST-based algorithm was developed to impute complete genome data from partial sequence information using the reference database. To interrogate the oncogenic risk of determined and imputed HPV16 SNPs, odds-ratios for each SNP were calculated in a case-control viral genome-wide association study (VWAS) using biopsy confirmed high-grade cervix neoplasia and self-limited HPV16 infections from Guanacaste, Costa Rica. RESULTS: HPV16 variants display evolutionarily stable lineages that contain conserved diagnostic SNPs. The imputation algorithm indicated that an average of 97.5±1.03% of SNPs could be accurately imputed. The VWAS revealed specific HPV16 viral SNPs associated with variant lineages and elevated odds ratios; however, individual causal SNPs could not be distinguished with certainty due to the nature of HPV evolution. CONCLUSIONS: Conserved and lineage-specific SNPs can be imputed with a high degree of accuracy from limited viral polymorphic data due to the lack of recombination and the stochastic mechanism of variation accumulation in the HPV genome. However, to determine the role of novel variants or non-lineage-specific SNPs by VWAS will require direct sequence analysis. The investigation of patterns of genetic variation and the identification of diagnostic SNPs for lineages of HPV16 variants provides a valuable resource for future studies of HPV16 pathogenicity. Public Library of Science 2011-06-23 /pmc/articles/PMC3121793/ /pubmed/21731721 http://dx.doi.org/10.1371/journal.pone.0021375 Text en This is an open-access article, free of all copyright, and may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose. The work is made available under the Creative Commons CC0 public domain dedication. https://creativecommons.org/publicdomain/zero/1.0/ This is an open-access article distributed under the terms of the Creative Commons Public Domain declaration, which stipulates that, once placed in the public domain, this work may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose.
spellingShingle Research Article
Smith, Benjamin
Chen, Zigui
Reimers, Laura
van Doorslaer, Koenraad
Schiffman, Mark
DeSalle, Rob
Herrero, Rolando
Yu, Kai
Wacholder, Sholom
Wang, Tao
Burk, Robert D.
Sequence Imputation of HPV16 Genomes for Genetic Association Studies
title Sequence Imputation of HPV16 Genomes for Genetic Association Studies
title_full Sequence Imputation of HPV16 Genomes for Genetic Association Studies
title_fullStr Sequence Imputation of HPV16 Genomes for Genetic Association Studies
title_full_unstemmed Sequence Imputation of HPV16 Genomes for Genetic Association Studies
title_short Sequence Imputation of HPV16 Genomes for Genetic Association Studies
title_sort sequence imputation of hpv16 genomes for genetic association studies
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3121793/
https://www.ncbi.nlm.nih.gov/pubmed/21731721
http://dx.doi.org/10.1371/journal.pone.0021375
work_keys_str_mv AT smithbenjamin sequenceimputationofhpv16genomesforgeneticassociationstudies
AT chenzigui sequenceimputationofhpv16genomesforgeneticassociationstudies
AT reimerslaura sequenceimputationofhpv16genomesforgeneticassociationstudies
AT vandoorslaerkoenraad sequenceimputationofhpv16genomesforgeneticassociationstudies
AT schiffmanmark sequenceimputationofhpv16genomesforgeneticassociationstudies
AT desallerob sequenceimputationofhpv16genomesforgeneticassociationstudies
AT herrerorolando sequenceimputationofhpv16genomesforgeneticassociationstudies
AT yukai sequenceimputationofhpv16genomesforgeneticassociationstudies
AT wacholdersholom sequenceimputationofhpv16genomesforgeneticassociationstudies
AT wangtao sequenceimputationofhpv16genomesforgeneticassociationstudies
AT burkrobertd sequenceimputationofhpv16genomesforgeneticassociationstudies