Cargando…
Sequence Imputation of HPV16 Genomes for Genetic Association Studies
BACKGROUND: Human Papillomavirus type 16 (HPV16) causes over half of all cervical cancer and some HPV16 variants are more oncogenic than others. The genetic basis for the extraordinary oncogenic properties of HPV16 compared to other HPVs is unknown. In addition, we neither know which nucleotides var...
Autores principales: | , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2011
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3121793/ https://www.ncbi.nlm.nih.gov/pubmed/21731721 http://dx.doi.org/10.1371/journal.pone.0021375 |
_version_ | 1782206872462295040 |
---|---|
author | Smith, Benjamin Chen, Zigui Reimers, Laura van Doorslaer, Koenraad Schiffman, Mark DeSalle, Rob Herrero, Rolando Yu, Kai Wacholder, Sholom Wang, Tao Burk, Robert D. |
author_facet | Smith, Benjamin Chen, Zigui Reimers, Laura van Doorslaer, Koenraad Schiffman, Mark DeSalle, Rob Herrero, Rolando Yu, Kai Wacholder, Sholom Wang, Tao Burk, Robert D. |
author_sort | Smith, Benjamin |
collection | PubMed |
description | BACKGROUND: Human Papillomavirus type 16 (HPV16) causes over half of all cervical cancer and some HPV16 variants are more oncogenic than others. The genetic basis for the extraordinary oncogenic properties of HPV16 compared to other HPVs is unknown. In addition, we neither know which nucleotides vary across and within HPV types and lineages, nor which of the single nucleotide polymorphisms (SNPs) determine oncogenicity. METHODS: A reference set of 62 HPV16 complete genome sequences was established and used to examine patterns of evolutionary relatedness amongst variants using a pairwise identity heatmap and HPV16 phylogeny. A BLAST-based algorithm was developed to impute complete genome data from partial sequence information using the reference database. To interrogate the oncogenic risk of determined and imputed HPV16 SNPs, odds-ratios for each SNP were calculated in a case-control viral genome-wide association study (VWAS) using biopsy confirmed high-grade cervix neoplasia and self-limited HPV16 infections from Guanacaste, Costa Rica. RESULTS: HPV16 variants display evolutionarily stable lineages that contain conserved diagnostic SNPs. The imputation algorithm indicated that an average of 97.5±1.03% of SNPs could be accurately imputed. The VWAS revealed specific HPV16 viral SNPs associated with variant lineages and elevated odds ratios; however, individual causal SNPs could not be distinguished with certainty due to the nature of HPV evolution. CONCLUSIONS: Conserved and lineage-specific SNPs can be imputed with a high degree of accuracy from limited viral polymorphic data due to the lack of recombination and the stochastic mechanism of variation accumulation in the HPV genome. However, to determine the role of novel variants or non-lineage-specific SNPs by VWAS will require direct sequence analysis. The investigation of patterns of genetic variation and the identification of diagnostic SNPs for lineages of HPV16 variants provides a valuable resource for future studies of HPV16 pathogenicity. |
format | Online Article Text |
id | pubmed-3121793 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2011 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-31217932011-06-30 Sequence Imputation of HPV16 Genomes for Genetic Association Studies Smith, Benjamin Chen, Zigui Reimers, Laura van Doorslaer, Koenraad Schiffman, Mark DeSalle, Rob Herrero, Rolando Yu, Kai Wacholder, Sholom Wang, Tao Burk, Robert D. PLoS One Research Article BACKGROUND: Human Papillomavirus type 16 (HPV16) causes over half of all cervical cancer and some HPV16 variants are more oncogenic than others. The genetic basis for the extraordinary oncogenic properties of HPV16 compared to other HPVs is unknown. In addition, we neither know which nucleotides vary across and within HPV types and lineages, nor which of the single nucleotide polymorphisms (SNPs) determine oncogenicity. METHODS: A reference set of 62 HPV16 complete genome sequences was established and used to examine patterns of evolutionary relatedness amongst variants using a pairwise identity heatmap and HPV16 phylogeny. A BLAST-based algorithm was developed to impute complete genome data from partial sequence information using the reference database. To interrogate the oncogenic risk of determined and imputed HPV16 SNPs, odds-ratios for each SNP were calculated in a case-control viral genome-wide association study (VWAS) using biopsy confirmed high-grade cervix neoplasia and self-limited HPV16 infections from Guanacaste, Costa Rica. RESULTS: HPV16 variants display evolutionarily stable lineages that contain conserved diagnostic SNPs. The imputation algorithm indicated that an average of 97.5±1.03% of SNPs could be accurately imputed. The VWAS revealed specific HPV16 viral SNPs associated with variant lineages and elevated odds ratios; however, individual causal SNPs could not be distinguished with certainty due to the nature of HPV evolution. CONCLUSIONS: Conserved and lineage-specific SNPs can be imputed with a high degree of accuracy from limited viral polymorphic data due to the lack of recombination and the stochastic mechanism of variation accumulation in the HPV genome. However, to determine the role of novel variants or non-lineage-specific SNPs by VWAS will require direct sequence analysis. The investigation of patterns of genetic variation and the identification of diagnostic SNPs for lineages of HPV16 variants provides a valuable resource for future studies of HPV16 pathogenicity. Public Library of Science 2011-06-23 /pmc/articles/PMC3121793/ /pubmed/21731721 http://dx.doi.org/10.1371/journal.pone.0021375 Text en This is an open-access article, free of all copyright, and may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose. The work is made available under the Creative Commons CC0 public domain dedication. https://creativecommons.org/publicdomain/zero/1.0/ This is an open-access article distributed under the terms of the Creative Commons Public Domain declaration, which stipulates that, once placed in the public domain, this work may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose. |
spellingShingle | Research Article Smith, Benjamin Chen, Zigui Reimers, Laura van Doorslaer, Koenraad Schiffman, Mark DeSalle, Rob Herrero, Rolando Yu, Kai Wacholder, Sholom Wang, Tao Burk, Robert D. Sequence Imputation of HPV16 Genomes for Genetic Association Studies |
title | Sequence Imputation of HPV16 Genomes for Genetic Association Studies |
title_full | Sequence Imputation of HPV16 Genomes for Genetic Association Studies |
title_fullStr | Sequence Imputation of HPV16 Genomes for Genetic Association Studies |
title_full_unstemmed | Sequence Imputation of HPV16 Genomes for Genetic Association Studies |
title_short | Sequence Imputation of HPV16 Genomes for Genetic Association Studies |
title_sort | sequence imputation of hpv16 genomes for genetic association studies |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3121793/ https://www.ncbi.nlm.nih.gov/pubmed/21731721 http://dx.doi.org/10.1371/journal.pone.0021375 |
work_keys_str_mv | AT smithbenjamin sequenceimputationofhpv16genomesforgeneticassociationstudies AT chenzigui sequenceimputationofhpv16genomesforgeneticassociationstudies AT reimerslaura sequenceimputationofhpv16genomesforgeneticassociationstudies AT vandoorslaerkoenraad sequenceimputationofhpv16genomesforgeneticassociationstudies AT schiffmanmark sequenceimputationofhpv16genomesforgeneticassociationstudies AT desallerob sequenceimputationofhpv16genomesforgeneticassociationstudies AT herrerorolando sequenceimputationofhpv16genomesforgeneticassociationstudies AT yukai sequenceimputationofhpv16genomesforgeneticassociationstudies AT wacholdersholom sequenceimputationofhpv16genomesforgeneticassociationstudies AT wangtao sequenceimputationofhpv16genomesforgeneticassociationstudies AT burkrobertd sequenceimputationofhpv16genomesforgeneticassociationstudies |