HPVdb: a data mining system for knowledge discovery in human papillomavirus with applications in T cell immunology and vaccinology

High-risk human papillomaviruses (HPVs) are the causes of many cancers, including cervical, anal, vulvar, vaginal, penile and oropharyngeal. To facilitate diagnosis, prognosis and characterization of these cancers, it is necessary to make full use of the immunological data on HPV available through p...

Bibliographic Details
Main Authors: Zhang, Guang Lan, Riemer, Angelika B., Keskin, Derin B., Chitkushev, Lou, Reinherz, Ellis L., Brusic, Vladimir
Format: Online Article Text
Language: English
Published: Oxford University Press 2014
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3975992/
https://www.ncbi.nlm.nih.gov/pubmed/24705205
http://dx.doi.org/10.1093/database/bau031
_version_ 1782310222661943296
author Zhang, Guang Lan
Riemer, Angelika B.
Keskin, Derin B.
Chitkushev, Lou
Reinherz, Ellis L.
Brusic, Vladimir
author_sort Zhang, Guang Lan
collection PubMed
description High-risk human papillomaviruses (HPVs) are the causes of many cancers, including cervical, anal, vulvar, vaginal, penile and oropharyngeal. To facilitate diagnosis, prognosis and characterization of these cancers, it is necessary to make full use of the immunological data on HPV available through publications, technical reports and databases. These data vary in granularity, quality and complexity. The extraction of knowledge from the vast amount of immunological data using data mining techniques remains a challenging task. To support integration of data and knowledge in virology and vaccinology, we developed a framework called KB-builder to streamline the development and deployment of web-accessible immunological knowledge systems. The framework consists of seven major functional modules, each facilitating a specific aspect of the knowledgebase construction process. Using KB-builder, we constructed the Human Papillomavirus T cell Antigen Database (HPVdb). It contains 2781 curated antigen entries of antigenic proteins derived from 18 genotypes of high-risk HPV and 18 genotypes of low-risk HPV. The HPVdb also catalogs 191 verified T cell epitopes and 45 verified human leukocyte antigen (HLA) ligands. Primary amino acid sequences of HPV antigens were collected and annotated from the UniProtKB. T cell epitopes and HLA ligands were collected from data mining of scientific literature and databases. The data were subject to extensive quality control (redundancy elimination, error detection and vocabulary consolidation). A set of computational tools for an in-depth analysis, such as sequence comparison using BLAST search, multiple alignments of antigens, classification of HPV types based on cancer risk, T cell epitope/HLA ligand visualization, T cell epitope/HLA ligand conservation analysis and sequence variability analysis, has been integrated within the HPVdb. 
Predicted Class I and Class II HLA binding peptides for 15 common HLA alleles are included in this database as putative targets. HPVdb is a knowledge-based system that integrates curated data and information with tailored analysis tools to facilitate data mining for HPV vaccinology and immunology. To our best knowledge, HPVdb is a unique data source providing a comprehensive list of HPV antigens and peptides. Database URL: http://cvc.dfci.harvard.edu/hpv/
format Online
Article
Text
id pubmed-3975992
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-39759922014-04-07 HPVdb: a data mining system for knowledge discovery in human papillomavirus with applications in T cell immunology and vaccinology Zhang, Guang Lan Riemer, Angelika B. Keskin, Derin B. Chitkushev, Lou Reinherz, Ellis L. Brusic, Vladimir Database (Oxford) Database Tool High-risk human papillomaviruses (HPVs) are the causes of many cancers, including cervical, anal, vulvar, vaginal, penile and oropharyngeal. To facilitate diagnosis, prognosis and characterization of these cancers, it is necessary to make full use of the immunological data on HPV available through publications, technical reports and databases. These data vary in granularity, quality and complexity. The extraction of knowledge from the vast amount of immunological data using data mining techniques remains a challenging task. To support integration of data and knowledge in virology and vaccinology, we developed a framework called KB-builder to streamline the development and deployment of web-accessible immunological knowledge systems. The framework consists of seven major functional modules, each facilitating a specific aspect of the knowledgebase construction process. Using KB-builder, we constructed the Human Papillomavirus T cell Antigen Database (HPVdb). It contains 2781 curated antigen entries of antigenic proteins derived from 18 genotypes of high-risk HPV and 18 genotypes of low-risk HPV. The HPVdb also catalogs 191 verified T cell epitopes and 45 verified human leukocyte antigen (HLA) ligands. Primary amino acid sequences of HPV antigens were collected and annotated from the UniProtKB. T cell epitopes and HLA ligands were collected from data mining of scientific literature and databases. The data were subject to extensive quality control (redundancy elimination, error detection and vocabulary consolidation). 
A set of computational tools for an in-depth analysis, such as sequence comparison using BLAST search, multiple alignments of antigens, classification of HPV types based on cancer risk, T cell epitope/HLA ligand visualization, T cell epitope/HLA ligand conservation analysis and sequence variability analysis, has been integrated within the HPVdb. Predicted Class I and Class II HLA binding peptides for 15 common HLA alleles are included in this database as putative targets. HPVdb is a knowledge-based system that integrates curated data and information with tailored analysis tools to facilitate data mining for HPV vaccinology and immunology. To our best knowledge, HPVdb is a unique data source providing a comprehensive list of HPV antigens and peptides. Database URL: http://cvc.dfci.harvard.edu/hpv/ Oxford University Press 2014-04-04 /pmc/articles/PMC3975992/ /pubmed/24705205 http://dx.doi.org/10.1093/database/bau031 Text en © The Author(s) 2014. Published by Oxford University Press. http://creativecommons.org/licenses/by/3.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/3.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Database Tool
Zhang, Guang Lan
Riemer, Angelika B.
Keskin, Derin B.
Chitkushev, Lou
Reinherz, Ellis L.
Brusic, Vladimir
HPVdb: a data mining system for knowledge discovery in human papillomavirus with applications in T cell immunology and vaccinology
title HPVdb: a data mining system for knowledge discovery in human papillomavirus with applications in T cell immunology and vaccinology
topic Database Tool
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3975992/
https://www.ncbi.nlm.nih.gov/pubmed/24705205
http://dx.doi.org/10.1093/database/bau031