Cargando…

A benchmark dataset of protein antigens for antigenicity measurement

Antigenicity measurement plays a fundamental role in vaccine design, which requires antigen selection from a large number of mutants. To augment traditional cross-reactivity experiments, computational approaches for predicting the antigenic distance between multiple protein antigens are highly valua...

Descripción completa

Detalles Bibliográficos
Autores principales: Qiu, Tianyi, Qiu, Jingxuan, Yang, Yiyan, Zhang, Lu, Mao, Tiantian, Zhang, Xiaoyan, Xu, Jianqing, Cao, Zhiwei
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7338539/
https://www.ncbi.nlm.nih.gov/pubmed/32632108
http://dx.doi.org/10.1038/s41597-020-0555-y
_version_ 1783554701104513024
author Qiu, Tianyi
Qiu, Jingxuan
Yang, Yiyan
Zhang, Lu
Mao, Tiantian
Zhang, Xiaoyan
Xu, Jianqing
Cao, Zhiwei
author_facet Qiu, Tianyi
Qiu, Jingxuan
Yang, Yiyan
Zhang, Lu
Mao, Tiantian
Zhang, Xiaoyan
Xu, Jianqing
Cao, Zhiwei
author_sort Qiu, Tianyi
collection PubMed
description Antigenicity measurement plays a fundamental role in vaccine design, which requires antigen selection from a large number of mutants. To augment traditional cross-reactivity experiments, computational approaches for predicting the antigenic distance between multiple protein antigens are highly valuable. The performance of in silico models relies heavily on large-scale benchmark datasets, which are scattered among public databases and published articles or reports. Here, we present the first benchmark dataset of protein antigens with experimental evidence to guide in silico antigenicity calculations. This dataset includes (1) standard haemagglutination-inhibition (HI) tests for 3,867 influenza A/H3N2 strain pairs, (2) standard HI tests for 559 influenza virus B strain pairs, and (3) neutralization titres derived from 1,073 Dengue virus strain pairs. All of these datasets were collated and annotated with experimentally validated antigenicity relationships as well as sequence information for the corresponding protein antigens. We anticipate that this work will provide a benchmark dataset for in silico antigenicity prediction that could be further used to assist in epidemic surveillance and therapeutic vaccine design for viruses with variable antigenicity.
format Online
Article
Text
id pubmed-7338539
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-73385392020-07-09 A benchmark dataset of protein antigens for antigenicity measurement Qiu, Tianyi Qiu, Jingxuan Yang, Yiyan Zhang, Lu Mao, Tiantian Zhang, Xiaoyan Xu, Jianqing Cao, Zhiwei Sci Data Data Descriptor Antigenicity measurement plays a fundamental role in vaccine design, which requires antigen selection from a large number of mutants. To augment traditional cross-reactivity experiments, computational approaches for predicting the antigenic distance between multiple protein antigens are highly valuable. The performance of in silico models relies heavily on large-scale benchmark datasets, which are scattered among public databases and published articles or reports. Here, we present the first benchmark dataset of protein antigens with experimental evidence to guide in silico antigenicity calculations. This dataset includes (1) standard haemagglutination-inhibition (HI) tests for 3,867 influenza A/H3N2 strain pairs, (2) standard HI tests for 559 influenza virus B strain pairs, and (3) neutralization titres derived from 1,073 Dengue virus strain pairs. All of these datasets were collated and annotated with experimentally validated antigenicity relationships as well as sequence information for the corresponding protein antigens. We anticipate that this work will provide a benchmark dataset for in silico antigenicity prediction that could be further used to assist in epidemic surveillance and therapeutic vaccine design for viruses with variable antigenicity. Nature Publishing Group UK 2020-07-06 /pmc/articles/PMC7338539/ /pubmed/32632108 http://dx.doi.org/10.1038/s41597-020-0555-y Text en © The Author(s) 2020 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver http://creativecommons.org/publicdomain/zero/1.0/ applies to the metadata files associated with this article.
spellingShingle Data Descriptor
Qiu, Tianyi
Qiu, Jingxuan
Yang, Yiyan
Zhang, Lu
Mao, Tiantian
Zhang, Xiaoyan
Xu, Jianqing
Cao, Zhiwei
A benchmark dataset of protein antigens for antigenicity measurement
title A benchmark dataset of protein antigens for antigenicity measurement
title_full A benchmark dataset of protein antigens for antigenicity measurement
title_fullStr A benchmark dataset of protein antigens for antigenicity measurement
title_full_unstemmed A benchmark dataset of protein antigens for antigenicity measurement
title_short A benchmark dataset of protein antigens for antigenicity measurement
title_sort benchmark dataset of protein antigens for antigenicity measurement
topic Data Descriptor
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7338539/
https://www.ncbi.nlm.nih.gov/pubmed/32632108
http://dx.doi.org/10.1038/s41597-020-0555-y
work_keys_str_mv AT qiutianyi abenchmarkdatasetofproteinantigensforantigenicitymeasurement
AT qiujingxuan abenchmarkdatasetofproteinantigensforantigenicitymeasurement
AT yangyiyan abenchmarkdatasetofproteinantigensforantigenicitymeasurement
AT zhanglu abenchmarkdatasetofproteinantigensforantigenicitymeasurement
AT maotiantian abenchmarkdatasetofproteinantigensforantigenicitymeasurement
AT zhangxiaoyan abenchmarkdatasetofproteinantigensforantigenicitymeasurement
AT xujianqing abenchmarkdatasetofproteinantigensforantigenicitymeasurement
AT caozhiwei abenchmarkdatasetofproteinantigensforantigenicitymeasurement
AT qiutianyi benchmarkdatasetofproteinantigensforantigenicitymeasurement
AT qiujingxuan benchmarkdatasetofproteinantigensforantigenicitymeasurement
AT yangyiyan benchmarkdatasetofproteinantigensforantigenicitymeasurement
AT zhanglu benchmarkdatasetofproteinantigensforantigenicitymeasurement
AT maotiantian benchmarkdatasetofproteinantigensforantigenicitymeasurement
AT zhangxiaoyan benchmarkdatasetofproteinantigensforantigenicitymeasurement
AT xujianqing benchmarkdatasetofproteinantigensforantigenicitymeasurement
AT caozhiwei benchmarkdatasetofproteinantigensforantigenicitymeasurement