Cargando…
A benchmark dataset of protein antigens for antigenicity measurement
Antigenicity measurement plays a fundamental role in vaccine design, which requires antigen selection from a large number of mutants. To augment traditional cross-reactivity experiments, computational approaches for predicting the antigenic distance between multiple protein antigens are highly valua...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7338539/ https://www.ncbi.nlm.nih.gov/pubmed/32632108 http://dx.doi.org/10.1038/s41597-020-0555-y |
_version_ | 1783554701104513024 |
---|---|
author | Qiu, Tianyi Qiu, Jingxuan Yang, Yiyan Zhang, Lu Mao, Tiantian Zhang, Xiaoyan Xu, Jianqing Cao, Zhiwei |
author_facet | Qiu, Tianyi Qiu, Jingxuan Yang, Yiyan Zhang, Lu Mao, Tiantian Zhang, Xiaoyan Xu, Jianqing Cao, Zhiwei |
author_sort | Qiu, Tianyi |
collection | PubMed |
description | Antigenicity measurement plays a fundamental role in vaccine design, which requires antigen selection from a large number of mutants. To augment traditional cross-reactivity experiments, computational approaches for predicting the antigenic distance between multiple protein antigens are highly valuable. The performance of in silico models relies heavily on large-scale benchmark datasets, which are scattered among public databases and published articles or reports. Here, we present the first benchmark dataset of protein antigens with experimental evidence to guide in silico antigenicity calculations. This dataset includes (1) standard haemagglutination-inhibition (HI) tests for 3,867 influenza A/H3N2 strain pairs, (2) standard HI tests for 559 influenza virus B strain pairs, and (3) neutralization titres derived from 1,073 Dengue virus strain pairs. All of these datasets were collated and annotated with experimentally validated antigenicity relationships as well as sequence information for the corresponding protein antigens. We anticipate that this work will provide a benchmark dataset for in silico antigenicity prediction that could be further used to assist in epidemic surveillance and therapeutic vaccine design for viruses with variable antigenicity. |
format | Online Article Text |
id | pubmed-7338539 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-73385392020-07-09 A benchmark dataset of protein antigens for antigenicity measurement Qiu, Tianyi Qiu, Jingxuan Yang, Yiyan Zhang, Lu Mao, Tiantian Zhang, Xiaoyan Xu, Jianqing Cao, Zhiwei Sci Data Data Descriptor Antigenicity measurement plays a fundamental role in vaccine design, which requires antigen selection from a large number of mutants. To augment traditional cross-reactivity experiments, computational approaches for predicting the antigenic distance between multiple protein antigens are highly valuable. The performance of in silico models relies heavily on large-scale benchmark datasets, which are scattered among public databases and published articles or reports. Here, we present the first benchmark dataset of protein antigens with experimental evidence to guide in silico antigenicity calculations. This dataset includes (1) standard haemagglutination-inhibition (HI) tests for 3,867 influenza A/H3N2 strain pairs, (2) standard HI tests for 559 influenza virus B strain pairs, and (3) neutralization titres derived from 1,073 Dengue virus strain pairs. All of these datasets were collated and annotated with experimentally validated antigenicity relationships as well as sequence information for the corresponding protein antigens. We anticipate that this work will provide a benchmark dataset for in silico antigenicity prediction that could be further used to assist in epidemic surveillance and therapeutic vaccine design for viruses with variable antigenicity. Nature Publishing Group UK 2020-07-06 /pmc/articles/PMC7338539/ /pubmed/32632108 http://dx.doi.org/10.1038/s41597-020-0555-y Text en © The Author(s) 2020 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver http://creativecommons.org/publicdomain/zero/1.0/ applies to the metadata files associated with this article. |
spellingShingle | Data Descriptor Qiu, Tianyi Qiu, Jingxuan Yang, Yiyan Zhang, Lu Mao, Tiantian Zhang, Xiaoyan Xu, Jianqing Cao, Zhiwei A benchmark dataset of protein antigens for antigenicity measurement |
title | A benchmark dataset of protein antigens for antigenicity measurement |
title_full | A benchmark dataset of protein antigens for antigenicity measurement |
title_fullStr | A benchmark dataset of protein antigens for antigenicity measurement |
title_full_unstemmed | A benchmark dataset of protein antigens for antigenicity measurement |
title_short | A benchmark dataset of protein antigens for antigenicity measurement |
title_sort | benchmark dataset of protein antigens for antigenicity measurement |
topic | Data Descriptor |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7338539/ https://www.ncbi.nlm.nih.gov/pubmed/32632108 http://dx.doi.org/10.1038/s41597-020-0555-y |
work_keys_str_mv | AT qiutianyi abenchmarkdatasetofproteinantigensforantigenicitymeasurement AT qiujingxuan abenchmarkdatasetofproteinantigensforantigenicitymeasurement AT yangyiyan abenchmarkdatasetofproteinantigensforantigenicitymeasurement AT zhanglu abenchmarkdatasetofproteinantigensforantigenicitymeasurement AT maotiantian abenchmarkdatasetofproteinantigensforantigenicitymeasurement AT zhangxiaoyan abenchmarkdatasetofproteinantigensforantigenicitymeasurement AT xujianqing abenchmarkdatasetofproteinantigensforantigenicitymeasurement AT caozhiwei abenchmarkdatasetofproteinantigensforantigenicitymeasurement AT qiutianyi benchmarkdatasetofproteinantigensforantigenicitymeasurement AT qiujingxuan benchmarkdatasetofproteinantigensforantigenicitymeasurement AT yangyiyan benchmarkdatasetofproteinantigensforantigenicitymeasurement AT zhanglu benchmarkdatasetofproteinantigensforantigenicitymeasurement AT maotiantian benchmarkdatasetofproteinantigensforantigenicitymeasurement AT zhangxiaoyan benchmarkdatasetofproteinantigensforantigenicitymeasurement AT xujianqing benchmarkdatasetofproteinantigensforantigenicitymeasurement AT caozhiwei benchmarkdatasetofproteinantigensforantigenicitymeasurement |