Cargando…
PTB-XL, a large publicly available electrocardiography dataset
Electrocardiography (ECG) is a key non-invasive diagnostic tool for cardiovascular diseases which is increasingly supported by algorithms based on machine learning. Major obstacles for the development of automatic ECG interpretation algorithms are both the lack of public datasets and well-defined be...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7248071/ https://www.ncbi.nlm.nih.gov/pubmed/32451379 http://dx.doi.org/10.1038/s41597-020-0495-6 |
_version_ | 1783538289378066432 |
---|---|
author | Wagner, Patrick Strodthoff, Nils Bousseljot, Ralf-Dieter Kreiseler, Dieter Lunze, Fatima I. Samek, Wojciech Schaeffter, Tobias |
author_facet | Wagner, Patrick Strodthoff, Nils Bousseljot, Ralf-Dieter Kreiseler, Dieter Lunze, Fatima I. Samek, Wojciech Schaeffter, Tobias |
author_sort | Wagner, Patrick |
collection | PubMed |
description | Electrocardiography (ECG) is a key non-invasive diagnostic tool for cardiovascular diseases which is increasingly supported by algorithms based on machine learning. Major obstacles for the development of automatic ECG interpretation algorithms are both the lack of public datasets and well-defined benchmarking procedures to allow comparison s of different algorithms. To address these issues, we put forward PTB-XL, the to-date largest freely accessible clinical 12-lead ECG-waveform dataset comprising 21837 records from 18885 patients of 10 seconds length. The ECG-waveform data was annotated by up to two cardiologists as a multi-label dataset, where diagnostic labels were further aggregated into super and subclasses. The dataset covers a broad range of diagnostic classes including, in particular, a large fraction of healthy records. The combination with additional metadata on demographics, additional diagnostic statements, diagnosis likelihoods, manually annotated signal properties as well as suggested folds for splitting training and test sets turns the dataset into a rich resource for the development and the evaluation of automatic ECG interpretation algorithms. |
format | Online Article Text |
id | pubmed-7248071 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-72480712020-06-04 PTB-XL, a large publicly available electrocardiography dataset Wagner, Patrick Strodthoff, Nils Bousseljot, Ralf-Dieter Kreiseler, Dieter Lunze, Fatima I. Samek, Wojciech Schaeffter, Tobias Sci Data Data Descriptor Electrocardiography (ECG) is a key non-invasive diagnostic tool for cardiovascular diseases which is increasingly supported by algorithms based on machine learning. Major obstacles for the development of automatic ECG interpretation algorithms are both the lack of public datasets and well-defined benchmarking procedures to allow comparison s of different algorithms. To address these issues, we put forward PTB-XL, the to-date largest freely accessible clinical 12-lead ECG-waveform dataset comprising 21837 records from 18885 patients of 10 seconds length. The ECG-waveform data was annotated by up to two cardiologists as a multi-label dataset, where diagnostic labels were further aggregated into super and subclasses. The dataset covers a broad range of diagnostic classes including, in particular, a large fraction of healthy records. The combination with additional metadata on demographics, additional diagnostic statements, diagnosis likelihoods, manually annotated signal properties as well as suggested folds for splitting training and test sets turns the dataset into a rich resource for the development and the evaluation of automatic ECG interpretation algorithms. Nature Publishing Group UK 2020-05-25 /pmc/articles/PMC7248071/ /pubmed/32451379 http://dx.doi.org/10.1038/s41597-020-0495-6 Text en © The Author(s) 2020 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver http://creativecommons.org/publicdomain/zero/1.0/ applies to the metadata files associated with this article. |
spellingShingle | Data Descriptor Wagner, Patrick Strodthoff, Nils Bousseljot, Ralf-Dieter Kreiseler, Dieter Lunze, Fatima I. Samek, Wojciech Schaeffter, Tobias PTB-XL, a large publicly available electrocardiography dataset |
title | PTB-XL, a large publicly available electrocardiography dataset |
title_full | PTB-XL, a large publicly available electrocardiography dataset |
title_fullStr | PTB-XL, a large publicly available electrocardiography dataset |
title_full_unstemmed | PTB-XL, a large publicly available electrocardiography dataset |
title_short | PTB-XL, a large publicly available electrocardiography dataset |
title_sort | ptb-xl, a large publicly available electrocardiography dataset |
topic | Data Descriptor |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7248071/ https://www.ncbi.nlm.nih.gov/pubmed/32451379 http://dx.doi.org/10.1038/s41597-020-0495-6 |
work_keys_str_mv | AT wagnerpatrick ptbxlalargepubliclyavailableelectrocardiographydataset AT strodthoffnils ptbxlalargepubliclyavailableelectrocardiographydataset AT bousseljotralfdieter ptbxlalargepubliclyavailableelectrocardiographydataset AT kreiselerdieter ptbxlalargepubliclyavailableelectrocardiographydataset AT lunzefatimai ptbxlalargepubliclyavailableelectrocardiographydataset AT samekwojciech ptbxlalargepubliclyavailableelectrocardiographydataset AT schaefftertobias ptbxlalargepubliclyavailableelectrocardiographydataset |