PTB-XL, a large publicly available electrocardiography dataset

Electrocardiography (ECG) is a key non-invasive diagnostic tool for cardiovascular diseases which is increasingly supported by algorithms based on machine learning. Major obstacles for the development of automatic ECG interpretation algorithms are both the lack of public datasets and well-defined be...

Bibliographic Details
Main authors: Wagner, Patrick; Strodthoff, Nils; Bousseljot, Ralf-Dieter; Kreiseler, Dieter; Lunze, Fatima I.; Samek, Wojciech; Schaeffter, Tobias
Format: Online Article Text
Language: English
Published: Nature Publishing Group UK 2020
Subjects: Data Descriptor
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7248071/
https://www.ncbi.nlm.nih.gov/pubmed/32451379
http://dx.doi.org/10.1038/s41597-020-0495-6
author Wagner, Patrick
Strodthoff, Nils
Bousseljot, Ralf-Dieter
Kreiseler, Dieter
Lunze, Fatima I.
Samek, Wojciech
Schaeffter, Tobias
collection PubMed
description Electrocardiography (ECG) is a key non-invasive diagnostic tool for cardiovascular diseases which is increasingly supported by algorithms based on machine learning. Major obstacles for the development of automatic ECG interpretation algorithms are the lack of both public datasets and well-defined benchmarking procedures that allow comparisons of different algorithms. To address these issues, we put forward PTB-XL, to date the largest freely accessible clinical 12-lead ECG-waveform dataset, comprising 21837 records of 10 seconds length from 18885 patients. The ECG-waveform data was annotated by up to two cardiologists as a multi-label dataset, where diagnostic labels were further aggregated into super and subclasses. The dataset covers a broad range of diagnostic classes including, in particular, a large fraction of healthy records. The combination with additional metadata on demographics, additional diagnostic statements, diagnosis likelihoods, manually annotated signal properties as well as suggested folds for splitting training and test sets turns the dataset into a rich resource for the development and the evaluation of automatic ECG interpretation algorithms.
format Online
Article
Text
id pubmed-7248071
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-7248071 2020-06-04 PTB-XL, a large publicly available electrocardiography dataset Wagner, Patrick; Strodthoff, Nils; Bousseljot, Ralf-Dieter; Kreiseler, Dieter; Lunze, Fatima I.; Samek, Wojciech; Schaeffter, Tobias Sci Data Data Descriptor Nature Publishing Group UK 2020-05-25 /pmc/articles/PMC7248071/ /pubmed/32451379 http://dx.doi.org/10.1038/s41597-020-0495-6 Text en © The Author(s) 2020 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver http://creativecommons.org/publicdomain/zero/1.0/ applies to the metadata files associated with this article.
title PTB-XL, a large publicly available electrocardiography dataset
topic Data Descriptor
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7248071/
https://www.ncbi.nlm.nih.gov/pubmed/32451379
http://dx.doi.org/10.1038/s41597-020-0495-6