Cargando…
PTB-XL+, a comprehensive electrocardiographic feature dataset
Machine learning (ML) methods for the analysis of electrocardiography (ECG) data are gaining importance, substantially supported by the release of large public datasets. However, these current datasets miss important derived descriptors such as ECG features that have been devised in the past hundred...
Autores principales: | , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10183020/ https://www.ncbi.nlm.nih.gov/pubmed/37179420 http://dx.doi.org/10.1038/s41597-023-02153-8 |
_version_ | 1785041869532561408 |
---|---|
author | Strodthoff, Nils Mehari, Temesgen Nagel, Claudia Aston, Philip J. Sundar, Ashish Graff, Claus Kanters, Jørgen K. Haverkamp, Wilhelm Dössel, Olaf Loewe, Axel Bär, Markus Schaeffter, Tobias |
author_facet | Strodthoff, Nils Mehari, Temesgen Nagel, Claudia Aston, Philip J. Sundar, Ashish Graff, Claus Kanters, Jørgen K. Haverkamp, Wilhelm Dössel, Olaf Loewe, Axel Bär, Markus Schaeffter, Tobias |
author_sort | Strodthoff, Nils |
collection | PubMed |
description | Machine learning (ML) methods for the analysis of electrocardiography (ECG) data are gaining importance, substantially supported by the release of large public datasets. However, these current datasets miss important derived descriptors such as ECG features that have been devised in the past hundred years and still form the basis of most automatic ECG analysis algorithms and are critical for cardiologists’ decision processes. ECG features are available from sophisticated commercial software but are not accessible to the general public. To alleviate this issue, we add ECG features from two leading commercial algorithms and an open-source implementation supplemented by a set of automatic diagnostic statements from a commercial ECG analysis software in preprocessed format. This allows the comparison of ML models trained on clinically versus automatically generated label sets. We provide an extensive technical validation of features and diagnostic statements for ML applications. We believe this release crucially enhances the usability of the PTB-XL dataset as a reference dataset for ML methods in the context of ECG data. |
format | Online Article Text |
id | pubmed-10183020 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-101830202023-05-15 PTB-XL+, a comprehensive electrocardiographic feature dataset Strodthoff, Nils Mehari, Temesgen Nagel, Claudia Aston, Philip J. Sundar, Ashish Graff, Claus Kanters, Jørgen K. Haverkamp, Wilhelm Dössel, Olaf Loewe, Axel Bär, Markus Schaeffter, Tobias Sci Data Data Descriptor Machine learning (ML) methods for the analysis of electrocardiography (ECG) data are gaining importance, substantially supported by the release of large public datasets. However, these current datasets miss important derived descriptors such as ECG features that have been devised in the past hundred years and still form the basis of most automatic ECG analysis algorithms and are critical for cardiologists’ decision processes. ECG features are available from sophisticated commercial software but are not accessible to the general public. To alleviate this issue, we add ECG features from two leading commercial algorithms and an open-source implementation supplemented by a set of automatic diagnostic statements from a commercial ECG analysis software in preprocessed format. This allows the comparison of ML models trained on clinically versus automatically generated label sets. We provide an extensive technical validation of features and diagnostic statements for ML applications. We believe this release crucially enhances the usability of the PTB-XL dataset as a reference dataset for ML methods in the context of ECG data. Nature Publishing Group UK 2023-05-13 /pmc/articles/PMC10183020/ /pubmed/37179420 http://dx.doi.org/10.1038/s41597-023-02153-8 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Data Descriptor Strodthoff, Nils Mehari, Temesgen Nagel, Claudia Aston, Philip J. Sundar, Ashish Graff, Claus Kanters, Jørgen K. Haverkamp, Wilhelm Dössel, Olaf Loewe, Axel Bär, Markus Schaeffter, Tobias PTB-XL+, a comprehensive electrocardiographic feature dataset |
title | PTB-XL+, a comprehensive electrocardiographic feature dataset |
title_full | PTB-XL+, a comprehensive electrocardiographic feature dataset |
title_fullStr | PTB-XL+, a comprehensive electrocardiographic feature dataset |
title_full_unstemmed | PTB-XL+, a comprehensive electrocardiographic feature dataset |
title_short | PTB-XL+, a comprehensive electrocardiographic feature dataset |
title_sort | ptb-xl+, a comprehensive electrocardiographic feature dataset |
topic | Data Descriptor |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10183020/ https://www.ncbi.nlm.nih.gov/pubmed/37179420 http://dx.doi.org/10.1038/s41597-023-02153-8 |
work_keys_str_mv | AT strodthoffnils ptbxlacomprehensiveelectrocardiographicfeaturedataset AT meharitemesgen ptbxlacomprehensiveelectrocardiographicfeaturedataset AT nagelclaudia ptbxlacomprehensiveelectrocardiographicfeaturedataset AT astonphilipj ptbxlacomprehensiveelectrocardiographicfeaturedataset AT sundarashish ptbxlacomprehensiveelectrocardiographicfeaturedataset AT graffclaus ptbxlacomprehensiveelectrocardiographicfeaturedataset AT kantersjørgenk ptbxlacomprehensiveelectrocardiographicfeaturedataset AT haverkampwilhelm ptbxlacomprehensiveelectrocardiographicfeaturedataset AT dosselolaf ptbxlacomprehensiveelectrocardiographicfeaturedataset AT loeweaxel ptbxlacomprehensiveelectrocardiographicfeaturedataset AT barmarkus ptbxlacomprehensiveelectrocardiographicfeaturedataset AT schaefftertobias ptbxlacomprehensiveelectrocardiographicfeaturedataset |