摘要
Overview The dataset provides (soft) segmentation/delineation masks for the PTB-XL ECG dataset. The underlying original dataset (v1.0.1) is available from PhysioNet (doi.org/10.13026/kfzx-aw45) For further details on the orginal dataset, consult the dataset descriptor (Wagner, P., Strodthoff, N., Bousseljot, RD. et al. PTB-XL, a large publicly available electrocardiography dataset. Sci Data 7, 154 (2020). https://doi.org/10.1038/s41597-020-0495-6). Dataset The dataset was obtained by training a U-net segmentation model using ECGDeli (Pilia, N., Nagel, C., et al. ECGdeli - An open source ECG delineation toolbox for MATLAB. SoftwareX 13, 100639 (2021).
https://doi.org/10.1016/j.softx.2020.100639.). as initial labels increased by adding intermediate segments. Details on the model architecture will be described in a forthcoming article (Wagner, P., Mehari, T. et al.). Dataset format In the folder ptbxl_segmentations_8bit, we provide files in numpy format (8 bit unsigned integer) with filenames corresponding to the ECG ID of the corresponding sample in PTB-XL. Each file contains an array of shape 1000x12x24, where the first axis corresponds to time steps in the sequence (10s at 100 Hz), the second refers to the 12 ECG leads (['I', 'II', 'III', 'aVR', 'aVL', 'aVF', 'V1', 'V2', 'V3', 'V4', 'V5', 'V6']) and the third axis refers to ['P-Onset', '(p', 'P-Peak', 'p)', 'P-Offset', 'pq', 'QRS-Onset', '(q', 'Q-Peak', 'qr', 'R-Peak', 'rs', 'S-Peak', 'q)', 'QRS-Offset', 'qj', 'J-point', 'jt', 'T-Onset', '(t', 'T-Peak', 't)', 'T-Offset', 'tp']. For every time step and channel it provides a soft output distribution over the 24 output classes of the segmentation model.