johannes-garstenauer/structs_token_size_4_reduced_labelled_train
收藏Hugging Face2023-10-30 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/johannes-garstenauer/structs_token_size_4_reduced_labelled_train
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
features:
- name: struct
dtype: string
- name: label
dtype: int64
splits:
- name: train
num_bytes: 372362495.3356041
num_examples: 1518855
download_size: 138213330
dataset_size: 372362495.3356041
---
# Dataset Card for "structs_token_size_4_reduced_labelled_train"
Dataset created for thesis: "Generating Robust Representations of
Structures in OpenSSH Heap Dumps" by Johannes Garstenauer.
This dataset contains raw heap data structures along with their labels.
This is the training dataset. Validation set at: https://huggingface.co/datasets/johannes-garstenauer/structs_token_size_4_reduced_labelled_eval
Data structures and labels are extracted from: https://zenodo.org/records/6537904
Thesis and associated scripts: https://zenodo.org/records/10053730
提供机构:
johannes-garstenauer
原始信息汇总
数据集概述
数据集信息
-
特征:
struct:数据类型为stringlabel:数据类型为int64
-
数据分割:
train:包含 1518855 个样本,占用 372362495.3356041 字节
-
数据大小:
- 下载大小:138213330 字节
- 数据集大小:372362495.3356041 字节



