Dudep/types-xlnet-base-cased
Source: Hugging Face | Updated: 2024-07-05 | Indexed: 2024-07-06
Download link:
https://hf-mirror.com/datasets/Dudep/types-xlnet-base-cased
Summary:
This dataset has three features: labels, input_ids, and attention_mask. The labels feature is a sequence of class labels drawn from 27 classes, each named by a three-part combination (e.g., 0-0-0, 0-0-E3). The input_ids and attention_mask features are sequences of int64 values. The data is split into training, validation, and test sets of 120335, 13371, and 14857 examples, occupying 371113140, 41236164, and 45818988 bytes respectively. The download size is 38259040 bytes, and the total dataset size is 458168292 bytes.
Provider:
Dudep
Summary of original information
Dataset overview
Features
- labels:
- Type: sequence
- Class labels:
- 0: 0-0-0
- 1: 0-0-E3
- 2: 0-0-M3
- 3: 0-E2-0
- 4: 0-E2-E3
- 5: 0-E2-M3
- 6: 0-M2-0
- 7: 0-M2-E3
- 8: 0-M2-M3
- 9: E1-0-0
- 10: E1-0-E3
- 11: E1-0-M3
- 12: E1-E2-0
- 13: E1-E2-E3
- 14: E1-E2-M3
- 15: E1-M2-0
- 16: E1-M2-E3
- 17: E1-M2-M3
- 18: M1-0-0
- 19: M1-0-E3
- 20: M1-0-M3
- 21: M1-E2-0
- 22: M1-E2-E3
- 23: M1-E2-M3
- 24: M1-M2-0
- 25: M1-M2-E3
- 26: M1-M2-M3
- input_ids:
- Type: sequence
- Data type: int64
- attention_mask:
- Type: sequence
- Data type: int64
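The 27 class ids listed above appear to follow a base-3 ordering over the three positions: the first part contributes 9 × its index, the second 3 ×, and the third 1 ×. A minimal sketch of that mapping follows. The helper names `id_to_label` and `label_to_id` are illustrative and not part of the dataset, and the ordering is read off the list itself rather than documented by the provider.

```python
# Per-position tag vocabularies, read off the 27 class names in the card.
# The meaning of the E*/M* tags is not stated in the card.
POSITION_TAGS = (
    ("0", "E1", "M1"),
    ("0", "E2", "M2"),
    ("0", "E3", "M3"),
)
WEIGHTS = (9, 3, 1)  # base-3 place values, assumed from the listed ordering


def id_to_label(label_id: int) -> str:
    """Map a class id (0..26) to its three-part name."""
    if not 0 <= label_id < 27:
        raise ValueError(f"label id out of range: {label_id}")
    first, rest = divmod(label_id, 9)
    second, third = divmod(rest, 3)
    return "-".join(
        (POSITION_TAGS[0][first], POSITION_TAGS[1][second], POSITION_TAGS[2][third])
    )


def label_to_id(label: str) -> int:
    """Inverse of id_to_label; raises ValueError on unknown parts."""
    parts = label.split("-")
    if len(parts) != 3:
        raise ValueError(f"expected three parts, got: {label!r}")
    return sum(
        tags.index(part) * weight
        for tags, part, weight in zip(POSITION_TAGS, parts, WEIGHTS)
    )


# Full id -> name table, e.g. for decoding model predictions.
ID2LABEL = {i: id_to_label(i) for i in range(27)}
```

For instance, `id_to_label(13)` gives `E1-E2-E3` and `label_to_id("M1-E2-0")` gives 21, matching entries 13 and 21 in the list.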
Data splits
- train:
- Number of examples: 120335
- Size in bytes: 371113140
- validation:
- Number of examples: 13371
- Size in bytes: 41236164
- test:
- Number of examples: 14857
- Size in bytes: 45818988
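The split proportions are not stated in the card, but they can be derived from the example counts. A small sketch using only the numbers listed above:

```python
# Example counts per split, copied from the card.
SPLIT_EXAMPLES = {"train": 120335, "validation": 13371, "test": 14857}

total_examples = sum(SPLIT_EXAMPLES.values())

# Fraction of all examples that falls into each split.
split_fractions = {
    name: count / total_examples for name, count in SPLIT_EXAMPLES.items()
}

for name, count in SPLIT_EXAMPLES.items():
    print(f"{name:<10} {count:>7} {split_fractions[name]:.1%}")
```

The counts sum to 148563 examples, which works out to roughly an 81% / 9% / 10% train / validation / test split.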
Dataset size
- Download size: 38259040 bytes
- Total dataset size: 458168292 bytes
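The reported sizes can be cross-checked against the per-split figures. The sketch below uses only numbers from the card. The variable names are illustrative, and the reading of the results is an inference rather than something the provider states.

```python
# Figures copied from the card.
SPLIT_BYTES = {"train": 371113140, "validation": 41236164, "test": 45818988}
SPLIT_EXAMPLES = {"train": 120335, "validation": 13371, "test": 14857}
DATASET_SIZE = 458168292   # total dataset size in bytes
DOWNLOAD_SIZE = 38259040   # download size in bytes

# The per-split sizes should add up to the reported total.
sizes_consistent = sum(SPLIT_BYTES.values()) == DATASET_SIZE

# Whole bytes per example and remainder, per split.
bytes_per_example = {
    name: divmod(SPLIT_BYTES[name], SPLIT_EXAMPLES[name]) for name in SPLIT_BYTES
}

# How much larger the dataset is once loaded than as downloaded.
expansion_ratio = DATASET_SIZE / DOWNLOAD_SIZE

print(sizes_consistent, bytes_per_example, round(expansion_ratio, 2))
```

The split sizes sum exactly to the reported total, and every split comes to exactly 3084 bytes per example with no remainder. A constant per-example size suggests fixed-length sequences, though the card does not state the sequence length. The total size is about 12 times the download size.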
Configuration
- config_name: default
- Data file paths:
- train: data/train-*
- validation: data/validation-*
- test: data/test-*
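The configuration above maps each split to a glob pattern over the repository's data files. The sketch below approximates that mapping with Python's `fnmatch` and shows one way to load a split through the third-party `datasets` library. The shard file name in the usage note is hypothetical, `split_for_file` and `load_split` are illustrative helpers, and the actual pattern resolution is done by the Hugging Face tooling and may differ in detail.

```python
from fnmatch import fnmatch
from typing import Optional

REPO_ID = "Dudep/types-xlnet-base-cased"

# Split -> glob pattern, copied from the "default" config in the card.
DATA_FILES = {
    "train": "data/train-*",
    "validation": "data/validation-*",
    "test": "data/test-*",
}


def split_for_file(path: str) -> Optional[str]:
    """Return the split whose pattern matches `path`, or None if none does."""
    for split, pattern in DATA_FILES.items():
        if fnmatch(path, pattern):
            return split
    return None


def load_split(split: str):
    """Load one split from the Hub.

    Assumes the third-party `datasets` package is installed and that the
    repository is reachable; neither is needed for split_for_file above.
    """
    from datasets import load_dataset

    return load_dataset(REPO_ID, name="default", split=split)
```

With a hypothetical shard name such as `data/train-00000-of-00001.parquet`, `split_for_file` returns `train`. A path outside `data/` returns `None`.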



