jackboyla/zsre_grow

Name: jackboyla/zsre_grow
Creator: jackboyla
Published: 2024-06-03 10:57:57
License: 暂无描述

Hugging Face2024-06-03 更新2024-06-12 收录

下载链接：

https://hf-mirror.com/datasets/jackboyla/zsre_grow

下载链接

链接失效反馈

官方服务：

资源简介：

该数据集包含多个特征，如id、文本、分词后的文本、模型名称、指令、实体、生成内容、命名实体识别结果等。数据集被分为一个训练集，包含28,213个样本，总大小为877,097,759字节。数据集的下载大小为121,534,084字节，配置信息中指定了训练集的文件路径。

The dataset includes multiple features such as id (int64), text (string sequence), tokenized_text (sequence of string sequences), model_name (string), instruction (string), ents (list of head and tail string sequences), generation (string sequence), ner (sequence of string sequences), etc. The dataset is divided into a training set with 28213 examples. The download size is 121534084 bytes, and the total size is 877097759 bytes. The dataset configuration is default, with data file paths at data/train-*.

提供机构：

jackboyla

原始信息汇总

数据集概述

数据集特征

id: 整数类型 (int64)
text: 字符串序列
tokenized_text: 字符串序列的序列
model_name: 字符串类型 (string)
instruction: 字符串类型 (string)
ents: 列表类型，包含两个子列表：
- head: 字符串序列
- tail: 字符串序列
generation: 字符串序列
ner: 字符串序列的序列
index_level_0: 整数类型 (int64)

数据集分割

train:
- 数据量: 877,097,759 字节
- 示例数量: 28,213

数据集大小

下载大小: 121,534,084 字节
数据集总大小: 877,097,759 字节

配置

config_name: default
data_files:
- split: train
- path: data/train-*

5,000+

优质数据集

54 个

任务类型

进入经典数据集