jackboyla/zsre_grow
收藏Hugging Face2024-06-03 更新2024-06-12 收录
下载链接:
https://hf-mirror.com/datasets/jackboyla/zsre_grow
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含多个特征,如id、文本、分词后的文本、模型名称、指令、实体、生成内容、命名实体识别结果等。数据集被分为一个训练集,包含28,213个样本,总大小为877,097,759字节。数据集的下载大小为121,534,084字节,配置信息中指定了训练集的文件路径。
The dataset includes multiple features such as id (int64), text (string sequence), tokenized_text (sequence of string sequences), model_name (string), instruction (string), ents (list of head and tail string sequences), generation (string sequence), ner (sequence of string sequences), etc. The dataset is divided into a training set with 28213 examples. The download size is 121534084 bytes, and the total size is 877097759 bytes. The dataset configuration is default, with data file paths at data/train-*.
提供机构:
jackboyla
原始信息汇总
数据集概述
数据集特征
- id: 整数类型 (int64)
- text: 字符串序列
- tokenized_text: 字符串序列的序列
- model_name: 字符串类型 (string)
- instruction: 字符串类型 (string)
- ents: 列表类型,包含两个子列表:
- head: 字符串序列
- tail: 字符串序列
- generation: 字符串序列
- ner: 字符串序列的序列
- index_level_0: 整数类型 (int64)
数据集分割
- train:
- 数据量: 877,097,759 字节
- 示例数量: 28,213
数据集大小
- 下载大小: 121,534,084 字节
- 数据集总大小: 877,097,759 字节
配置
- config_name: default
- data_files:
- split: train
- path: data/train-*



