karuna-bhaila/Unlearning_SQuAD
Hugging Face, updated 2024-07-17, indexed 2024-07-22
Download link:
https://hf-mirror.com/datasets/karuna-bhaila/Unlearning_SQuAD
Resource summary:
The dataset has five fields: id, title, context, question, and answers. The answers field is a sequence with two subfields, text and answer_start. The data is divided into four splits: train_forget (4664 examples), test_forget (1167 examples), train_retain (73870 examples), and test_retain (18468 examples). The download size is 55945597 bytes, and the total dataset size is 89819092.0 bytes.
Provided by:
karuna-bhaila
Summary of original information
Dataset overview
Dataset features
- id: string
- title: string
- context: string
- question: string
- answers: sequence
  - text: string
  - answer_start: integer (int32)
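The field layout above can be illustrated with a small sketch. It assumes, as in SQuAD-style data, that answer_start is a character offset into context and that the answers sequence is represented as a dict of parallel lists. Neither point is stated in the listing. The record and the helper names (`answer_spans`, `spans_match_context`) are invented for illustration and are not taken from the dataset.

```python
from typing import Dict, List, Tuple


def answer_spans(record: Dict) -> List[Tuple[int, int, str]]:
    """Return (start, end, text) for every answer in a SQuAD-style record."""
    answers = record["answers"]
    return [
        (start, start + len(text), text)
        for text, start in zip(answers["text"], answers["answer_start"])
    ]


def spans_match_context(record: Dict) -> bool:
    """Check that each answer text appears in context at its stated offset."""
    context = record["context"]
    return all(context[s:e] == t for s, e, t in answer_spans(record))


# Hypothetical record with the five listed fields (not real dataset content).
CONTEXT = "The quick brown fox jumps over the lazy dog."
ANSWER = "the lazy dog"
example = {
    "id": "example-0",
    "title": "Example",
    "context": CONTEXT,
    "question": "What does the fox jump over?",
    # Assumption: answer_start is a character offset into context.
    "answers": {"text": [ANSWER], "answer_start": [CONTEXT.index(ANSWER)]},
}

print(answer_spans(example))
print(spans_match_context(example))
```

A check like `spans_match_context` is a cheap way to confirm the offset assumption on real records before relying on it.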
Dataset splits
- train_forget:
  - Bytes: 4267296.652588903
  - Examples: 4664
- test_forget:
  - Bytes: 1067739.1066833725
  - Examples: 1167
- train_retain:
  - Bytes: 67586879.01516773
  - Examples: 73870
- test_retain:
  - Bytes: 16897177.225560002
  - Examples: 18468
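The relative sizes of the splits are easier to see with a little arithmetic. The sketch below uses only the byte and example counts listed above. The variable names are illustrative, and grouping the splits into a "forget" set and a "retain" set is inferred from the split names rather than documented in the listing.

```python
# Byte and example counts copied from the listing above.
SPLITS = {
    "train_forget": {"bytes": 4267296.652588903, "examples": 4664},
    "test_forget": {"bytes": 1067739.1066833725, "examples": 1167},
    "train_retain": {"bytes": 67586879.01516773, "examples": 73870},
    "test_retain": {"bytes": 16897177.225560002, "examples": 18468},
}

total_examples = sum(s["examples"] for s in SPLITS.values())
total_bytes = sum(s["bytes"] for s in SPLITS.values())

# Assumption: "*_forget" splits form the forget set, "*_retain" the retain set.
forget_examples = sum(
    s["examples"] for name, s in SPLITS.items() if name.endswith("_forget")
)
forget_fraction = forget_examples / total_examples

print(f"total examples: {total_examples}")
print(f"total bytes:    {total_bytes:.1f}")
print(f"forget share:   {forget_fraction:.2%}")
for name, s in SPLITS.items():
    print(f"{name}: {s['bytes'] / s['examples']:.1f} bytes per example")
```

The fractional per-split byte counts sum to the listed total of 89819092.0 bytes, and the bytes-per-example figure comes out the same for every split, which suggests the per-split sizes are proportional estimates rather than measured file sizes.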
Dataset size
- Download size: 55945597 bytes
- Total dataset size: 89819092.0 bytes
Configuration
- config_name: default
- data_files:
  - train_forget: data/train_forget-*
  - test_forget: data/test_forget-*
  - train_retain: data/train_retain-*
  - test_retain: data/test_retain-*
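Given the default configuration and the four named splits, one plausible way to load the data is through the Hugging Face `datasets` library. This is a minimal sketch: it assumes `datasets` is installed, that the repository is reachable on the Hub under the ID shown in the page title, and that the split names resolve as listed. The wrapper `load_split` is a hypothetical helper, not part of any library.

```python
REPO_ID = "karuna-bhaila/Unlearning_SQuAD"
SPLIT_NAMES = ("train_forget", "test_forget", "train_retain", "test_retain")


def load_split(split: str):
    """Load one of the four listed splits from the Hub (needs network access)."""
    if split not in SPLIT_NAMES:
        raise ValueError(f"unknown split {split!r}; expected one of {SPLIT_NAMES}")
    # Imported lazily so the module can be imported without `datasets` installed.
    from datasets import load_dataset

    return load_dataset(REPO_ID, split=split)
```

Calling `load_split("train_forget")` should return a dataset whose length can be compared against the 4664 examples listed for that split. Validating the split name first keeps a typo from turning into a slower, less obvious error from the Hub.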



