chan030609/MUSE-Books-2
收藏Hugging Face2024-06-15 更新2024-06-29 收录
下载链接:
https://hf-mirror.com/datasets/chan030609/MUSE-Books-2
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含三个配置:knowmem、main和train。knowmem配置包含问题和答案的特征,分为retain2_qa_icl、forget_qa_icl、retain2_qa和forget_qa四个分割。main和train配置包含文本特征,分为forget、holdout、retain2和retain1等分割。每个分割都有对应的字节数和示例数。
The dataset contains three configurations: knowmem, main, and train. The knowmem configuration includes features for questions and answers, divided into four splits: retain2_qa_icl, forget_qa_icl, retain2_qa, and forget_qa. The main and train configurations include text features, divided into splits such as forget, holdout, retain2, and retain1. Each split has corresponding byte sizes and example counts.
提供机构:
chan030609
原始信息汇总
数据集概述
配置信息
配置名称:knowmem
- 特征:
answer:字符串类型question:字符串类型
- 分割:
retain2_qa_icl:1143字节,10个样本forget_qa_icl:1033字节,10个样本retain2_qa:9398字节,100个样本forget_qa:9896字节,100个样本
- 下载大小:21229字节
- 数据集大小:21470字节
配置名称:main
- 特征:
text:字符串类型
- 分割:
forget:4096855字节,4个样本holdout:2328993字节,3个样本retain2:1969626字节,13个样本retain1:836924字节,12个样本
- 下载大小:5386338字节
- 数据集大小:9232398字节
配置名称:train
- 特征:
text:字符串类型
- 分割:
forget:4096855字节,4个样本retain2:1969626字节,13个样本retain1:836924字节,12个样本
- 下载大小:3997041字节
- 数据集大小:6903405字节
数据文件路径
配置名称:knowmem
retain2_qa_icl:knowmem/retain2_qa_icl-*forget_qa_icl:knowmem/forget_qa_icl-*retain2_qa:knowmem/retain2_qa-*forget_qa:knowmem/forget_qa-*
配置名称:main
forget:main/forget-*holdout:main/holdout-*retain2:main/retain2-*retain1:main/retain1-*
配置名称:train
forget:train/forget-*retain2:train/retain2-*retain1:train/retain1-*



