TOFU

Source: arXiv (indexed 2025-09-30)

Download link:

https://locuslab.github.io/tofu/

Description:

TOFU is a collection of synthetic question-answering datasets built from author biographies, designed to test how well large language models (LLMs) can unlearn information. It measures both what a model forgets and what it retains, using evaluation metrics that include forget quality and model utility. The tasks covered by this dataset include entity forgetting, harmful knowledge forgetting, and copyrighted content forgetting.
