UltraEditBench

Source: arXiv, indexed 2025-09-30
Download link:
https://huggingface.co/datasets/XiaojieGu/UltraEditBench
Description:
UltraEditBench is built from entity-relation-object triples in the Wikidata5M knowledge base and contains more than 2 million complete edit pairs. The pairs are designed to evaluate lifelong model editing methods on accuracy, generalization ability, and safety. Each sample belongs to one of three types (edit instances, equivalent instances, and unrelated instances), so that every key evaluation dimension is covered.
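
To make the three sample types concrete, here is a minimal sketch of how a benchmark organized this way could be scored. It is an illustration under stated assumptions, not the dataset's actual schema or official evaluation code: the `EditRecord` field names, the `score_by_sample_type` helper, the exact-match criterion, and the toy lookup model are all hypothetical.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass(frozen=True)
class EditRecord:
    # Hypothetical field names; the real dataset columns may differ.
    edit_prompt: str        # edit instance: the prompt whose answer is edited
    target: str             # new answer the edited model should produce
    equivalent_prompt: str  # equivalent instance: a rephrasing of the edit
    unrelated_prompt: str   # unrelated instance: should be unaffected
    unrelated_answer: str   # answer the model should keep giving


def score_by_sample_type(
    model: Callable[[str], str], records: list[EditRecord]
) -> dict[str, float]:
    """Return the exact-match rate for each of the three sample types."""
    if not records:
        raise ValueError("records must not be empty")
    n = len(records)
    edit_hits = sum(model(r.edit_prompt) == r.target for r in records)
    equiv_hits = sum(model(r.equivalent_prompt) == r.target for r in records)
    unrel_hits = sum(model(r.unrelated_prompt) == r.unrelated_answer for r in records)
    return {
        "edit": edit_hits / n,
        "equivalent": equiv_hits / n,
        "unrelated": unrel_hits / n,
    }


# Toy data with placeholder entities (not taken from the dataset).
records = [
    EditRecord("Capital of X?", "A", "X's capital is?", "Capital of Y?", "B"),
    EditRecord("Capital of Z?", "C", "Z's capital is?", "Capital of W?", "D"),
]

# A stand-in "edited model": a lookup table that misses one rephrasing.
answers = {
    "Capital of X?": "A",
    "X's capital is?": "A",
    "Capital of Y?": "B",
    "Capital of Z?": "C",
    "Capital of W?": "D",
}


def toy_model(prompt: str) -> str:
    return answers.get(prompt, "")


result = score_by_sample_type(toy_model, records)
print(result)
```

In this toy run the model answers both edit prompts and both unrelated prompts correctly but misses one rephrasing, so only the equivalent-instance score drops. Reporting the three types separately shows whether an edit took effect, whether it carries over to rephrasings, and whether unrelated knowledge was left intact.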