five

m-a-p/Inverse_IFEval

收藏
Hugging Face2025-09-24 更新2025-09-13 收录
下载链接:
https://hf-mirror.com/datasets/m-a-p/Inverse_IFEval
下载链接
链接失效反馈
官方服务:
资源简介:
Inverse IFEval是一个新颖的基准,旨在评估大型语言模型遵循与标准训练范式故意偏离的反直觉指令的能力。该数据集挑战模型推翻其根深蒂固的训练习惯,忠实执行与标准认知模式或注释规范冲突的指令。数据集包含八种指令类型的高质量问题,覆盖23个知识领域,并支持中文和英文两个版本。

Inverse IFEval is a novel benchmark designed to evaluate large language models (LLMs) ability to follow counterintuitive instructions that deliberately deviate from conventional training paradigms. The dataset challenges models to override their ingrained training conventions and faithfully execute instructions that conflict with standard cognitive patterns or annotation norms. It contains high-quality questions across eight instruction types, covering 23 knowledge domains, and supports both Chinese and English versions.
提供机构:
m-a-p
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作