m-a-p/Inverse_IFEval
Source: Hugging Face. Updated: 2025-09-24. Indexed: 2025-09-13.
Download link:
https://hf-mirror.com/datasets/m-a-p/Inverse_IFEval
Description:
Inverse IFEval is a novel benchmark designed to evaluate the ability of large language models (LLMs) to follow counterintuitive instructions that deliberately deviate from conventional training paradigms. It challenges models to override their ingrained training conventions and faithfully execute instructions that conflict with standard cognitive patterns or annotation norms. The dataset contains high-quality questions across eight instruction types, covers 23 knowledge domains, and is available in both Chinese and English versions.
Provider:
m-a-p



