Sellopale/AdvancedIF
收藏Hugging Face2025-12-15 更新2025-12-20 收录
下载链接:
https://hf-mirror.com/datasets/Sellopale/AdvancedIF
下载链接
链接失效反馈官方服务:
资源简介:
我们介绍了AdvancedIF,这是一个新的基准测试,包含超过1,600个提示和专家设计的评分标准,旨在评估LLMs在以下方面的能力:
* 复杂指令遵循:每个提示包含6个以上的指令,结合了格式、风格、结构、长度、负面约束、拼写和相互条件指令;
* 多轮指令遵循:能够遵循之前传递的指令;
* 系统提示可操控性:能够遵循系统提示中的指令。
We introduce AdvancedIF, a new benchmark featuring over 1,600 prompts and expert-curated rubric designed to assess LLMs proficiency in
* Complex instruction following: each prompt has 6+ instructions with combination of one, format, style, structure, length, negative constraints, spelling, and inter-conditional instructions;
* Multi-turn instruction following: the ability to follow instruction carried from previous;
* System prompt steerability: The ability to follow instructions in the system prompt.
提供机构:
Sellopale



