Instruction-Following Test Samples
收藏arXiv2025-09-30 收录
下载链接:
https://github.com/Junjie-Ye/MulDimIF
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含了1,200个可验证代码的指令跟随测试样本,这些样本是通过自动指令生成管道生成的。此外,该数据集涵盖了多种约束模式和难度级别,旨在评估不同大型语言模型(LLM)的性能。规模上,该数据集共有1,200个样本,任务是对指令跟随能力进行评估。
This dataset contains 1,200 instruction-following test samples with verifiable code, which are generated via an automated instruction generation pipeline. Moreover, this dataset covers multiple constraint patterns and difficulty levels, aiming to evaluate the performance of different Large Language Models (LLMs). In terms of scale, this dataset has a total of 1,200 samples, and its task is to evaluate the instruction-following capability of models.



