IFEval

arXiv2025-09-30 收录

下载链接：

https://github.com/google-research/google-research/tree/master/instruction_following_eval

下载链接

链接失效反馈

官方服务：

资源简介：

该数据集旨在评估语言模型精确遵循指令的能力。在此评估中，对设备上的AFM模型和服务器上的AFM模型进行了指令遵循能力的测试。这项任务被称为指令遵循评估。

This dataset is designed to evaluate the ability of language models to precisely follow instructions. In this evaluation, the instruction-following capabilities of both on-device AFM models and server-side AFM models are tested. This task is referred to as the instruction-following evaluation.

搜集汇总

背景与挑战

背景概述

IFEval数据集专注于评估语言模型精确遵循指令的能力，通过测试设备上和服务器上的AFM模型来执行指令遵循评估。该数据集旨在衡量模型在指令执行方面的准确性和可靠性。

以上内容由遇见数据集搜集并总结生成

5,000+

优质数据集

54 个

任务类型

进入经典数据集