
THU-KEG/IFBench

Source: Hugging Face. Updated: 2025-03-07. Indexed: 2025-04-12.
Download link:
https://hf-mirror.com/datasets/THU-KEG/IFBench
Description:
IFBench is a benchmark dataset for evaluating instruction-following reward models. It includes a unique identifier, the source dataset, the original and augmented instructions, the chosen and rejected responses, and constraints that require either LLM-based or code-based verification. The dataset accompanies the paper "Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems".
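As a rough illustration of how chosen/rejected pairs like these are typically used, the sketch below scores a reward function by pairwise accuracy: the fraction of pairs in which the chosen response receives a strictly higher reward than the rejected one. The field names (`id`, `instruction`, `chosen`, `rejected`), the helper `pairwise_accuracy`, and the toy word-limit reward are assumptions made for this example; they are not taken from the dataset's actual schema or from the paper's evaluation code.

```python
from typing import Callable, Iterable, TypedDict


class PreferencePair(TypedDict):
    # Hypothetical field names; the real dataset columns may differ.
    id: str
    instruction: str
    chosen: str
    rejected: str


RewardFn = Callable[[str, str], float]


def pairwise_accuracy(pairs: Iterable[PreferencePair], reward_fn: RewardFn) -> float:
    """Fraction of pairs where the chosen response scores strictly higher."""
    pairs = list(pairs)
    if not pairs:
        raise ValueError("need at least one preference pair")
    correct = sum(
        reward_fn(p["instruction"], p["chosen"]) > reward_fn(p["instruction"], p["rejected"])
        for p in pairs
    )
    return correct / len(pairs)


def toy_word_limit_reward(instruction: str, response: str) -> float:
    # Toy stand-in for a code-verifiable constraint: reward 1.0 when the
    # response has at most five words, otherwise 0.0.
    return 1.0 if len(response.split()) <= 5 else 0.0


example_pairs: list[PreferencePair] = [
    {
        "id": "demo-1",
        "instruction": "Answer in at most five words.",
        "chosen": "Paris is the capital.",
        "rejected": "The capital city of France is, of course, Paris.",
    },
    {
        "id": "demo-2",
        "instruction": "Answer in at most five words.",
        "chosen": "It is blue.",
        "rejected": "Blue, mostly.",  # also satisfies the limit, so the toy reward ties
    },
]

accuracy = pairwise_accuracy(example_pairs, toy_word_limit_reward)
```

Ties count as errors here because a reward model that cannot separate the two responses has not identified the preferred one; a different tie policy would be an equally reasonable choice.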
Provider:
THU-KEG