JT-LM/CCR-Bench

Name: JT-LM/CCR-Bench
Creator: JT-LM
Published: 2026-03-10 01:49:17
License: 暂无描述

Hugging Face2026-03-10 更新2025-08-09 收录

下载链接：

https://hf-mirror.com/datasets/JT-LM/CCR-Bench

下载链接

链接失效反馈

官方服务：

资源简介：

CCR-Bench是一个用于评估大型语言模型（LLM）遵循复杂指令能力的基准数据集。它包含174个测试案例，分为三个核心部分：复杂内容格式约束、逻辑工作流控制和工业场景应用。数据集旨在评估LLM在实际工业部署条件下的实用性和鲁棒性。

CCR-Bench is designed to assess LLMs ability to follow complex instructions through a progressive and multi-dimensional lens. The dataset contains 174 test cases and comprises three core components: Complex Content-Format Constraints, Logical Workflow Control, and Industrial Scenario Application. The goal is to evaluate the practical utility and robustness of LLMs under conditions that approximate real-world industrial deployments.

提供机构：

JT-LM

5,000+

优质数据集

54 个

任务类型

进入经典数据集