kargaranamir/coercion_cross

Name: kargaranamir/coercion_cross
Creator: kargaranamir
Published: 2026-04-26 22:06:43
License: No description available

Source: Hugging Face | Updated: 2026-04-26 | Indexed: 2026-04-26

Download link:

https://hf-mirror.com/datasets/kargaranamir/coercion_cross

Resource overview:

The dataset, titled Cross-Model Coercion — mmlu, studies how reasoning produced by gpt-5.1 can be used to challenge the meta-llama/Llama-3.3-70B-Instruct model. It is derived from the mmlu source data and contains 2,052 rows, of which 1,744 are eligible (correct + coercible). The key result is a Cross-BCR (cross-model Belief Collapse Rate) of 93.8%. The schema records the original ID, the coercion model, the baseline model, the coercion attribution, the coercion model's reasoning and results, the baseline model's reasoning and responses, whether coercion succeeded, and belief collapse information.
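As a rough illustration of how a rate such as Cross-BCR could be recomputed from per-row records, the sketch below assumes that the rate is the share of eligible rows in which the baseline model's belief collapsed. That definition is an assumption, not something the listing states, and the field names `eligible` and `belief_collapsed` are hypothetical placeholders rather than the dataset's actual column names.

```python
from typing import Iterable, Mapping


def belief_collapse_rate(
    rows: Iterable[Mapping[str, bool]],
    eligible_key: str = "eligible",           # hypothetical column name
    collapsed_key: str = "belief_collapsed",  # hypothetical column name
) -> float:
    """Return the fraction of eligible rows whose belief collapsed.

    Assumed definition: collapsed eligible rows / all eligible rows.
    """
    eligible_rows = [row for row in rows if row[eligible_key]]
    if not eligible_rows:
        raise ValueError("no eligible rows to compute a rate from")
    collapsed = sum(1 for row in eligible_rows if row[collapsed_key])
    return collapsed / len(eligible_rows)


# Toy records for illustration only; these are not taken from the dataset.
toy_rows = [
    {"eligible": True, "belief_collapsed": True},
    {"eligible": True, "belief_collapsed": True},
    {"eligible": True, "belief_collapsed": False},
    {"eligible": False, "belief_collapsed": False},
]

toy_rate = belief_collapse_rate(toy_rows)

# Share of eligible rows, using the counts quoted in the overview.
eligible_share = 1744 / 2052
```

On the toy records the function counts two collapsed rows out of three eligible ones. The last line uses the counts from the overview: 1,744 of 2,052 rows, or about 85%, are eligible.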

Provider:

kargaranamir
