treadon/disinhibition-eval

Name: treadon/disinhibition-eval
Creator: treadon
Published: 2026-04-30 00:36:40
License: Not specified

Source: Hugging Face (updated 2026-04-30, indexed 2026-05-03)

Download link:

https://hf-mirror.com/datasets/treadon/disinhibition-eval

Description:

disinhibition-eval is a small evaluation dataset for measuring hedging vs. commitment behavior in chat-tuned language models. The dataset includes various splits to test different aspects of model behavior, such as opinions on contentious questions, factual correctness, explicit neutrality requests, general coherence, and edge cases where hedging is appropriate. The dataset is intended to be used with a hedge/commit detector to compare baseline and ablated models. The README also provides examples of model responses, the schema of the dataset, intended use cases, and links to related resources and models.
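As a rough illustration of the baseline-vs-ablated comparison described above, the sketch below scores two sets of responses with a toy keyword-based hedge detector. The phrase list, the function names `is_hedge` and `hedge_rate`, and the sample responses are all hypothetical; the detector actually paired with the dataset is not described on this page and may work quite differently.

```python
import re

# Hypothetical hedge markers, chosen only for illustration.
# The detector actually used with the dataset is not specified here.
HEDGE_PATTERNS = [
    r"\bit depends\b",
    r"\bthere are (many|several|different) (views|perspectives|opinions)\b",
    r"\bas an ai\b",
    r"\bi (don't|do not) have (personal )?opinions\b",
    r"\bsome (people )?(argue|believe|say)\b",
]
_HEDGE_RE = re.compile("|".join(HEDGE_PATTERNS), re.IGNORECASE)


def is_hedge(response: str) -> bool:
    """Return True if the response contains any of the hedge markers."""
    return _HEDGE_RE.search(response) is not None


def hedge_rate(responses) -> float:
    """Fraction of responses flagged as hedging (0.0 for an empty list)."""
    if not responses:
        return 0.0
    return sum(is_hedge(r) for r in responses) / len(responses)


# Made-up responses standing in for outputs from two models.
baseline = [
    "It depends on who you ask; there are many perspectives on this.",
    "As an AI, I don't have personal opinions.",
]
ablated = [
    "Yes, I think remote work is better for most teams.",
    "Some argue otherwise, but I'd pick Python.",
]

print(f"baseline hedge rate: {hedge_rate(baseline):.2f}")
print(f"ablated hedge rate:  {hedge_rate(ablated):.2f}")
```

A keyword matcher like this is only a stand-in: it flags the second ablated response as a hedge even though the model commits to an answer, which is the kind of case a more careful detector would need to handle.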

Provider:

treadon
