alon-albalak/qwen-235b-a22b-noveltybench-comprehensive-evaluation-judgev2
收藏Hugging Face2025-12-16 更新2025-12-20 收录
下载链接:
https://hf-mirror.com/datasets/alon-albalak/qwen-235b-a22b-noveltybench-comprehensive-evaluation-judgev2
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含多个特征,如ID、提示(prompt)、消息(messages)、完成项(completions)、分区(partition)等。消息部分包含内容和角色,完成项是字符串列表。数据集还包括各种距离和奖励分数的统计信息,如平均、标准差、最小和最大值。此外,还包含法官响应、法官思考、法官评分等相关信息。数据集分为训练集,包含100个示例。
The dataset includes various features such as ID, prompt, messages, completions, partition, etc. The messages part contains content and role, and completions are a list of strings. The dataset also includes statistical information on various distances and reward scores, such as mean, standard deviation, minimum, and maximum values. Additionally, it contains information related to judge responses, judge thinking, judge scores, etc. The dataset is split into a training set with 100 examples.
提供机构:
alon-albalak



