alon-albalak/qwen-235b-a22b-noveltybench-comprehensive-summary-judgev2
收藏Hugging Face2025-12-16 更新2025-12-20 收录
下载链接:
https://hf-mirror.com/datasets/alon-albalak/qwen-235b-a22b-noveltybench-comprehensive-summary-judgev2
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
features:
- name: mean_distinct
dtype: float64
- name: mean_utility
dtype: float64
- name: total_instances
dtype: int64
- name: total_completions
dtype: int64
- name: mean_partition_score
dtype: float64
- name: std_partition_score
dtype: float64
- name: mean_intra_diversity
dtype: float64
- name: std_intra_diversity
dtype: float64
- name: median_intra_diversity
dtype: float64
- name: min_intra_diversity
dtype: float64
- name: max_intra_diversity
dtype: float64
- name: mean_group_size
dtype: float64
- name: total_groups
dtype: int64
- name: total_pairs_computed
dtype: int64
- name: mean_reward
dtype: float64
- name: std_reward
dtype: float64
- name: median_reward
dtype: float64
- name: min_mean_reward
dtype: float64
- name: max_mean_reward
dtype: float64
- name: global_mean_reward
dtype: float64
- name: global_std_reward
dtype: float64
- name: global_min_reward
dtype: float64
- name: global_max_reward
dtype: float64
- name: mean_judge_score
dtype: float64
- name: std_judge_score
dtype: float64
- name: median_judge_score
dtype: float64
- name: min_mean_judge_score
dtype: float64
- name: max_mean_judge_score
dtype: float64
- name: global_mean_judge_score
dtype: float64
- name: global_std_judge_score
dtype: float64
- name: global_min_judge_score
dtype: float64
- name: global_max_judge_score
dtype: float64
- name: score_parsing_success_rate
dtype: float64
- name: total_valid_scores
dtype: int64
splits:
- name: train
num_bytes: 272
num_examples: 1
download_size: 16637
dataset_size: 272
configs:
- config_name: default
data_files:
- split: train
path: data/train-*
---
数据集信息:
特征字段:
- 名称:平均独特度(mean_distinct),数据类型:float64
- 名称:平均效用值(mean_utility),数据类型:float64
- 名称:总实例数(total_instances),数据类型:int64
- 名称:总完成数(total_completions),数据类型:int64
- 名称:平均划分得分(mean_partition_score),数据类型:float64
- 名称:划分得分标准差(std_partition_score),数据类型:float64
- 名称:平均组内多样性(mean_intra_diversity),数据类型:float64
- 名称:组内多样性标准差(std_intra_diversity),数据类型:float64
- 名称:组内多样性中位数(median_intra_diversity),数据类型:float64
- 名称:组内多样性最小值(min_intra_diversity),数据类型:float64
- 名称:组内多样性最大值(max_intra_diversity),数据类型:float64
- 名称:平均组规模(mean_group_size),数据类型:float64
- 名称:总组数(total_groups),数据类型:int64
- 名称:已计算总对数(total_pairs_computed),数据类型:int64
- 名称:平均奖励值(mean_reward),数据类型:float64
- 名称:奖励值标准差(std_reward),数据类型:float64
- 名称:奖励值中位数(median_reward),数据类型:float64
- 名称:最小平均奖励值(min_mean_reward),数据类型:float64
- 名称:最大平均奖励值(max_mean_reward),数据类型:float64
- 名称:全局平均奖励值(global_mean_reward),数据类型:float64
- 名称:全局奖励值标准差(global_std_reward),数据类型:float64
- 名称:全局最小奖励值(global_min_reward),数据类型:float64
- 名称:全局最大奖励值(global_max_reward),数据类型:float64
- 名称:平均评分得分(mean_judge_score),数据类型:float64
- 名称:评分得分标准差(std_judge_score),数据类型:float64
- 名称:评分得分中位数(median_judge_score),数据类型:float64
- 名称:最小平均评分得分(min_mean_judge_score),数据类型:float64
- 名称:最大平均评分得分(max_mean_judge_score),数据类型:float64
- 名称:全局平均评分得分(global_mean_judge_score),数据类型:float64
- 名称:全局评分得分标准差(global_std_judge_score),数据类型:float64
- 名称:全局最小评分得分(global_min_judge_score),数据类型:float64
- 名称:全局最大评分得分(global_max_judge_score),数据类型:float64
- 名称:得分解析成功率(score_parsing_success_rate),数据类型:float64
- 名称:有效总得分数量(total_valid_scores),数据类型:int64
数据划分:
- 名称:train(训练集),字节数:272,样本数:1
下载大小:16637
数据集大小:272
配置项:
- 配置名称:default(默认配置),数据文件:
- 划分:train(训练集),路径:data/train-*
提供机构:
alon-albalak



