five

alon-albalak/llama-33-70b-noveltybench-comprehensive-summary

收藏
Hugging Face2025-12-17 更新2025-12-20 收录
下载链接:
https://hf-mirror.com/datasets/alon-albalak/llama-33-70b-noveltybench-comprehensive-summary
下载链接
链接失效反馈
官方服务:
资源简介:
--- dataset_info: features: - name: mean_distinct dtype: float64 - name: mean_utility dtype: float64 - name: total_instances dtype: int64 - name: total_completions dtype: int64 - name: mean_partition_score dtype: float64 - name: std_partition_score dtype: float64 - name: mean_intra_diversity dtype: float64 - name: std_intra_diversity dtype: float64 - name: median_intra_diversity dtype: float64 - name: min_intra_diversity dtype: float64 - name: max_intra_diversity dtype: float64 - name: mean_group_size dtype: float64 - name: total_groups dtype: int64 - name: total_pairs_computed dtype: int64 - name: mean_reward dtype: float64 - name: std_reward dtype: float64 - name: median_reward dtype: float64 - name: min_mean_reward dtype: float64 - name: max_mean_reward dtype: float64 - name: global_mean_reward dtype: float64 - name: global_std_reward dtype: float64 - name: global_min_reward dtype: float64 - name: global_max_reward dtype: float64 - name: mean_judge_score dtype: float64 - name: std_judge_score dtype: float64 - name: median_judge_score dtype: float64 - name: min_mean_judge_score dtype: float64 - name: max_mean_judge_score dtype: float64 - name: global_mean_judge_score dtype: float64 - name: global_std_judge_score dtype: float64 - name: global_min_judge_score dtype: float64 - name: global_max_judge_score dtype: float64 - name: score_parsing_success_rate dtype: float64 - name: total_valid_scores dtype: int64 splits: - name: train num_bytes: 272 num_examples: 1 download_size: 16637 dataset_size: 272 configs: - config_name: default data_files: - split: train path: data/train-* ---

数据集信息: 特征: - 名称:平均独特性(mean_distinct),数据类型:float64 - 名称:平均效用(mean_utility),数据类型:float64 - 名称:总实例数(total_instances),数据类型:int64 - 名称:总完成数(total_completions),数据类型:int64 - 名称:平均划分得分(mean_partition_score),数据类型:float64 - 名称:划分得分标准差(std_partition_score),数据类型:float64 - 名称:平均组内多样性(mean_intra_diversity),数据类型:float64 - 名称:组内多样性标准差(std_intra_diversity),数据类型:float64 - 名称:组内多样性中位数(median_intra_diversity),数据类型:float64 - 名称:组内多样性最小值(min_intra_diversity),数据类型:float64 - 名称:组内多样性最大值(max_intra_diversity),数据类型:float64 - 名称:平均组大小(mean_group_size),数据类型:float64 - 名称:总组数(total_groups),数据类型:int64 - 名称:已计算总对数(total_pairs_computed),数据类型:int64 - 名称:平均奖励(mean_reward),数据类型:float64 - 名称:奖励标准差(std_reward),数据类型:float64 - 名称:奖励中位数(median_reward),数据类型:float64 - 名称:最小平均奖励(min_mean_reward),数据类型:float64 - 名称:最大平均奖励(max_mean_reward),数据类型:float64 - 名称:全局平均奖励(global_mean_reward),数据类型:float64 - 名称:全局奖励标准差(global_std_reward),数据类型:float64 - 名称:全局最小奖励(global_min_reward),数据类型:float64 - 名称:全局最大奖励(global_max_reward),数据类型:float64 - 名称:平均评估得分(mean_judge_score),数据类型:float64 - 名称:评估得分标准差(std_judge_score),数据类型:float64 - 名称:评估得分中位数(median_judge_score),数据类型:float64 - 名称:最小平均评估得分(min_mean_judge_score),数据类型:float64 - 名称:最大平均评估得分(max_mean_judge_score),数据类型:float64 - 名称:全局平均评估得分(global_mean_judge_score),数据类型:float64 - 名称:全局评估得分标准差(global_std_judge_score),数据类型:float64 - 名称:全局最小评估得分(global_min_judge_score),数据类型:float64 - 名称:全局最大评估得分(global_max_judge_score),数据类型:float64 - 名称:得分解析成功率(score_parsing_success_rate),数据类型:float64 - 名称:有效总得分(total_valid_scores),数据类型:int64 数据集划分: - 名称:训练集(train),字节占用量:272,样本数量:1 下载大小:16637 数据集总大小:272 配置项: - 配置名称:默认配置(default),数据文件: - 划分:训练集(train),文件路径:data/train-*
提供机构:
alon-albalak
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作