OALL/details_Ali-C137__F1H10M-0000
收藏Hugging Face2024-06-22 更新2024-06-29 收录
下载链接:
https://hf-mirror.com/datasets/OALL/details_Ali-C137__F1H10M-0000
下载链接
链接失效反馈官方服务:
资源简介:
该数据集是在模型Ali-C137/F1H10M-0000的评估运行期间自动创建的。数据集由136个配置组成,每个配置对应一个评估任务。数据集是从1次运行中生成的,每次运行在每个配置中表示为特定的分割,分割名称使用运行的时间戳。train分割始终指向最新的结果。此外,还有一个results配置,存储了所有运行的聚合结果。README还提供了如何使用`datasets`库中的`load_dataset`函数加载数据集的示例,并包含了特定运行的最新结果。
The dataset was automatically created during the evaluation run of the model Ali-C137/F1H10M-0000. The dataset is composed of 136 configurations, each corresponding to one of the evaluated tasks. The dataset has been created from 1 run, with each run represented as a specific split in each configuration, named using the timestamp of the run. The train split always points to the latest results. Additionally, there is a results configuration that stores all the aggregated results of the run. The README also provides an example of how to load the dataset using the `load_dataset` function from the `datasets` library and includes the latest results from a specific run.
提供机构:
OALL
原始信息汇总
数据集概述
数据集基本信息
- 名称: Evaluation run of Ali-C137/F1H10M-0000
- 来源: 自动创建于模型评估过程中
- 配置数量: 136
- 创建次数: 1次
- 最新结果: 2024-06-22T07:46:22.914375
数据集结构
- 配置: 每个配置对应一个评估任务
- 分割: 每个运行结果作为一个特定的分割,分割名称使用运行的时间戳
- 训练分割: 始终指向最新的结果
- 结果配置: 存储所有运行的聚合结果
加载示例
python from datasets import load_dataset data = load_dataset("OALL/details_Ali-C137__F1H10M-0000", "lighteval_xstory_cloze_ar_0", split="train")
最新结果
-
总体结果:
acc_norm: 0.3212359053263138acc_norm_stderr: 0.03575787720992959acc: 0.4798146922567836acc_stderr: 0.012856635706498289
-
具体任务结果:
community|acva:Algeria|0:acc_norm: 0.558974358974359acc_norm_stderr: 0.0356473293185358
community|acva:Ancient_Egypt|0:acc_norm: 0.0761904761904762acc_norm_stderr: 0.014971893787809661
community|acva:Arab_Empire|0:acc_norm: 0.32075471698113206acc_norm_stderr: 0.028727502957880263
community|acva:Arabic_Architecture|0:acc_norm: 0.46153846153846156acc_norm_stderr: 0.0357915435254457
community|acva:Arabic_Art|0:acc_norm: 0.36923076923076925acc_norm_stderr: 0.03464841141863756
community|acva:Arabic_Astronomy|0:acc_norm: 0.4666666666666667acc_norm_stderr: 0.03581804596782233
community|acva:Arabic_Calligraphy|0:acc_norm: 0.6470588235294118acc_norm_stderr: 0.02998514740090689
community|acva:Arabic_Ceremony|0:acc_norm: 0.518918918918919acc_norm_stderr: 0.036834092970087065
community|acva:Arabic_Clothing|0:acc_norm: 0.5076923076923077acc_norm_stderr: 0.03589365940635212
community|acva:Arabic_Culture|0:acc_norm: 0.24102564102564103acc_norm_stderr: 0.030707489381124217
community|acva:Arabic_Food|0:acc_norm: 0.4512820512820513acc_norm_stderr: 0.03572709860318392
community|acva:Arabic_Funeral|0:acc_norm: 0.4acc_norm_stderr: 0.050529115263991134
community|acva:Arabic_Geography|0:acc_norm: 0.593103448275862acc_norm_stderr: 0.04093793981266237
community|acva:Arabic_History|0:acc_norm: 0.2717948717948718acc_norm_stderr: 0.031940861870257214
community|acva:Arabic_Language_Origin|0:acc_norm: 0.5473684210526316acc_norm_stderr: 0.051339113773544845
community|acva:Arabic_Literature|0:acc_norm: 0.4689655172413793acc_norm_stderr: 0.04158632762097828
community|acva:Arabic_Math|0:acc_norm: 0.3128205128205128acc_norm_stderr: 0.03328755065724854
community|acva:Arabic_Medicine|0:acc_norm: 0.4827586206896552acc_norm_stderr: 0.04164188720169377
community|acva:Arabic_Music|0:acc_norm: 0.2446043165467626acc_norm_stderr: 0.03659146222520568
community|acva:Arabic_Ornament|0:acc_norm: 0.4666666666666667acc_norm_stderr: 0.03581804596782233
community|acva:Arabic_Philosophy|0:acc_norm: 0.5793103448275863acc_norm_stderr: 0.0411391498118926
community|acva:Arabic_Physics_and_Chemistry|0:acc_norm: 0.5282051282051282acc_norm_stderr: 0.03584074674920833
community|acva:Arabic_Wedding|0:acc_norm: 0.4256410256410256acc_norm_stderr: 0.03549871080367708
community|acva:Bahrain|0:acc_norm: 0.3333333333333333acc_norm_stderr: 0.07106690545187012
community|acva:Comoros|0:acc_norm: 0.37777777777777777acc_norm_stderr: 0.07309112127323451
community|acva:Egypt_modern|0:acc_norm: 0.30526315789473685acc_norm_stderr: 0.047498887145627756
community|acva:InfluenceFromAncientEgypt|0:acc_norm: 0.6102564102564103acc_norm_stderr: 0.035014247762563705
community|acva:InfluenceFromByzantium|0:acc_norm: 0.7172413793103448acc_norm_stderr: 0.03752833958003337
community|acva:InfluenceFromChina|0:acc_norm: 0.26666666666666666acc_norm_stderr: 0.0317493043641267
community|acva:InfluenceFromGreece|0:acc_norm: 0.6307692307692307acc_norm_stderr: 0.034648411418637566
community|acva:InfluenceFromIslam|0:acc_norm: 0.3103448275862069acc_norm_stderr: 0.03855289616378947
community|acva:InfluenceFromPersia|0:acc_norm: 0.7028571428571428acc_norm_stderr: 0.03464507889884372
community|acva:InfluenceFromRome|0:acc_norm: 0.5743589743589743acc_norm_stderr: 0.03549871080367708
community|acva:Iraq|0:acc_norm: 0.5176470588235295acc_norm_stderr: 0.05452048340661895
community|acva:Islam_Education|0:acc_norm: 0.47692307692307695acc_norm_stderr: 0.03585965308947409
community|acva:Islam_branches_and_schools|0:acc_norm: 0.4114285714285714acc_norm_stderr: 0.037305441811354055
community|acva:Islamic_law_system|0:acc_norm: 0.38461538461538464acc_norm_stderr: 0.034928969937423046
community|acva:Jordan|0:acc_norm: 0.35555555555555557acc_norm_stderr: 0.07216392363431012
community|acva:Kuwait|0:acc_norm: 0.26666666666666666acc_norm_stderr: 0.06666666666666667
community|acva:Lebanon|0:acc_norm: 0.17777777777777778acc_norm_stderr: 0.05763774795025094
community|acva:Libya|0:acc_norm: 0.4444444444444444acc_norm_stderr: 0.07491109582924914
community|acva:Mauritania|0:acc_norm: 0.4222222222222222acc_norm_stderr: 0.07446027270295805
community|acva:Mesopotamia_civilization|0:acc_norm: 0.5225806451612903acc_norm_stderr: 0.0402500394824441
community|acva:Morocco|0:acc_norm: 0.2222222222222222acc_norm_stderr: 0.06267511942419628
community|acva:Oman|0:acc_norm: 0.17777777777777778acc_norm_stderr: 0.05763774795025094
community|acva:Palestine|0:acc_norm: 0.24705882352941178acc_norm_stderr: 0.047058823529411785
community|acva:Qatar|0:acc_norm: 0.4222222222222222acc_norm_stderr: 0.07446027270295806
community|acva:Saudi_Arabia|0:acc_norm: 0.3282051282051282acc_norm_stderr: 0.03371243782413707
community|acva:Somalia|0:acc_norm: 0.35555555555555557acc_norm_stderr: 0.07216392363431012
community|acva:Sudan|0:acc_norm: 0.35555555555555557acc_norm_stderr: 0.07216392363431012
community|acva:Syria|0:acc_norm: 0.3333333333333333acc_norm_stderr: 0.07106690545



