OALL/details_elmrc__juhaina
Source: Hugging Face. Updated 2024-07-11. Indexed 2024-07-13.
Download link:
https://hf-mirror.com/datasets/OALL/details_elmrc__juhaina
Description:
The dataset was automatically created during the evaluation run of the model elmrc/juhaina. The dataset is composed of 136 configurations, each corresponding to one of the evaluated tasks. The dataset has been created from 1 run, with each run represented as a specific split in each configuration. The train split always points to the latest results. An additional configuration named results stores all the aggregated results of the run. The README also provides an example of how to load the dataset using the Hugging Face datasets library.
Provider:
OALL
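Summary of original information
Dataset overview
Basic information
- Name: Evaluation run of elmrc/juhaina
- Source: created automatically during the evaluation run of the model elmrc/juhaina.
- Number of configurations: 136, each corresponding to one evaluation task.
- Creation: built from 1 run. Each run appears as a specific split in each configuration, and the split is named after the run's timestamp.
- Additional configuration: a configuration named "results" stores the aggregated results of all runs.
Loading example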
```python
from datasets import load_dataset

data = load_dataset(
    "OALL/details_elmrc__juhaina",
    "lighteval_xstory_cloze_ar_0",
    split="train",
)
```
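The aggregated "results" configuration described above can presumably be loaded the same way. The following is a minimal sketch, not taken from the README: `results_request` and `load_latest_results` are hypothetical helper names, and the sketch assumes that the "results" configuration exposes a `train` split pointing to the latest run, as the description states for the other configurations.

```python
REPO_ID = "OALL/details_elmrc__juhaina"


def results_request(repo_id: str = REPO_ID) -> dict:
    """Build keyword arguments for datasets.load_dataset (hypothetical helper).

    Selects the aggregated "results" configuration and the "train" split,
    which is assumed to point to the latest run.
    """
    return {"path": repo_id, "name": "results", "split": "train"}


def load_latest_results(repo_id: str = REPO_ID):
    """Download the aggregated results. Needs network access and `datasets`."""
    # Imported lazily so the helper above can be used without the library.
    from datasets import load_dataset

    return load_dataset(**results_request(repo_id))
```

Calling `load_latest_results()` should return a `Dataset` object. Its exact columns are not documented on this page, so it is worth inspecting the result before relying on any field name.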
Latest results
- Timestamp: 2024-07-11T12:12:19.106154
- Results: evaluation results for multiple tasks, listed below:
- Overall results:
acc_norm: 0.6112715406044172, acc_norm_stderr: 0.02765136964492306, acc: 0.5956320317670417, acc_stderr: 0.012629580396570923
- Per-task results:
community|acva:Algeria|0: acc_norm: 0.9743589743589743, acc_norm_stderr: 0.011348182888535903
community|acva:Ancient_Egypt|0: acc_norm: 0.9936507936507937, acc_norm_stderr: 0.0044824121718997295
community|acva:Arab_Empire|0: acc_norm: 0.8867924528301887, acc_norm_stderr: 0.01950054374082687
community|acva:Arabic_Architecture|0: acc_norm: 0.9128205128205128, acc_norm_stderr: 0.020253448757437547
community|acva:Arabic_Art|0: acc_norm: 0.882051282051282, acc_norm_stderr: 0.02315755291754122
community|acva:Arabic_Astronomy|0: acc_norm: 0.5282051282051282, acc_norm_stderr: 0.03584074674920833
community|acva:Arabic_Calligraphy|0: acc_norm: 0.9372549019607843, acc_norm_stderr: 0.015216049172060217
community|acva:Arabic_Ceremony|0: acc_norm: 0.8972972972972973, acc_norm_stderr: 0.022379490994554444
community|acva:Arabic_Clothing|0: acc_norm: 0.8153846153846154, acc_norm_stderr: 0.027855716655754172
community|acva:Arabic_Culture|0: acc_norm: 0.9743589743589743, acc_norm_stderr: 0.011348182888535905
community|acva:Arabic_Food|0: acc_norm: 0.9641025641025641, acc_norm_stderr: 0.013356493843863415
community|acva:Arabic_Funeral|0: acc_norm: 1.0, acc_norm_stderr: 0.0
community|acva:Arabic_Geography|0: acc_norm: 0.8896551724137931, acc_norm_stderr: 0.026109923428966807
community|acva:Arabic_History|0: acc_norm: 0.8717948717948718, acc_norm_stderr: 0.024002638741003144
community|acva:Arabic_Language_Origin|0: acc_norm: 0.8947368421052632, acc_norm_stderr: 0.031653514053981126
community|acva:Arabic_Literature|0: acc_norm: 0.9793103448275862, acc_norm_stderr: 0.01186193531066109
community|acva:Arabic_Math|0: acc_norm: 1.0, acc_norm_stderr: 0.0
community|acva:Arabic_Medicine|0: acc_norm: 0.9586206896551724, acc_norm_stderr: 0.01659715985999271
community|acva:Arabic_Music|0: acc_norm: 0.8920863309352518, acc_norm_stderr: 0.026412051088626296
community|acva:Arabic_Ornament|0: acc_norm: 0.9435897435897436, acc_norm_stderr: 0.016564173764014367
community|acva:Arabic_Philosophy|0: acc_norm: 0.9862068965517241, acc_norm_stderr: 0.009719272715682624
community|acva:Arabic_Physics_and_Chemistry|0: acc_norm: 0.9846153846153847, acc_norm_stderr: 0.008836408106064468
community|acva:Arabic_Wedding|0: acc_norm: 0.9487179487179487, acc_norm_stderr: 0.015836178483478878
community|acva:Bahrain|0: acc_norm: 0.9777777777777777, acc_norm_stderr: 0.022222222222222227
community|acva:Comoros|0: acc_norm: 0.9111111111111111, acc_norm_stderr: 0.04290254662948545
community|acva:Egypt_modern|0: acc_norm: 0.9368421052631579, acc_norm_stderr: 0.02508898526421084
community|acva:InfluenceFromAncientEgypt|0: acc_norm: 0.9794871794871794, acc_norm_stderr: 0.010176799141181117
community|acva:InfluenceFromByzantium|0: acc_norm: 0.9172413793103448, acc_norm_stderr: 0.022959752132687576
community|acva:InfluenceFromChina|0: acc_norm: 0.9333333333333333, acc_norm_stderr: 0.01790902298391122
community|acva:InfluenceFromGreece|0: acc_norm: 0.958974358974359, acc_norm_stderr: 0.014240666649386528
community|acva:InfluenceFromIslam|0: acc_norm: 0.9517241379310345, acc_norm_stderr: 0.01786237961829886
community|acva:InfluenceFromPersia|0: acc_norm: 0.9885714285714285, acc_norm_stderr: 0.008057964997824758
community|acva:InfluenceFromRome|0: acc_norm: 0.9128205128205128, acc_norm_stderr: 0.020253448757437547
community|acva:Iraq|0: acc_norm: 0.9294117647058824, acc_norm_stderr: 0.027946704450951116
community|acva:Islam_Education|0: acc_norm: 0.9743589743589743, acc_norm_stderr: 0.011348182888535905
community|acva:Islam_branches_and_schools|0: acc_norm: 0.8857142857142857, acc_norm_stderr: 0.024119492974684464
community|acva:Islamic_law_system|0: acc_norm: 0.9333333333333333, acc_norm_stderr: 0.017909022983911196
community|acva:Jordan|0: acc_norm: 0.9111111111111111, acc_norm_stderr: 0.04290254662948542
community|acva:Kuwait|0: acc_norm: 0.9555555555555556, acc_norm_stderr: 0.031067790907534743
community|acva:Lebanon|0: acc_norm: 0.9555555555555556, acc_norm_stderr: 0.031067790907534743
community|acva:Libya|0: acc_norm: 0.9111111111111111, acc_norm_stderr: 0.04290254662948545
community|acva:Mauritania|0: acc_norm: 0.8888888888888888, acc_norm_stderr: 0.04737793696791343
community|acva:Mesopotamia_civilization|0: acc_norm: 0.9483870967741935, acc_norm_stderr: 0.017828368508587274
community|acva:Morocco|0: acc_norm: 0.8666666666666667, acc_norm_stderr: 0.05124707431905382
community|acva:Oman|0: acc_norm: 0.9777777777777777, acc_norm_stderr: 0.022222222222222227
community|acva:Palestine|0: acc_norm: 0.9529411764705882, acc_norm_stderr: 0.023105423672046245
community|acva:Qatar|0: acc_norm: 0.9333333333333333, acc_norm_stderr: 0.037605071654517735
community|acva:Saudi_Arabia|0: acc_norm: 0.9743589743589743, acc_norm_stderr: 0.011348182888535905
community|acva:Somalia|0: acc_norm: 0.8444444444444444, acc_norm_stderr: 0.05463890236888294
community|acva:Sudan|0: acc_norm: 0.9777777777777777, acc_norm_stderr: 0.022222222222222227
community|acva:Syria|0: acc_norm: 0.9555555555555556, acc_norm_stderr: 0.031067790907534733
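To help read the score and standard-error pairs listed above, here is a small sketch of two calculations. Both rest on assumptions that this page does not state: the interval uses a normal approximation with z = 1.96, and the sample-size estimate assumes each reported standard error was computed as sqrt(p(1 - p) / (n - 1)) over per-example 0/1 scores. The function names are illustrative only.

```python
Z_95 = 1.96  # two-sided 95% quantile of the standard normal distribution


def confidence_interval(score: float, stderr: float, z: float = Z_95):
    """Normal-approximation interval around a score, clipped to [0, 1]."""
    return max(0.0, score - z * stderr), min(1.0, score + z * stderr)


def implied_sample_size(score: float, stderr: float):
    """Estimate how many examples a task had from its score and stderr.

    Assumes stderr = sqrt(p * (1 - p) / (n - 1)). Returns None when stderr
    is 0 (a score of exactly 0 or 1), because n cannot be recovered then.
    """
    if stderr == 0:
        return None
    return round(score * (1 - score) / stderr**2) + 1


# Values copied from the results listed above.
overall_low, overall_high = confidence_interval(
    0.6112715406044172, 0.02765136964492306
)
bahrain_n = implied_sample_size(0.9777777777777777, 0.022222222222222227)
algeria_n = implied_sample_size(0.9743589743589743, 0.011348182888535903)
funeral_n = implied_sample_size(1.0, 0.0)

print(f"overall acc_norm 95% interval: [{overall_low:.4f}, {overall_high:.4f}]")
print(f"implied examples: Bahrain={bahrain_n}, Algeria={algeria_n}, Arabic_Funeral={funeral_n}")
```

Under the stated assumption, the Bahrain and Algeria figures are consistent with 45 and 195 examples respectively. This explains why the smaller country subsets carry larger standard errors than the larger topical subsets. The interval for the overall score is only a rough guide, since the page does not say how the aggregate standard error was computed.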



