OALL/details_NotAiLOL__Yi-1.5-dolphin-9B
收藏数据集概述
该数据集是在评估模型 NotAiLOL/Yi-1.5-dolphin-9B 的过程中自动创建的。数据集包含 136 个配置,每个配置对应一个评估任务。
数据集结构
- 数据集从 1 次运行中创建。每次运行可以在每个配置中找到一个特定的分割,分割名称使用运行的时间戳。
- "train" 分割始终指向最新的结果。
- 一个额外的配置 "results" 存储所有运行结果的聚合。
加载数据集示例
python from datasets import load_dataset data = load_dataset("OALL/details_NotAiLOL__Yi-1.5-dolphin-9B", "lighteval_xstory_cloze_ar_0", split="train")
最新结果
以下是 2024-05-23T16:06:35.231009 运行的最新结果:
python { "all": { "acc_norm": 0.3700303071543711, "acc_norm_stderr": 0.03732674539636462, "acc": 0.5162144275314361, "acc_stderr": 0.01286035780505586 }, "community|acva:Algeria|0": { "acc_norm": 0.5230769230769231, "acc_norm_stderr": 0.03585965308947409 }, "community|acva:Ancient_Egypt|0": { "acc_norm": 0.06031746031746032, "acc_norm_stderr": 0.013435297210747564 }, "community|acva:Arab_Empire|0": { "acc_norm": 0.30943396226415093, "acc_norm_stderr": 0.028450154794118627 }, "community|acva:Arabic_Architecture|0": { "acc_norm": 0.4666666666666667, "acc_norm_stderr": 0.03581804596782233 }, "community|acva:Arabic_Art|0": { "acc_norm": 0.441025641025641, "acc_norm_stderr": 0.0356473293185358 }, "community|acva:Arabic_Astronomy|0": { "acc_norm": 0.49743589743589745, "acc_norm_stderr": 0.03589743589743588 }, "community|acva:Arabic_Calligraphy|0": { "acc_norm": 0.44313725490196076, "acc_norm_stderr": 0.031169250205067875 }, "community|acva:Arabic_Ceremony|0": { "acc_norm": 0.5243243243243243, "acc_norm_stderr": 0.0368168445060319 }, "community|acva:Arabic_Clothing|0": { "acc_norm": 0.517948717948718, "acc_norm_stderr": 0.03587477098773825 }, "community|acva:Arabic_Culture|0": { "acc_norm": 0.27692307692307694, "acc_norm_stderr": 0.032127058190759304 }, "community|acva:Arabic_Food|0": { "acc_norm": 0.4205128205128205, "acc_norm_stderr": 0.03544138389303482 }, "community|acva:Arabic_Funeral|0": { "acc_norm": 0.4, "acc_norm_stderr": 0.050529115263991134 }, "community|acva:Arabic_Geography|0": { "acc_norm": 0.496551724137931, "acc_norm_stderr": 0.041665675771015785 }, "community|acva:Arabic_History|0": { "acc_norm": 0.40512820512820513, "acc_norm_stderr": 0.03524577495610962 }, "community|acva:Arabic_Language_Origin|0": { "acc_norm": 0.4842105263157895, "acc_norm_stderr": 0.051545341795930656 }, "community|acva:Arabic_Literature|0": { "acc_norm": 0.47586206896551725, "acc_norm_stderr": 0.0416180850350153 }, "community|acva:Arabic_Math|0": { "acc_norm": 0.3333333333333333, "acc_norm_stderr": 0.03384487217112063 }, "community|acva:Arabic_Medicine|0": { "acc_norm": 0.593103448275862, "acc_norm_stderr": 0.04093793981266237 }, "community|acva:Arabic_Music|0": { "acc_norm": 0.2589928057553957, "acc_norm_stderr": 0.037291986581642324 }, "community|acva:Arabic_Ornament|0": { "acc_norm": 0.6461538461538462, "acc_norm_stderr": 0.03433004254147036 }, "community|acva:Arabic_Philosophy|0": { "acc_norm": 0.5793103448275863, "acc_norm_stderr": 0.0411391498118926 }, "community|acva:Arabic_Physics_and_Chemistry|0": { "acc_norm": 0.5333333333333333, "acc_norm_stderr": 0.03581804596782232 }, "community|acva:Arabic_Wedding|0": { "acc_norm": 0.41025641025641024, "acc_norm_stderr": 0.03531493712326671 }, "community|acva:Bahrain|0": { "acc_norm": 0.35555555555555557, "acc_norm_stderr": 0.07216392363431012 }, "community|acva:Comoros|0": { "acc_norm": 0.4, "acc_norm_stderr": 0.07385489458759965 }, "community|acva:Egypt_modern|0": { "acc_norm": 0.3368421052631579, "acc_norm_stderr": 0.04874810431502904 }, "community|acva:InfluenceFromAncientEgypt|0": { "acc_norm": 0.5692307692307692, "acc_norm_stderr": 0.03555213252058761 }, "community|acva:InfluenceFromByzantium|0": { "acc_norm": 0.7103448275862069, "acc_norm_stderr": 0.037800192304380156 }, "community|acva:InfluenceFromChina|0": { "acc_norm": 0.26153846153846155, "acc_norm_stderr": 0.03155228802742769 }, "community|acva:InfluenceFromGreece|0": { "acc_norm": 0.6307692307692307, "acc_norm_stderr": 0.034648411418637566 }, "community|acva:InfluenceFromIslam|0": { "acc_norm": 0.3448275862068966, "acc_norm_stderr": 0.039609335494512087 }, "community|acva:InfluenceFromPersia|0": { "acc_norm": 0.7142857142857143, "acc_norm_stderr": 0.03424737867752743 }, "community|acva:InfluenceFromRome|0": { "acc_norm": 0.6, "acc_norm_stderr": 0.0351726229056329 }, "community|acva:Iraq|0": { "acc_norm": 0.5294117647058824, "acc_norm_stderr": 0.054460005868973586 }, "community|acva:Islam_Education|0": { "acc_norm": 0.47692307692307695, "acc_norm_stderr": 0.03585965308947409 }, "community|acva:Islam_branches_and_schools|0": { "acc_norm": 0.29714285714285715, "acc_norm_stderr": 0.034645078898843704 }, "community|acva:Islamic_law_system|0": { "acc_norm": 0.4, "acc_norm_stderr": 0.03517262290563291 }, "community|acva:Jordan|0": { "acc_norm": 0.35555555555555557, "acc_norm_stderr": 0.07216392363431012 }, "community|acva:Kuwait|0": { "acc_norm": 0.26666666666666666, "acc_norm_stderr": 0.06666666666666667 }, "community|acva:Lebanon|0": { "acc_norm": 0.2, "acc_norm_stderr": 0.06030226891555273 }, "community|acva:Libya|0": { "acc_norm": 0.4444444444444444, "acc_norm_stderr": 0.07491109582924914 }, "community|acva:Mauritania|0": { "acc_norm": 0.4444444444444444, "acc_norm_stderr": 0.07491109582924915 }, "community|acva:Mesopotamia_civilization|0": { "acc_norm": 0.5935483870967742, "acc_norm_stderr": 0.03957966643707445 }, "community|acva:Morocco|0": { "acc_norm": 0.2222222222222222, "acc_norm_stderr": 0.06267511942419628 }, "community|acva:Oman|0": { "acc_norm": 0.24444444444444444, "acc_norm_stderr": 0.06478835438716998 }, "community|acva:Palestine|0": { "acc_norm": 0.27058823529411763, "acc_norm_stderr": 0.048473144530236524 }, "community|acva:Qatar|0": { "acc_norm": 0.4444444444444444, "acc_norm_stderr": 0.07491109582924915 }, "community|acva:Saudi_Arabia|0": { "acc_norm": 0.37435897435897436, "acc_norm_stderr": 0.03474608430626235 }, "community|acva:Somalia|0": { "acc_norm": 0.35555555555555557, "acc_norm_stderr": 0.07216392363431012 },



