OALL/details_cognitivecomputations__dolphin-2.9.1-llama-3-70b
收藏Hugging Face2024-05-26 更新2024-06-12 收录
下载链接:
https://hf-mirror.com/datasets/OALL/details_cognitivecomputations__dolphin-2.9.1-llama-3-70b
下载链接
链接失效反馈官方服务:
资源简介:
---
pretty_name: Evaluation run of cognitivecomputations/dolphin-2.9.1-llama-3-70b
dataset_summary: "Dataset automatically created during the evaluation run of model\
\ [cognitivecomputations/dolphin-2.9.1-llama-3-70b](https://huggingface.co/cognitivecomputations/dolphin-2.9.1-llama-3-70b).\n\
\nThe dataset is composed of 136 configuration, each one coresponding to one of\
\ the evaluated task.\n\nThe dataset has been created from 1 run(s). Each run can\
\ be found as a specific split in each configuration, the split being named using\
\ the timestamp of the run.The \"train\" split is always pointing to the latest\
\ results.\n\nAn additional configuration \"results\" store all the aggregated results\
\ of the run.\n\nTo load the details from a run, you can for instance do the following:\n\
```python\nfrom datasets import load_dataset\ndata = load_dataset(\"OALL/details_cognitivecomputations__dolphin-2.9.1-llama-3-70b\"\
,\n\t\"lighteval_xstory_cloze_ar_0\",\n\tsplit=\"train\")\n```\n\n## Latest results\n\
\nThese are the [latest results from run 2024-05-26T06:39:48.188561](https://huggingface.co/datasets/OALL/details_cognitivecomputations__dolphin-2.9.1-llama-3-70b/blob/main/results_2024-05-26T06-39-48.188561.json)(note\
\ that their might be results for other tasks in the repos if successive evals didn't\
\ cover the same tasks. You find each in the results and the \"latest\" split for\
\ each eval):\n\n```python\n{\n \"all\": {\n \"acc_norm\": 0.5103080320175644,\n\
\ \"acc_norm_stderr\": 0.037739287164696544,\n \"acc\": 0.698874917273329,\n\
\ \"acc_stderr\": 0.011805509076527741\n },\n \"community|acva:Algeria|0\"\
: {\n \"acc_norm\": 0.5282051282051282,\n \"acc_norm_stderr\": 0.035840746749208334\n\
\ },\n \"community|acva:Ancient_Egypt|0\": {\n \"acc_norm\": 0.08253968253968254,\n\
\ \"acc_norm_stderr\": 0.015529598137078241\n },\n \"community|acva:Arab_Empire|0\"\
: {\n \"acc_norm\": 0.3132075471698113,\n \"acc_norm_stderr\": 0.02854479331905533\n\
\ },\n \"community|acva:Arabic_Architecture|0\": {\n \"acc_norm\":\
\ 0.5128205128205128,\n \"acc_norm_stderr\": 0.03588610523192216\n },\n\
\ \"community|acva:Arabic_Art|0\": {\n \"acc_norm\": 0.3487179487179487,\n\
\ \"acc_norm_stderr\": 0.034215338466705415\n },\n \"community|acva:Arabic_Astronomy|0\"\
: {\n \"acc_norm\": 0.48717948717948717,\n \"acc_norm_stderr\": 0.03588610523192216\n\
\ },\n \"community|acva:Arabic_Calligraphy|0\": {\n \"acc_norm\": 0.6627450980392157,\n\
\ \"acc_norm_stderr\": 0.029664397990297985\n },\n \"community|acva:Arabic_Ceremony|0\"\
: {\n \"acc_norm\": 0.572972972972973,\n \"acc_norm_stderr\": 0.03646580777990099\n\
\ },\n \"community|acva:Arabic_Clothing|0\": {\n \"acc_norm\": 0.5128205128205128,\n\
\ \"acc_norm_stderr\": 0.03588610523192215\n },\n \"community|acva:Arabic_Culture|0\"\
: {\n \"acc_norm\": 0.3333333333333333,\n \"acc_norm_stderr\": 0.03384487217112063\n\
\ },\n \"community|acva:Arabic_Food|0\": {\n \"acc_norm\": 0.5333333333333333,\n\
\ \"acc_norm_stderr\": 0.03581804596782233\n },\n \"community|acva:Arabic_Funeral|0\"\
: {\n \"acc_norm\": 0.4,\n \"acc_norm_stderr\": 0.050529115263991134\n\
\ },\n \"community|acva:Arabic_Geography|0\": {\n \"acc_norm\": 0.6344827586206897,\n\
\ \"acc_norm_stderr\": 0.04013124195424385\n },\n \"community|acva:Arabic_History|0\"\
: {\n \"acc_norm\": 0.41025641025641024,\n \"acc_norm_stderr\": 0.035314937123266714\n\
\ },\n \"community|acva:Arabic_Language_Origin|0\": {\n \"acc_norm\"\
: 0.6526315789473685,\n \"acc_norm_stderr\": 0.049109474007766586\n },\n\
\ \"community|acva:Arabic_Literature|0\": {\n \"acc_norm\": 0.4689655172413793,\n\
\ \"acc_norm_stderr\": 0.04158632762097828\n },\n \"community|acva:Arabic_Math|0\"\
: {\n \"acc_norm\": 0.3230769230769231,\n \"acc_norm_stderr\": 0.033575443964031323\n\
\ },\n \"community|acva:Arabic_Medicine|0\": {\n \"acc_norm\": 0.503448275862069,\n\
\ \"acc_norm_stderr\": 0.04166567577101579\n },\n \"community|acva:Arabic_Music|0\"\
: {\n \"acc_norm\": 0.302158273381295,\n \"acc_norm_stderr\": 0.03908914479291562\n\
\ },\n \"community|acva:Arabic_Ornament|0\": {\n \"acc_norm\": 0.5846153846153846,\n\
\ \"acc_norm_stderr\": 0.03538013280575031\n },\n \"community|acva:Arabic_Philosophy|0\"\
: {\n \"acc_norm\": 0.5862068965517241,\n \"acc_norm_stderr\": 0.04104269211806232\n\
\ },\n \"community|acva:Arabic_Physics_and_Chemistry|0\": {\n \"acc_norm\"\
: 0.6410256410256411,\n \"acc_norm_stderr\": 0.03444042881521377\n },\n\
\ \"community|acva:Arabic_Wedding|0\": {\n \"acc_norm\": 0.4153846153846154,\n\
\ \"acc_norm_stderr\": 0.03538013280575029\n },\n \"community|acva:Bahrain|0\"\
: {\n \"acc_norm\": 0.3333333333333333,\n \"acc_norm_stderr\": 0.07106690545187012\n\
\ },\n \"community|acva:Comoros|0\": {\n \"acc_norm\": 0.37777777777777777,\n\
\ \"acc_norm_stderr\": 0.07309112127323451\n },\n \"community|acva:Egypt_modern|0\"\
: {\n \"acc_norm\": 0.3157894736842105,\n \"acc_norm_stderr\": 0.04794350420740798\n\
\ },\n \"community|acva:InfluenceFromAncientEgypt|0\": {\n \"acc_norm\"\
: 0.6974358974358974,\n \"acc_norm_stderr\": 0.032980708700856204\n },\n\
\ \"community|acva:InfluenceFromByzantium|0\": {\n \"acc_norm\": 0.7586206896551724,\n\
\ \"acc_norm_stderr\": 0.03565998174135303\n },\n \"community|acva:InfluenceFromChina|0\"\
: {\n \"acc_norm\": 0.2923076923076923,\n \"acc_norm_stderr\": 0.03265438393749511\n\
\ },\n \"community|acva:InfluenceFromGreece|0\": {\n \"acc_norm\":\
\ 0.6564102564102564,\n \"acc_norm_stderr\": 0.03409627301409855\n },\n\
\ \"community|acva:InfluenceFromIslam|0\": {\n \"acc_norm\": 0.31724137931034485,\n\
\ \"acc_norm_stderr\": 0.03878352372138621\n },\n \"community|acva:InfluenceFromPersia|0\"\
: {\n \"acc_norm\": 0.7314285714285714,\n \"acc_norm_stderr\": 0.033600151915923894\n\
\ },\n \"community|acva:InfluenceFromRome|0\": {\n \"acc_norm\": 0.5897435897435898,\n\
\ \"acc_norm_stderr\": 0.0353149371232667\n },\n \"community|acva:Iraq|0\"\
: {\n \"acc_norm\": 0.5294117647058824,\n \"acc_norm_stderr\": 0.054460005868973586\n\
\ },\n \"community|acva:Islam_Education|0\": {\n \"acc_norm\": 0.4564102564102564,\n\
\ \"acc_norm_stderr\": 0.03576123096991215\n },\n \"community|acva:Islam_branches_and_schools|0\"\
: {\n \"acc_norm\": 0.4342857142857143,\n \"acc_norm_stderr\": 0.037576101528126626\n\
\ },\n \"community|acva:Islamic_law_system|0\": {\n \"acc_norm\": 0.4358974358974359,\n\
\ \"acc_norm_stderr\": 0.035601666623466345\n },\n \"community|acva:Jordan|0\"\
: {\n \"acc_norm\": 0.3333333333333333,\n \"acc_norm_stderr\": 0.07106690545187012\n\
\ },\n \"community|acva:Kuwait|0\": {\n \"acc_norm\": 0.3111111111111111,\n\
\ \"acc_norm_stderr\": 0.06979205927323111\n },\n \"community|acva:Lebanon|0\"\
: {\n \"acc_norm\": 0.17777777777777778,\n \"acc_norm_stderr\": 0.05763774795025094\n\
\ },\n \"community|acva:Libya|0\": {\n \"acc_norm\": 0.4444444444444444,\n\
\ \"acc_norm_stderr\": 0.07491109582924914\n },\n \"community|acva:Mauritania|0\"\
: {\n \"acc_norm\": 0.4222222222222222,\n \"acc_norm_stderr\": 0.07446027270295805\n\
\ },\n \"community|acva:Mesopotamia_civilization|0\": {\n \"acc_norm\"\
: 0.5225806451612903,\n \"acc_norm_stderr\": 0.0402500394824441\n },\n\
\ \"community|acva:Morocco|0\": {\n \"acc_norm\": 0.26666666666666666,\n\
\ \"acc_norm_stderr\": 0.06666666666666664\n },\n \"community|acva:Oman|0\"\
: {\n \"acc_norm\": 0.26666666666666666,\n \"acc_norm_stderr\": 0.06666666666666664\n\
\ },\n \"community|acva:Palestine|0\": {\n \"acc_norm\": 0.27058823529411763,\n\
\ \"acc_norm_stderr\": 0.04847314453023652\n },\n \"community|acva:Qatar|0\"\
: {\n \"acc_norm\": 0.5333333333333333,\n \"acc_norm_stderr\": 0.0752101433090355\n\
\ },\n \"community|acva:Saudi_Arabia|0\": {\n \"acc_norm\": 0.358974358974359,\n\
\ \"acc_norm_stderr\": 0.03444042881521375\n },\n \"community|acva:Somalia|0\"\
: {\n \"acc_norm\": 0.35555555555555557,\n \"acc_norm_stderr\": 0.07216392363431012\n\
\ },\n \"community|acva:Sudan|0\": {\n \"acc_norm\": 0.4,\n \
\ \"acc_norm_stderr\": 0.07385489458759965\n },\n \"community|acva:Syria|0\"\
: {\n \"acc_norm\": 0.35555555555555557,\n \"acc_norm_stderr\": 0.07216392363431012\n\
\ },\n \"community|acva:Tunisia|0\": {\n \"acc_norm\": 0.3111111111111111,\n\
\ \"acc_norm_stderr\": 0.06979205927323111\n },\n \"community|acva:United_Arab_Emirates|0\"\
: {\n \"acc_norm\": 0.25882352941176473,\n \"acc_norm_stderr\": 0.04778846120374094\n\
\ },\n \"community|acva:Yemen|0\": {\n \"acc_norm\": 0.2,\n \
\ \"acc_norm_stderr\": 0.13333333333333333\n },\n \"community|acva:communication|0\"\
: {\n \"acc_norm\": 0.43131868131868134,\n \"acc_norm_stderr\": 0.02599443023962308\n\
\ },\n \"community|acva:computer_and_phone|0\": {\n \"acc_norm\": 0.4711864406779661,\n\
\ \"acc_norm_stderr\": 0.029112132426516474\n },\n \"community|acva:daily_life|0\"\
: {\n \"acc_norm\": 0.2551928783382789,\n \"acc_norm_stderr\": 0.023784090394712923\n\
\ },\n \"community|acva:entertainment|0\": {\n \"acc_norm\": 0.2440677966101695,\n\
\ \"acc_norm_stderr\": 0.025050880690319712\n },\n \"community|alghafa:mcq_exams_test_ar|0\"\
: {\n \"acc_norm\": 0.414721723518851,\n \"acc_norm_stderr\": 0.02089402928209432\n\
\ },\n \"community|alghafa:meta_ar_dialects|0\": {\n \"acc_norm\":\
\ 0.44151992585727523,\n \"acc_norm_stderr\": 0.006761195976200771\n },\n\
\ \"community|alghafa:meta_ar_msa|0\": {\n \"acc_norm\": 0.5083798882681564,\n\
\ \"acc_norm_stderr\": 0.016720152794672486\n },\n \"community|alghafa:multiple_choice_facts_truefalse_balanced_task|0\"\
: {\n \"acc_norm\": 0.5333333333333333,\n \"acc_norm_stderr\": 0.05799451149344531\n\
\ },\n \"community|alghafa:multiple_choice_grounded_statement_soqal_task|0\"\
: {\n \"acc_norm\": 0.66,\n \"acc_norm_stderr\": 0.03880773464731456\n\
\ },\n \"community|alghafa:multiple_choice_grounded_statement_xglue_mlqa_task|0\"\
: {\n \"acc_norm\": 0.52,\n \"acc_norm_stderr\": 0.04092881363092387\n\
\ },\n \"community|alghafa:multiple_choice_rating_sentiment_no_neutral_task|0\"\
: {\n \"acc_norm\": 0.8021263289555972,\n \"acc_norm_stderr\": 0.004455878286876554\n\
\ },\n \"community|alghafa:multiple_choice_rating_sentiment_task|0\": {\n\
\ \"acc_norm\": 0.5803169307756464,\n \"acc_norm_stderr\": 0.006374336352573141\n\
\ },\n \"community|alghafa:multiple_choice_sentiment_task|0\": {\n \
\ \"acc_norm\": 0.37383720930232556,\n \"acc_norm_stderr\": 0.011669357711165854\n\
\ },\n \"community|arabic_exams|0\": {\n \"acc_norm\": 0.5437616387337058,\n\
\ \"acc_norm_stderr\": 0.021513832714984076\n },\n \"community|arabic_mmlu:abstract_algebra|0\"\
: {\n \"acc_norm\": 0.41,\n \"acc_norm_stderr\": 0.049431107042371025\n\
\ },\n \"community|arabic_mmlu:anatomy|0\": {\n \"acc_norm\": 0.5555555555555556,\n\
\ \"acc_norm_stderr\": 0.04292596718256981\n },\n \"community|arabic_mmlu:astronomy|0\"\
: {\n \"acc_norm\": 0.7302631578947368,\n \"acc_norm_stderr\": 0.03611780560284898\n\
\ },\n \"community|arabic_mmlu:business_ethics|0\": {\n \"acc_norm\"\
: 0.67,\n \"acc_norm_stderr\": 0.04725815626252609\n },\n \"community|arabic_mmlu:clinical_knowledge|0\"\
: {\n \"acc_norm\": 0.6716981132075471,\n \"acc_norm_stderr\": 0.02890159361241178\n\
\ },\n \"community|arabic_mmlu:college_biology|0\": {\n \"acc_norm\"\
: 0.6805555555555556,\n \"acc_norm_stderr\": 0.038990736873573344\n },\n\
\ \"community|arabic_mmlu:college_chemistry|0\": {\n \"acc_norm\": 0.48,\n\
\ \"acc_norm_stderr\": 0.050211673156867795\n },\n \"community|arabic_mmlu:college_computer_science|0\"\
: {\n \"acc_norm\": 0.46,\n \"acc_norm_stderr\": 0.05009082659620333\n\
\ },\n \"community|arabic_mmlu:college_mathematics|0\": {\n \"acc_norm\"\
: 0.36,\n \"acc_norm_stderr\": 0.048241815132442176\n },\n \"community|arabic_mmlu:college_medicine|0\"\
: {\n \"acc_norm\": 0.5202312138728323,\n \"acc_norm_stderr\": 0.03809342081273957\n\
\ },\n \"community|arabic_mmlu:college_physics|0\": {\n \"acc_norm\"\
: 0.43137254901960786,\n \"acc_norm_stderr\": 0.04928099597287533\n },\n\
\ \"community|arabic_mmlu:computer_security|0\": {\n \"acc_norm\": 0.71,\n\
\ \"acc_norm_stderr\": 0.045604802157206845\n },\n \"community|arabic_mmlu:conceptual_physics|0\"\
: {\n \"acc_norm\": 0.6468085106382979,\n \"acc_norm_stderr\": 0.031245325202761926\n\
\ },\n \"community|arabic_mmlu:econometrics|0\": {\n \"acc_norm\":\
\ 0.4473684210526316,\n \"acc_norm_stderr\": 0.04677473004491199\n },\n\
\ \"community|arabic_mmlu:electrical_engineering|0\": {\n \"acc_norm\"\
: 0.5310344827586206,\n \"acc_norm_stderr\": 0.04158632762097828\n },\n\
\ \"community|arabic_mmlu:elementary_mathematics|0\": {\n \"acc_norm\"\
: 0.5105820105820106,\n \"acc_norm_stderr\": 0.025745542276045478\n },\n\
\ \"community|arabic_mmlu:formal_logic|0\": {\n \"acc_norm\": 0.49206349206349204,\n\
\ \"acc_norm_stderr\": 0.044715725362943486\n },\n \"community|arabic_mmlu:global_facts|0\"\
: {\n \"acc_norm\": 0.47,\n \"acc_norm_stderr\": 0.050161355804659205\n\
\ },\n \"community|arabic_mmlu:high_school_biology|0\": {\n \"acc_norm\"\
: 0.6258064516129033,\n \"acc_norm_stderr\": 0.027528904299845693\n },\n\
\ \"community|arabic_mmlu:high_school_chemistry|0\": {\n \"acc_norm\"\
: 0.5320197044334976,\n \"acc_norm_stderr\": 0.035107665979592154\n },\n\
\ \"community|arabic_mmlu:high_school_computer_science|0\": {\n \"acc_norm\"\
: 0.68,\n \"acc_norm_stderr\": 0.04688261722621504\n },\n \"community|arabic_mmlu:high_school_european_history|0\"\
: {\n \"acc_norm\": 0.26666666666666666,\n \"acc_norm_stderr\": 0.03453131801885415\n\
\ },\n \"community|arabic_mmlu:high_school_geography|0\": {\n \"acc_norm\"\
: 0.7373737373737373,\n \"acc_norm_stderr\": 0.03135305009533087\n },\n\
\ \"community|arabic_mmlu:high_school_government_and_politics|0\": {\n \
\ \"acc_norm\": 0.7461139896373057,\n \"acc_norm_stderr\": 0.03141024780565318\n\
\ },\n \"community|arabic_mmlu:high_school_macroeconomics|0\": {\n \
\ \"acc_norm\": 0.6512820512820513,\n \"acc_norm_stderr\": 0.024162780284017717\n\
\ },\n \"community|arabic_mmlu:high_school_mathematics|0\": {\n \"\
acc_norm\": 0.3814814814814815,\n \"acc_norm_stderr\": 0.02961671892749759\n\
\ },\n \"community|arabic_mmlu:high_school_microeconomics|0\": {\n \
\ \"acc_norm\": 0.6092436974789915,\n \"acc_norm_stderr\": 0.031693802357129965\n\
\ },\n \"community|arabic_mmlu:high_school_physics|0\": {\n \"acc_norm\"\
: 0.39072847682119205,\n \"acc_norm_stderr\": 0.03983798306659807\n },\n\
\ \"community|arabic_mmlu:high_school_psychology|0\": {\n \"acc_norm\"\
: 0.6697247706422018,\n \"acc_norm_stderr\": 0.02016446633634298\n },\n\
\ \"community|arabic_mmlu:high_school_statistics|0\": {\n \"acc_norm\"\
: 0.4305555555555556,\n \"acc_norm_stderr\": 0.03376922151252335\n },\n\
\ \"community|arabic_mmlu:high_school_us_history|0\": {\n \"acc_norm\"\
: 0.35294117647058826,\n \"acc_norm_stderr\": 0.03354092437591519\n },\n\
\ \"community|arabic_mmlu:high_school_world_history|0\": {\n \"acc_norm\"\
: 0.379746835443038,\n \"acc_norm_stderr\": 0.031591887529658504\n },\n\
\ \"community|arabic_mmlu:human_aging|0\": {\n \"acc_norm\": 0.6367713004484304,\n\
\ \"acc_norm_stderr\": 0.03227790442850499\n },\n \"community|arabic_mmlu:human_sexuality|0\"\
: {\n \"acc_norm\": 0.6183206106870229,\n \"acc_norm_stderr\": 0.042607351576445594\n\
\ },\n \"community|arabic_mmlu:international_law|0\": {\n \"acc_norm\"\
: 0.8181818181818182,\n \"acc_norm_stderr\": 0.03520893951097653\n },\n\
\ \"community|arabic_mmlu:jurisprudence|0\": {\n \"acc_norm\": 0.6481481481481481,\n\
\ \"acc_norm_stderr\": 0.04616631111801713\n },\n \"community|arabic_mmlu:logical_fallacies|0\"\
: {\n \"acc_norm\": 0.5705521472392638,\n \"acc_norm_stderr\": 0.03889066619112722\n\
\ },\n \"community|arabic_mmlu:machine_learning|0\": {\n \"acc_norm\"\
: 0.48214285714285715,\n \"acc_norm_stderr\": 0.047427623612430116\n },\n\
\ \"community|arabic_mmlu:management|0\": {\n \"acc_norm\": 0.7281553398058253,\n\
\ \"acc_norm_stderr\": 0.044052680241409216\n },\n \"community|arabic_mmlu:marketing|0\"\
: {\n \"acc_norm\": 0.811965811965812,\n \"acc_norm_stderr\": 0.02559819368665227\n\
\ },\n \"community|arabic_mmlu:medical_genetics|0\": {\n \"acc_norm\"\
: 0.7,\n \"acc_norm_stderr\": 0.046056618647183814\n },\n \"community|arabic_mmlu:miscellaneous|0\"\
: {\n \"acc_norm\": 0.7330779054916986,\n \"acc_norm_stderr\": 0.015818450894777552\n\
\ },\n \"community|arabic_mmlu:moral_disputes|0\": {\n \"acc_norm\"\
: 0.6069364161849711,\n \"acc_norm_stderr\": 0.02629622791561368\n },\n\
\ \"community|arabic_mmlu:moral_scenarios|0\": {\n \"acc_norm\": 0.3743016759776536,\n\
\ \"acc_norm_stderr\": 0.016185444179457168\n },\n \"community|arabic_mmlu:nutrition|0\"\
: {\n \"acc_norm\": 0.7287581699346405,\n \"acc_norm_stderr\": 0.02545775669666788\n\
\ },\n \"community|arabic_mmlu:philosophy|0\": {\n \"acc_norm\": 0.6463022508038585,\n\
\ \"acc_norm_stderr\": 0.027155208103200865\n },\n \"community|arabic_mmlu:prehistory|0\"\
: {\n \"acc_norm\": 0.6265432098765432,\n \"acc_norm_stderr\": 0.02691500301138016\n\
\ },\n \"community|arabic_mmlu:professional_accounting|0\": {\n \"\
acc_norm\": 0.41134751773049644,\n \"acc_norm_stderr\": 0.029354911159940985\n\
\ },\n \"community|arabic_mmlu:professional_law|0\": {\n \"acc_norm\"\
: 0.39504563233376794,\n \"acc_norm_stderr\": 0.012485727813251562\n },\n\
\ \"community|arabic_mmlu:professional_medicine|0\": {\n \"acc_norm\"\
: 0.2977941176470588,\n \"acc_norm_stderr\": 0.027778298701545443\n },\n\
\ \"community|arabic_mmlu:professional_psychology|0\": {\n \"acc_norm\"\
: 0.6094771241830066,\n \"acc_norm_stderr\": 0.019737008998094593\n },\n\
\ \"community|arabic_mmlu:public_relations|0\": {\n \"acc_norm\": 0.6272727272727273,\n\
\ \"acc_norm_stderr\": 0.04631381319425465\n },\n \"community|arabic_mmlu:security_studies|0\"\
: {\n \"acc_norm\": 0.6571428571428571,\n \"acc_norm_stderr\": 0.030387262919547724\n\
\ },\n \"community|arabic_mmlu:sociology|0\": {\n \"acc_norm\": 0.7611940298507462,\n\
\ \"acc_norm_stderr\": 0.03014777593540922\n },\n \"community|arabic_mmlu:us_foreign_policy|0\"\
: {\n \"acc_norm\": 0.82,\n \"acc_norm_stderr\": 0.038612291966536934\n\
\ },\n \"community|arabic_mmlu:virology|0\": {\n \"acc_norm\": 0.46987951807228917,\n\
\ \"acc_norm_stderr\": 0.03885425420866767\n },\n \"community|arabic_mmlu:world_religions|0\"\
: {\n \"acc_norm\": 0.695906432748538,\n \"acc_norm_stderr\": 0.03528211258245231\n\
\ },\n \"community|arc_challenge_okapi_ar|0\": {\n \"acc_norm\": 0.5258620689655172,\n\
\ \"acc_norm_stderr\": 0.01466717774523103\n },\n \"community|arc_easy_ar|0\"\
: {\n \"acc_norm\": 0.5583756345177665,\n \"acc_norm_stderr\": 0.010215458925359593\n\
\ },\n \"community|boolq_ar|0\": {\n \"acc_norm\": 0.7917177914110429,\n\
\ \"acc_norm_stderr\": 0.007113266977878542\n },\n \"community|copa_ext_ar|0\"\
: {\n \"acc_norm\": 0.6,\n \"acc_norm_stderr\": 0.05192907868894985\n\
\ },\n \"community|hellaswag_okapi_ar|0\": {\n \"acc_norm\": 0.4286337367789772,\n\
\ \"acc_norm_stderr\": 0.005167920261920976\n },\n \"community|openbook_qa_ext_ar|0\"\
: {\n \"acc_norm\": 0.5252525252525253,\n \"acc_norm_stderr\": 0.02246735418300409\n\
\ },\n \"community|piqa_ar|0\": {\n \"acc_norm\": 0.6786688488816148,\n\
\ \"acc_norm_stderr\": 0.010910449361123257\n },\n \"community|race_ar|0\"\
: {\n \"acc_norm\": 0.5615743558531142,\n \"acc_norm_stderr\": 0.007068320907453806\n\
\ },\n \"community|sciq_ar|0\": {\n \"acc_norm\": 0.5889447236180905,\n\
\ \"acc_norm_stderr\": 0.015606092943535746\n },\n \"community|toxigen_ar|0\"\
: {\n \"acc_norm\": 0.6213903743315508,\n \"acc_norm_stderr\": 0.01587101303083359\n\
\ },\n \"lighteval|xstory_cloze:ar|0\": {\n \"acc\": 0.698874917273329,\n\
\ \"acc_stderr\": 0.011805509076527741\n },\n \"community|acva:_average|0\"\
: {\n \"acc_norm\": 0.42977124151803386,\n \"acc_norm_stderr\": 0.046310869991255735\n\
\ },\n \"community|alghafa:_average|0\": {\n \"acc_norm\": 0.5371372600012427,\n\
\ \"acc_norm_stderr\": 0.022734001130585206\n },\n \"community|arabic_mmlu:_average|0\"\
: {\n \"acc_norm\": 0.5737971101047391,\n \"acc_norm_stderr\": 0.03546732161097641\n\
\ }\n}\n```"
repo_url: https://huggingface.co/cognitivecomputations/dolphin-2.9.1-llama-3-70b
configs:
- config_name: community_acva_Algeria_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|acva:Algeria|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|acva:Algeria|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_acva_Ancient_Egypt_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|acva:Ancient_Egypt|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|acva:Ancient_Egypt|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_acva_Arab_Empire_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|acva:Arab_Empire|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|acva:Arab_Empire|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_acva_Arabic_Architecture_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|acva:Arabic_Architecture|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|acva:Arabic_Architecture|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_acva_Arabic_Art_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|acva:Arabic_Art|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|acva:Arabic_Art|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_acva_Arabic_Astronomy_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|acva:Arabic_Astronomy|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|acva:Arabic_Astronomy|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_acva_Arabic_Calligraphy_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|acva:Arabic_Calligraphy|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|acva:Arabic_Calligraphy|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_acva_Arabic_Ceremony_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|acva:Arabic_Ceremony|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|acva:Arabic_Ceremony|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_acva_Arabic_Clothing_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|acva:Arabic_Clothing|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|acva:Arabic_Clothing|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_acva_Arabic_Culture_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|acva:Arabic_Culture|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|acva:Arabic_Culture|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_acva_Arabic_Food_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|acva:Arabic_Food|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|acva:Arabic_Food|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_acva_Arabic_Funeral_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|acva:Arabic_Funeral|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|acva:Arabic_Funeral|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_acva_Arabic_Geography_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|acva:Arabic_Geography|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|acva:Arabic_Geography|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_acva_Arabic_History_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|acva:Arabic_History|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|acva:Arabic_History|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_acva_Arabic_Language_Origin_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|acva:Arabic_Language_Origin|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|acva:Arabic_Language_Origin|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_acva_Arabic_Literature_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|acva:Arabic_Literature|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|acva:Arabic_Literature|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_acva_Arabic_Math_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|acva:Arabic_Math|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|acva:Arabic_Math|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_acva_Arabic_Medicine_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|acva:Arabic_Medicine|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|acva:Arabic_Medicine|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_acva_Arabic_Music_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|acva:Arabic_Music|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|acva:Arabic_Music|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_acva_Arabic_Ornament_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|acva:Arabic_Ornament|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|acva:Arabic_Ornament|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_acva_Arabic_Philosophy_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|acva:Arabic_Philosophy|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|acva:Arabic_Philosophy|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_acva_Arabic_Physics_and_Chemistry_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|acva:Arabic_Physics_and_Chemistry|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|acva:Arabic_Physics_and_Chemistry|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_acva_Arabic_Wedding_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|acva:Arabic_Wedding|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|acva:Arabic_Wedding|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_acva_Bahrain_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|acva:Bahrain|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|acva:Bahrain|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_acva_Comoros_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|acva:Comoros|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|acva:Comoros|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_acva_Egypt_modern_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|acva:Egypt_modern|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|acva:Egypt_modern|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_acva_InfluenceFromAncientEgypt_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|acva:InfluenceFromAncientEgypt|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|acva:InfluenceFromAncientEgypt|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_acva_InfluenceFromByzantium_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|acva:InfluenceFromByzantium|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|acva:InfluenceFromByzantium|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_acva_InfluenceFromChina_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|acva:InfluenceFromChina|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|acva:InfluenceFromChina|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_acva_InfluenceFromGreece_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|acva:InfluenceFromGreece|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|acva:InfluenceFromGreece|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_acva_InfluenceFromIslam_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|acva:InfluenceFromIslam|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|acva:InfluenceFromIslam|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_acva_InfluenceFromPersia_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|acva:InfluenceFromPersia|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|acva:InfluenceFromPersia|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_acva_InfluenceFromRome_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|acva:InfluenceFromRome|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|acva:InfluenceFromRome|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_acva_Iraq_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|acva:Iraq|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|acva:Iraq|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_acva_Islam_Education_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|acva:Islam_Education|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|acva:Islam_Education|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_acva_Islam_branches_and_schools_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|acva:Islam_branches_and_schools|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|acva:Islam_branches_and_schools|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_acva_Islamic_law_system_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|acva:Islamic_law_system|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|acva:Islamic_law_system|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_acva_Jordan_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|acva:Jordan|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|acva:Jordan|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_acva_Kuwait_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|acva:Kuwait|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|acva:Kuwait|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_acva_Lebanon_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|acva:Lebanon|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|acva:Lebanon|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_acva_Libya_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|acva:Libya|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|acva:Libya|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_acva_Mauritania_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|acva:Mauritania|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|acva:Mauritania|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_acva_Mesopotamia_civilization_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|acva:Mesopotamia_civilization|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|acva:Mesopotamia_civilization|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_acva_Morocco_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|acva:Morocco|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|acva:Morocco|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_acva_Oman_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|acva:Oman|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|acva:Oman|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_acva_Palestine_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|acva:Palestine|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|acva:Palestine|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_acva_Qatar_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|acva:Qatar|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|acva:Qatar|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_acva_Saudi_Arabia_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|acva:Saudi_Arabia|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|acva:Saudi_Arabia|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_acva_Somalia_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|acva:Somalia|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|acva:Somalia|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_acva_Sudan_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|acva:Sudan|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|acva:Sudan|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_acva_Syria_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|acva:Syria|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|acva:Syria|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_acva_Tunisia_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|acva:Tunisia|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|acva:Tunisia|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_acva_United_Arab_Emirates_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|acva:United_Arab_Emirates|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|acva:United_Arab_Emirates|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_acva_Yemen_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|acva:Yemen|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|acva:Yemen|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_acva_communication_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|acva:communication|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|acva:communication|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_acva_computer_and_phone_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|acva:computer_and_phone|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|acva:computer_and_phone|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_acva_daily_life_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|acva:daily_life|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|acva:daily_life|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_acva_entertainment_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|acva:entertainment|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|acva:entertainment|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_alghafa_mcq_exams_test_ar_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|alghafa:mcq_exams_test_ar|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|alghafa:mcq_exams_test_ar|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_alghafa_meta_ar_dialects_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|alghafa:meta_ar_dialects|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|alghafa:meta_ar_dialects|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_alghafa_meta_ar_msa_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|alghafa:meta_ar_msa|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|alghafa:meta_ar_msa|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_alghafa_multiple_choice_facts_truefalse_balanced_task_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|alghafa:multiple_choice_facts_truefalse_balanced_task|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|alghafa:multiple_choice_facts_truefalse_balanced_task|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_alghafa_multiple_choice_grounded_statement_soqal_task_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|alghafa:multiple_choice_grounded_statement_soqal_task|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|alghafa:multiple_choice_grounded_statement_soqal_task|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_alghafa_multiple_choice_grounded_statement_xglue_mlqa_task_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|alghafa:multiple_choice_grounded_statement_xglue_mlqa_task|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|alghafa:multiple_choice_grounded_statement_xglue_mlqa_task|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_alghafa_multiple_choice_rating_sentiment_no_neutral_task_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|alghafa:multiple_choice_rating_sentiment_no_neutral_task|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|alghafa:multiple_choice_rating_sentiment_no_neutral_task|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_alghafa_multiple_choice_rating_sentiment_task_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|alghafa:multiple_choice_rating_sentiment_task|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|alghafa:multiple_choice_rating_sentiment_task|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_alghafa_multiple_choice_sentiment_task_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|alghafa:multiple_choice_sentiment_task|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|alghafa:multiple_choice_sentiment_task|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_arabic_exams_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|arabic_exams|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|arabic_exams|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_arabic_mmlu_abstract_algebra_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|arabic_mmlu:abstract_algebra|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|arabic_mmlu:abstract_algebra|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_arabic_mmlu_anatomy_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|arabic_mmlu:anatomy|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|arabic_mmlu:anatomy|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_arabic_mmlu_astronomy_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|arabic_mmlu:astronomy|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|arabic_mmlu:astronomy|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_arabic_mmlu_business_ethics_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|arabic_mmlu:business_ethics|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|arabic_mmlu:business_ethics|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_arabic_mmlu_clinical_knowledge_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|arabic_mmlu:clinical_knowledge|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|arabic_mmlu:clinical_knowledge|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_arabic_mmlu_college_biology_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|arabic_mmlu:college_biology|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|arabic_mmlu:college_biology|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_arabic_mmlu_college_chemistry_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|arabic_mmlu:college_chemistry|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|arabic_mmlu:college_chemistry|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_arabic_mmlu_college_computer_science_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|arabic_mmlu:college_computer_science|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|arabic_mmlu:college_computer_science|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_arabic_mmlu_college_mathematics_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|arabic_mmlu:college_mathematics|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|arabic_mmlu:college_mathematics|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_arabic_mmlu_college_medicine_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|arabic_mmlu:college_medicine|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|arabic_mmlu:college_medicine|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_arabic_mmlu_college_physics_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|arabic_mmlu:college_physics|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|arabic_mmlu:college_physics|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_arabic_mmlu_computer_security_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|arabic_mmlu:computer_security|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|arabic_mmlu:computer_security|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_arabic_mmlu_conceptual_physics_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|arabic_mmlu:conceptual_physics|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|arabic_mmlu:conceptual_physics|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_arabic_mmlu_econometrics_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|arabic_mmlu:econometrics|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|arabic_mmlu:econometrics|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_arabic_mmlu_electrical_engineering_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|arabic_mmlu:electrical_engineering|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|arabic_mmlu:electrical_engineering|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_arabic_mmlu_elementary_mathematics_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|arabic_mmlu:elementary_mathematics|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|arabic_mmlu:elementary_mathematics|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_arabic_mmlu_formal_logic_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|arabic_mmlu:formal_logic|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|arabic_mmlu:formal_logic|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_arabic_mmlu_global_facts_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|arabic_mmlu:global_facts|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|arabic_mmlu:global_facts|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_arabic_mmlu_high_school_biology_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|arabic_mmlu:high_school_biology|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|arabic_mmlu:high_school_biology|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_arabic_mmlu_high_school_chemistry_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|arabic_mmlu:high_school_chemistry|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|arabic_mmlu:high_school_chemistry|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_arabic_mmlu_high_school_computer_science_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|arabic_mmlu:high_school_computer_science|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|arabic_mmlu:high_school_computer_science|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_arabic_mmlu_high_school_european_history_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|arabic_mmlu:high_school_european_history|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|arabic_mmlu:high_school_european_history|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_arabic_mmlu_high_school_geography_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|arabic_mmlu:high_school_geography|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|arabic_mmlu:high_school_geography|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_arabic_mmlu_high_school_government_and_politics_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|arabic_mmlu:high_school_government_and_politics|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|arabic_mmlu:high_school_government_and_politics|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_arabic_mmlu_high_school_macroeconomics_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|arabic_mmlu:high_school_macroeconomics|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|arabic_mmlu:high_school_macroeconomics|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_arabic_mmlu_high_school_mathematics_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|arabic_mmlu:high_school_mathematics|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|arabic_mmlu:high_school_mathematics|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_arabic_mmlu_high_school_microeconomics_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|arabic_mmlu:high_school_microeconomics|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|arabic_mmlu:high_school_microeconomics|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_arabic_mmlu_high_school_physics_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|arabic_mmlu:high_school_physics|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|arabic_mmlu:high_school_physics|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_arabic_mmlu_high_school_psychology_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|arabic_mmlu:high_school_psychology|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|arabic_mmlu:high_school_psychology|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_arabic_mmlu_high_school_statistics_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|arabic_mmlu:high_school_statistics|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|arabic_mmlu:high_school_statistics|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_arabic_mmlu_high_school_us_history_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|arabic_mmlu:high_school_us_history|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|arabic_mmlu:high_school_us_history|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_arabic_mmlu_high_school_world_history_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|arabic_mmlu:high_school_world_history|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|arabic_mmlu:high_school_world_history|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_arabic_mmlu_human_aging_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|arabic_mmlu:human_aging|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|arabic_mmlu:human_aging|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_arabic_mmlu_human_sexuality_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|arabic_mmlu:human_sexuality|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|arabic_mmlu:human_sexuality|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_arabic_mmlu_international_law_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|arabic_mmlu:international_law|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|arabic_mmlu:international_law|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_arabic_mmlu_jurisprudence_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|arabic_mmlu:jurisprudence|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|arabic_mmlu:jurisprudence|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_arabic_mmlu_logical_fallacies_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|arabic_mmlu:logical_fallacies|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|arabic_mmlu:logical_fallacies|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_arabic_mmlu_machine_learning_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|arabic_mmlu:machine_learning|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|arabic_mmlu:machine_learning|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_arabic_mmlu_management_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|arabic_mmlu:management|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|arabic_mmlu:management|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_arabic_mmlu_marketing_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|arabic_mmlu:marketing|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|arabic_mmlu:marketing|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_arabic_mmlu_medical_genetics_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|arabic_mmlu:medical_genetics|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|arabic_mmlu:medical_genetics|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_arabic_mmlu_miscellaneous_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|arabic_mmlu:miscellaneous|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|arabic_mmlu:miscellaneous|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_arabic_mmlu_moral_disputes_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|arabic_mmlu:moral_disputes|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|arabic_mmlu:moral_disputes|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_arabic_mmlu_moral_scenarios_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|arabic_mmlu:moral_scenarios|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|arabic_mmlu:moral_scenarios|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_arabic_mmlu_nutrition_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|arabic_mmlu:nutrition|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|arabic_mmlu:nutrition|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_arabic_mmlu_philosophy_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|arabic_mmlu:philosophy|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|arabic_mmlu:philosophy|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_arabic_mmlu_prehistory_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|arabic_mmlu:prehistory|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|arabic_mmlu:prehistory|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_arabic_mmlu_professional_accounting_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|arabic_mmlu:professional_accounting|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|arabic_mmlu:professional_accounting|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_arabic_mmlu_professional_law_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|arabic_mmlu:professional_law|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|arabic_mmlu:professional_law|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_arabic_mmlu_professional_medicine_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|arabic_mmlu:professional_medicine|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|arabic_mmlu:professional_medicine|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_arabic_mmlu_professional_psychology_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|arabic_mmlu:professional_psychology|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|arabic_mmlu:professional_psychology|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_arabic_mmlu_public_relations_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|arabic_mmlu:public_relations|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|arabic_mmlu:public_relations|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_arabic_mmlu_security_studies_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|arabic_mmlu:security_studies|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|arabic_mmlu:security_studies|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_arabic_mmlu_sociology_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|arabic_mmlu:sociology|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|arabic_mmlu:sociology|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_arabic_mmlu_us_foreign_policy_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|arabic_mmlu:us_foreign_policy|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|arabic_mmlu:us_foreign_policy|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_arabic_mmlu_virology_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|arabic_mmlu:virology|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|arabic_mmlu:virology|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_arabic_mmlu_world_religions_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|arabic_mmlu:world_religions|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|arabic_mmlu:world_religions|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_arc_challenge_okapi_ar_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|arc_challenge_okapi_ar|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|arc_challenge_okapi_ar|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_arc_easy_ar_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|arc_easy_ar|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|arc_easy_ar|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_boolq_ar_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|boolq_ar|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|boolq_ar|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_copa_ext_ar_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|copa_ext_ar|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|copa_ext_ar|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_hellaswag_okapi_ar_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|hellaswag_okapi_ar|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|hellaswag_okapi_ar|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_openbook_qa_ext_ar_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|openbook_qa_ext_ar|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|openbook_qa_ext_ar|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_piqa_ar_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|piqa_ar|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|piqa_ar|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_race_ar_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|race_ar|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|race_ar|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_sciq_ar_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|sciq_ar|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|sciq_ar|0_2024-05-26T06-39-48.188561.parquet'
- config_name: community_toxigen_ar_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_community|toxigen_ar|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_community|toxigen_ar|0_2024-05-26T06-39-48.188561.parquet'
- config_name: lighteval_xstory_cloze_ar_0
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- '**/details_lighteval|xstory_cloze:ar|0_2024-05-26T06-39-48.188561.parquet'
- split: latest
path:
- '**/details_lighteval|xstory_cloze:ar|0_2024-05-26T06-39-48.188561.parquet'
- config_name: results
data_files:
- split: 2024_05_26T06_39_48.188561
path:
- results_2024-05-26T06-39-48.188561.parquet
- split: latest
path:
- results_2024-05-26T06-39-48.188561.parquet
---
# Dataset Card for Evaluation run of cognitivecomputations/dolphin-2.9.1-llama-3-70b
<!-- Provide a quick summary of the dataset. -->
Dataset automatically created during the evaluation run of model [cognitivecomputations/dolphin-2.9.1-llama-3-70b](https://huggingface.co/cognitivecomputations/dolphin-2.9.1-llama-3-70b).
The dataset is composed of 136 configuration, each one coresponding to one of the evaluated task.
The dataset has been created from 1 run(s). Each run can be found as a specific split in each configuration, the split being named using the timestamp of the run.The "train" split is always pointing to the latest results.
An additional configuration "results" store all the aggregated results of the run.
To load the details from a run, you can for instance do the following:
```python
from datasets import load_dataset
data = load_dataset("OALL/details_cognitivecomputations__dolphin-2.9.1-llama-3-70b",
"lighteval_xstory_cloze_ar_0",
split="train")
```
## Latest results
These are the [latest results from run 2024-05-26T06:39:48.188561](https://huggingface.co/datasets/OALL/details_cognitivecomputations__dolphin-2.9.1-llama-3-70b/blob/main/results_2024-05-26T06-39-48.188561.json)(note that their might be results for other tasks in the repos if successive evals didn't cover the same tasks. You find each in the results and the "latest" split for each eval):
```python
{
"all": {
"acc_norm": 0.5103080320175644,
"acc_norm_stderr": 0.037739287164696544,
"acc": 0.698874917273329,
"acc_stderr": 0.011805509076527741
},
"community|acva:Algeria|0": {
"acc_norm": 0.5282051282051282,
"acc_norm_stderr": 0.035840746749208334
},
"community|acva:Ancient_Egypt|0": {
"acc_norm": 0.08253968253968254,
"acc_norm_stderr": 0.015529598137078241
},
"community|acva:Arab_Empire|0": {
"acc_norm": 0.3132075471698113,
"acc_norm_stderr": 0.02854479331905533
},
"community|acva:Arabic_Architecture|0": {
"acc_norm": 0.5128205128205128,
"acc_norm_stderr": 0.03588610523192216
},
"community|acva:Arabic_Art|0": {
"acc_norm": 0.3487179487179487,
"acc_norm_stderr": 0.034215338466705415
},
"community|acva:Arabic_Astronomy|0": {
"acc_norm": 0.48717948717948717,
"acc_norm_stderr": 0.03588610523192216
},
"community|acva:Arabic_Calligraphy|0": {
"acc_norm": 0.6627450980392157,
"acc_norm_stderr": 0.029664397990297985
},
"community|acva:Arabic_Ceremony|0": {
"acc_norm": 0.572972972972973,
"acc_norm_stderr": 0.03646580777990099
},
"community|acva:Arabic_Clothing|0": {
"acc_norm": 0.5128205128205128,
"acc_norm_stderr": 0.03588610523192215
},
"community|acva:Arabic_Culture|0": {
"acc_norm": 0.3333333333333333,
"acc_norm_stderr": 0.03384487217112063
},
"community|acva:Arabic_Food|0": {
"acc_norm": 0.5333333333333333,
"acc_norm_stderr": 0.03581804596782233
},
"community|acva:Arabic_Funeral|0": {
"acc_norm": 0.4,
"acc_norm_stderr": 0.050529115263991134
},
"community|acva:Arabic_Geography|0": {
"acc_norm": 0.6344827586206897,
"acc_norm_stderr": 0.04013124195424385
},
"community|acva:Arabic_History|0": {
"acc_norm": 0.41025641025641024,
"acc_norm_stderr": 0.035314937123266714
},
"community|acva:Arabic_Language_Origin|0": {
"acc_norm": 0.6526315789473685,
"acc_norm_stderr": 0.049109474007766586
},
"community|acva:Arabic_Literature|0": {
"acc_norm": 0.4689655172413793,
"acc_norm_stderr": 0.04158632762097828
},
"community|acva:Arabic_Math|0": {
"acc_norm": 0.3230769230769231,
"acc_norm_stderr": 0.033575443964031323
},
"community|acva:Arabic_Medicine|0": {
"acc_norm": 0.503448275862069,
"acc_norm_stderr": 0.04166567577101579
},
"community|acva:Arabic_Music|0": {
"acc_norm": 0.302158273381295,
"acc_norm_stderr": 0.03908914479291562
},
"community|acva:Arabic_Ornament|0": {
"acc_norm": 0.5846153846153846,
"acc_norm_stderr": 0.03538013280575031
},
"community|acva:Arabic_Philosophy|0": {
"acc_norm": 0.5862068965517241,
"acc_norm_stderr": 0.04104269211806232
},
"community|acva:Arabic_Physics_and_Chemistry|0": {
"acc_norm": 0.6410256410256411,
"acc_norm_stderr": 0.03444042881521377
},
"community|acva:Arabic_Wedding|0": {
"acc_norm": 0.4153846153846154,
"acc_norm_stderr": 0.03538013280575029
},
"community|acva:Bahrain|0": {
"acc_norm": 0.3333333333333333,
"acc_norm_stderr": 0.07106690545187012
},
"community|acva:Comoros|0": {
"acc_norm": 0.37777777777777777,
"acc_norm_stderr": 0.07309112127323451
},
"community|acva:Egypt_modern|0": {
"acc_norm": 0.3157894736842105,
"acc_norm_stderr": 0.04794350420740798
},
"community|acva:InfluenceFromAncientEgypt|0": {
"acc_norm": 0.6974358974358974,
"acc_norm_stderr": 0.032980708700856204
},
"community|acva:InfluenceFromByzantium|0": {
"acc_norm": 0.7586206896551724,
"acc_norm_stderr": 0.03565998174135303
},
"community|acva:InfluenceFromChina|0": {
"acc_norm": 0.2923076923076923,
"acc_norm_stderr": 0.03265438393749511
},
"community|acva:InfluenceFromGreece|0": {
"acc_norm": 0.6564102564102564,
"acc_norm_stderr": 0.03409627301409855
},
"community|acva:InfluenceFromIslam|0": {
"acc_norm": 0.31724137931034485,
"acc_norm_stderr": 0.03878352372138621
},
"community|acva:InfluenceFromPersia|0": {
"acc_norm": 0.7314285714285714,
"acc_norm_stderr": 0.033600151915923894
},
"community|acva:InfluenceFromRome|0": {
"acc_norm": 0.5897435897435898,
"acc_norm_stderr": 0.0353149371232667
},
"community|acva:Iraq|0": {
"acc_norm": 0.5294117647058824,
"acc_norm_stderr": 0.054460005868973586
},
"community|acva:Islam_Education|0": {
"acc_norm": 0.4564102564102564,
"acc_norm_stderr": 0.03576123096991215
},
"community|acva:Islam_branches_and_schools|0": {
"acc_norm": 0.4342857142857143,
"acc_norm_stderr": 0.037576101528126626
},
"community|acva:Islamic_law_system|0": {
"acc_norm": 0.4358974358974359,
"acc_norm_stderr": 0.035601666623466345
},
"community|acva:Jordan|0": {
"acc_norm": 0.3333333333333333,
"acc_norm_stderr": 0.07106690545187012
},
"community|acva:Kuwait|0": {
"acc_norm": 0.3111111111111111,
"acc_norm_stderr": 0.06979205927323111
},
"community|acva:Lebanon|0": {
"acc_norm": 0.17777777777777778,
"acc_norm_stderr": 0.05763774795025094
},
"community|acva:Libya|0": {
"acc_norm": 0.4444444444444444,
"acc_norm_stderr": 0.07491109582924914
},
"community|acva:Mauritania|0": {
"acc_norm": 0.4222222222222222,
"acc_norm_stderr": 0.07446027270295805
},
"community|acva:Mesopotamia_civilization|0": {
"acc_norm": 0.5225806451612903,
"acc_norm_stderr": 0.0402500394824441
},
"community|acva:Morocco|0": {
"acc_norm": 0.26666666666666666,
"acc_norm_stderr": 0.06666666666666664
},
"community|acva:Oman|0": {
"acc_norm": 0.26666666666666666,
"acc_norm_stderr": 0.06666666666666664
},
"community|acva:Palestine|0": {
"acc_norm": 0.27058823529411763,
"acc_norm_stderr": 0.04847314453023652
},
"community|acva:Qatar|0": {
"acc_norm": 0.5333333333333333,
"acc_norm_stderr": 0.0752101433090355
},
"community|acva:Saudi_Arabia|0": {
"acc_norm": 0.358974358974359,
"acc_norm_stderr": 0.03444042881521375
},
"community|acva:Somalia|0": {
"acc_norm": 0.35555555555555557,
"acc_norm_stderr": 0.07216392363431012
},
"community|acva:Sudan|0": {
"acc_norm": 0.4,
"acc_norm_stderr": 0.07385489458759965
},
"community|acva:Syria|0": {
"acc_norm": 0.35555555555555557,
"acc_norm_stderr": 0.07216392363431012
},
"community|acva:Tunisia|0": {
"acc_norm": 0.3111111111111111,
"acc_norm_stderr": 0.06979205927323111
},
"community|acva:United_Arab_Emirates|0": {
"acc_norm": 0.25882352941176473,
"acc_norm_stderr": 0.04778846120374094
},
"community|acva:Yemen|0": {
"acc_norm": 0.2,
"acc_norm_stderr": 0.13333333333333333
},
"community|acva:communication|0": {
"acc_norm": 0.43131868131868134,
"acc_norm_stderr": 0.02599443023962308
},
"community|acva:computer_and_phone|0": {
"acc_norm": 0.4711864406779661,
"acc_norm_stderr": 0.029112132426516474
},
"community|acva:daily_life|0": {
"acc_norm": 0.2551928783382789,
"acc_norm_stderr": 0.023784090394712923
},
"community|acva:entertainment|0": {
"acc_norm": 0.2440677966101695,
"acc_norm_stderr": 0.025050880690319712
},
"community|alghafa:mcq_exams_test_ar|0": {
"acc_norm": 0.414721723518851,
"acc_norm_stderr": 0.02089402928209432
},
"community|alghafa:meta_ar_dialects|0": {
"acc_norm": 0.44151992585727523,
"acc_norm_stderr": 0.006761195976200771
},
"community|alghafa:meta_ar_msa|0": {
"acc_norm": 0.5083798882681564,
"acc_norm_stderr": 0.016720152794672486
},
"community|alghafa:multiple_choice_facts_truefalse_balanced_task|0": {
"acc_norm": 0.5333333333333333,
"acc_norm_stderr": 0.05799451149344531
},
"community|alghafa:multiple_choice_grounded_statement_soqal_task|0": {
"acc_norm": 0.66,
"acc_norm_stderr": 0.03880773464731456
},
"community|alghafa:multiple_choice_grounded_statement_xglue_mlqa_task|0": {
"acc_norm": 0.52,
"acc_norm_stderr": 0.04092881363092387
},
"community|alghafa:multiple_choice_rating_sentiment_no_neutral_task|0": {
"acc_norm": 0.8021263289555972,
"acc_norm_stderr": 0.004455878286876554
},
"community|alghafa:multiple_choice_rating_sentiment_task|0": {
"acc_norm": 0.5803169307756464,
"acc_norm_stderr": 0.006374336352573141
},
"community|alghafa:multiple_choice_sentiment_task|0": {
"acc_norm": 0.37383720930232556,
"acc_norm_stderr": 0.011669357711165854
},
"community|arabic_exams|0": {
"acc_norm": 0.5437616387337058,
"acc_norm_stderr": 0.021513832714984076
},
"community|arabic_mmlu:abstract_algebra|0": {
"acc_norm": 0.41,
"acc_norm_stderr": 0.049431107042371025
},
"community|arabic_mmlu:anatomy|0": {
"acc_norm": 0.5555555555555556,
"acc_norm_stderr": 0.04292596718256981
},
"community|arabic_mmlu:astronomy|0": {
"acc_norm": 0.7302631578947368,
"acc_norm_stderr": 0.03611780560284898
},
"community|arabic_mmlu:business_ethics|0": {
"acc_norm": 0.67,
"acc_norm_stderr": 0.04725815626252609
},
"community|arabic_mmlu:clinical_knowledge|0": {
"acc_norm": 0.6716981132075471,
"acc_norm_stderr": 0.02890159361241178
},
"community|arabic_mmlu:college_biology|0": {
"acc_norm": 0.6805555555555556,
"acc_norm_stderr": 0.038990736873573344
},
"community|arabic_mmlu:college_chemistry|0": {
"acc_norm": 0.48,
"acc_norm_stderr": 0.050211673156867795
},
"community|arabic_mmlu:college_computer_science|0": {
"acc_norm": 0.46,
"acc_norm_stderr": 0.05009082659620333
},
"community|arabic_mmlu:college_mathematics|0": {
"acc_norm": 0.36,
"acc_norm_stderr": 0.048241815132442176
},
"community|arabic_mmlu:college_medicine|0": {
"acc_norm": 0.5202312138728323,
"acc_norm_stderr": 0.03809342081273957
},
"community|arabic_mmlu:college_physics|0": {
"acc_norm": 0.43137254901960786,
"acc_norm_stderr": 0.04928099597287533
},
"community|arabic_mmlu:computer_security|0": {
"acc_norm": 0.71,
"acc_norm_stderr": 0.045604802157206845
},
"community|arabic_mmlu:conceptual_physics|0": {
"acc_norm": 0.6468085106382979,
"acc_norm_stderr": 0.031245325202761926
},
"community|arabic_mmlu:econometrics|0": {
"acc_norm": 0.4473684210526316,
"acc_norm_stderr": 0.04677473004491199
},
"community|arabic_mmlu:electrical_engineering|0": {
"acc_norm": 0.5310344827586206,
"acc_norm_stderr": 0.04158632762097828
},
"community|arabic_mmlu:elementary_mathematics|0": {
"acc_norm": 0.5105820105820106,
"acc_norm_stderr": 0.025745542276045478
},
"community|arabic_mmlu:formal_logic|0": {
"acc_norm": 0.49206349206349204,
"acc_norm_stderr": 0.044715725362943486
},
"community|arabic_mmlu:global_facts|0": {
"acc_norm": 0.47,
"acc_norm_stderr": 0.050161355804659205
},
"community|arabic_mmlu:high_school_biology|0": {
"acc_norm": 0.6258064516129033,
"acc_norm_stderr": 0.027528904299845693
},
"community|arabic_mmlu:high_school_chemistry|0": {
"acc_norm": 0.5320197044334976,
"acc_norm_stderr": 0.035107665979592154
},
"community|arabic_mmlu:high_school_computer_science|0": {
"acc_norm": 0.68,
"acc_norm_stderr": 0.04688261722621504
},
"community|arabic_mmlu:high_school_european_history|0": {
"acc_norm": 0.26666666666666666,
"acc_norm_stderr": 0.03453131801885415
},
"community|arabic_mmlu:high_school_geography|0": {
"acc_norm": 0.7373737373737373,
"acc_norm_stderr": 0.03135305009533087
},
"community|arabic_mmlu:high_school_government_and_politics|0": {
"acc_norm": 0.7461139896373057,
"acc_norm_stderr": 0.03141024780565318
},
"community|arabic_mmlu:high_school_macroeconomics|0": {
"acc_norm": 0.6512820512820513,
"acc_norm_stderr": 0.024162780284017717
},
"community|arabic_mmlu:high_school_mathematics|0": {
"acc_norm": 0.3814814814814815,
"acc_norm_stderr": 0.02961671892749759
},
"community|arabic_mmlu:high_school_microeconomics|0": {
"acc_norm": 0.6092436974789915,
"acc_norm_stderr": 0.031693802357129965
},
"community|arabic_mmlu:high_school_physics|0": {
"acc_norm": 0.39072847682119205,
"acc_norm_stderr": 0.03983798306659807
},
"community|arabic_mmlu:high_school_psychology|0": {
"acc_norm": 0.6697247706422018,
"acc_norm_stderr": 0.02016446633634298
},
"community|arabic_mmlu:high_school_statistics|0": {
"acc_norm": 0.4305555555555556,
"acc_norm_stderr": 0.03376922151252335
},
"community|arabic_mmlu:high_school_us_history|0": {
"acc_norm": 0.35294117647058826,
"acc_norm_stderr": 0.03354092437591519
},
"community|arabic_mmlu:high_school_world_history|0": {
"acc_norm": 0.379746835443038,
"acc_norm_stderr": 0.031591887529658504
},
"community|arabic_mmlu:human_aging|0": {
"acc_norm": 0.6367713004484304,
"acc_norm_stderr": 0.03227790442850499
},
"community|arabic_mmlu:human_sexuality|0": {
"acc_norm": 0.6183206106870229,
"acc_norm_stderr": 0.042607351576445594
},
"community|arabic_mmlu:international_law|0": {
"acc_norm": 0.8181818181818182,
"acc_norm_stderr": 0.03520893951097653
},
"community|arabic_mmlu:jurisprudence|0": {
"acc_norm": 0.6481481481481481,
"acc_norm_stderr": 0.04616631111801713
},
"community|arabic_mmlu:logical_fallacies|0": {
"acc_norm": 0.5705521472392638,
"acc_norm_stderr": 0.03889066619112722
},
"community|arabic_mmlu:machine_learning|0": {
"acc_norm": 0.48214285714285715,
"acc_norm_stderr": 0.047427623612430116
},
"community|arabic_mmlu:management|0": {
"acc_norm": 0.7281553398058253,
"acc_norm_stderr": 0.044052680241409216
},
"community|arabic_mmlu:marketing|0": {
"acc_norm": 0.811965811965812,
"acc_norm_stderr": 0.02559819368665227
},
"community|arabic_mmlu:medical_genetics|0": {
"acc_norm": 0.7,
"acc_norm_stderr": 0.046056618647183814
},
"community|arabic_mmlu:miscellaneous|0": {
"acc_norm": 0.7330779054916986,
"acc_norm_stderr": 0.015818450894777552
},
"community|arabic_mmlu:moral_disputes|0": {
"acc_norm": 0.6069364161849711,
"acc_norm_stderr": 0.02629622791561368
},
"community|arabic_mmlu:moral_scenarios|0": {
"acc_norm": 0.3743016759776536,
"acc_norm_stderr": 0.016185444179457168
},
"community|arabic_mmlu:nutrition|0": {
"acc_norm": 0.7287581699346405,
"acc_norm_stderr": 0.02545775669666788
},
"community|arabic_mmlu:philosophy|0": {
"acc_norm": 0.6463022508038585,
"acc_norm_stderr": 0.027155208103200865
},
"community|arabic_mmlu:prehistory|0": {
"acc_norm": 0.6265432098765432,
"acc_norm_stderr": 0.02691500301138016
},
"community|arabic_mmlu:professional_accounting|0": {
"acc_norm": 0.41134751773049644,
"acc_norm_stderr": 0.029354911159940985
},
"community|arabic_mmlu:professional_law|0": {
"acc_norm": 0.39504563233376794,
"acc_norm_stderr": 0.012485727813251562
},
"community|arabic_mmlu:professional_medicine|0": {
"acc_norm": 0.2977941176470588,
"acc_norm_stderr": 0.027778298701545443
},
"community|arabic_mmlu:professional_psychology|0": {
"acc_norm": 0.6094771241830066,
"acc_norm_stderr": 0.019737008998094593
},
"community|arabic_mmlu:public_relations|0": {
"acc_norm": 0.6272727272727273,
"acc_norm_stderr": 0.04631381319425465
},
"community|arabic_mmlu:security_studies|0": {
"acc_norm": 0.6571428571428571,
"acc_norm_stderr": 0.030387262919547724
},
"community|arabic_mmlu:sociology|0": {
"acc_norm": 0.7611940298507462,
"acc_norm_stderr": 0.03014777593540922
},
"community|arabic_mmlu:us_foreign_policy|0": {
"acc_norm": 0.82,
"acc_norm_stderr": 0.038612291966536934
},
"community|arabic_mmlu:virology|0": {
"acc_norm": 0.46987951807228917,
"acc_norm_stderr": 0.03885425420866767
},
"community|arabic_mmlu:world_religions|0": {
"acc_norm": 0.695906432748538,
"acc_norm_stderr": 0.03528211258245231
},
"community|arc_challenge_okapi_ar|0": {
"acc_norm": 0.5258620689655172,
"acc_norm_stderr": 0.01466717774523103
},
"community|arc_easy_ar|0": {
"acc_norm": 0.5583756345177665,
"acc_norm_stderr": 0.010215458925359593
},
"community|boolq_ar|0": {
"acc_norm": 0.7917177914110429,
"acc_norm_stderr": 0.007113266977878542
},
"community|copa_ext_ar|0": {
"acc_norm": 0.6,
"acc_norm_stderr": 0.05192907868894985
},
"community|hellaswag_okapi_ar|0": {
"acc_norm": 0.4286337367789772,
"acc_norm_stderr": 0.005167920261920976
},
"community|openbook_qa_ext_ar|0": {
"acc_norm": 0.5252525252525253,
"acc_norm_stderr": 0.02246735418300409
},
"community|piqa_ar|0": {
"acc_norm": 0.6786688488816148,
"acc_norm_stderr": 0.010910449361123257
},
"community|race_ar|0": {
"acc_norm": 0.5615743558531142,
"acc_norm_stderr": 0.007068320907453806
},
"community|sciq_ar|0": {
"acc_norm": 0.5889447236180905,
"acc_norm_stderr": 0.015606092943535746
},
"community|toxigen_ar|0": {
"acc_norm": 0.6213903743315508,
"acc_norm_stderr": 0.01587101303083359
},
"lighteval|xstory_cloze:ar|0": {
"acc": 0.698874917273329,
"acc_stderr": 0.011805509076527741
},
"community|acva:_average|0": {
"acc_norm": 0.42977124151803386,
"acc_norm_stderr": 0.046310869991255735
},
"community|alghafa:_average|0": {
"acc_norm": 0.5371372600012427,
"acc_norm_stderr": 0.022734001130585206
},
"community|arabic_mmlu:_average|0": {
"acc_norm": 0.5737971101047391,
"acc_norm_stderr": 0.03546732161097641
}
}
```
## Dataset Details
### Dataset Description
<!-- Provide a longer summary of what this dataset is. -->
- **Curated by:** [More Information Needed]
- **Funded by [optional]:** [More Information Needed]
- **Shared by [optional]:** [More Information Needed]
- **Language(s) (NLP):** [More Information Needed]
- **License:** [More Information Needed]
### Dataset Sources [optional]
<!-- Provide the basic links for the dataset. -->
- **Repository:** [More Information Needed]
- **Paper [optional]:** [More Information Needed]
- **Demo [optional]:** [More Information Needed]
## Uses
<!-- Address questions around how the dataset is intended to be used. -->
### Direct Use
<!-- This section describes suitable use cases for the dataset. -->
[More Information Needed]
### Out-of-Scope Use
<!-- This section addresses misuse, malicious use, and uses that the dataset will not work well for. -->
[More Information Needed]
## Dataset Structure
<!-- This section provides a description of the dataset fields, and additional information about the dataset structure such as criteria used to create the splits, relationships between data points, etc. -->
[More Information Needed]
## Dataset Creation
### Curation Rationale
<!-- Motivation for the creation of this dataset. -->
[More Information Needed]
### Source Data
<!-- This section describes the source data (e.g. news text and headlines, social media posts, translated sentences, ...). -->
#### Data Collection and Processing
<!-- This section describes the data collection and processing process such as data selection criteria, filtering and normalization methods, tools and libraries used, etc. -->
[More Information Needed]
#### Who are the source data producers?
<!-- This section describes the people or systems who originally created the data. It should also include self-reported demographic or identity information for the source data creators if this information is available. -->
[More Information Needed]
### Annotations [optional]
<!-- If the dataset contains annotations which are not part of the initial data collection, use this section to describe them. -->
#### Annotation process
<!-- This section describes the annotation process such as annotation tools used in the process, the amount of data annotated, annotation guidelines provided to the annotators, interannotator statistics, annotation validation, etc. -->
[More Information Needed]
#### Who are the annotators?
<!-- This section describes the people or systems who created the annotations. -->
[More Information Needed]
#### Personal and Sensitive Information
<!-- State whether the dataset contains data that might be considered personal, sensitive, or private (e.g., data that reveals addresses, uniquely identifiable names or aliases, racial or ethnic origins, sexual orientations, religious beliefs, political opinions, financial or health data, etc.). If efforts were made to anonymize the data, describe the anonymization process. -->
[More Information Needed]
## Bias, Risks, and Limitations
<!-- This section is meant to convey both technical and sociotechnical limitations. -->
[More Information Needed]
### Recommendations
<!-- This section is meant to convey recommendations with respect to the bias, risk, and technical limitations. -->
Users should be made aware of the risks, biases and limitations of the dataset. More information needed for further recommendations.
## Citation [optional]
<!-- If there is a paper or blog post introducing the dataset, the APA and Bibtex information for that should go in this section. -->
**BibTeX:**
[More Information Needed]
**APA:**
[More Information Needed]
## Glossary [optional]
<!-- If relevant, include terms and calculations in this section that can help readers understand the dataset or dataset card. -->
[More Information Needed]
## More Information [optional]
[More Information Needed]
## Dataset Card Authors [optional]
[More Information Needed]
## Dataset Card Contact
[More Information Needed]
提供机构:
OALL
原始信息汇总
数据集概述
数据集名称
- pretty_name: Evaluation run of cognitivecomputations/dolphin-2.9.1-llama-3-70b
数据集创建
- 创建背景: 自动创建于模型cognitivecomputations/dolphin-2.9.1-llama-3-70b的评估运行期间。
- 数据组成: 包含136个配置,每个配置对应一个评估任务。
- 创建次数: 数据集由1次运行创建,每次运行作为一个特定的分割,分割名使用运行时间戳命名。
- 特殊配置: 额外配置“results”存储了所有运行的聚合结果。
数据集加载示例
python from datasets import load_dataset data = load_dataset("OALL/details_cognitivecomputations__dolphin-2.9.1-llama-3-70b", "lighteval_xstory_cloze_ar_0", split="train")
最新结果
- 结果链接: latest results from run 2024-05-26T06:39:48.188561
- 结果内容: 包含多个社区和任务的归一化准确率及标准误差。
搜集汇总
数据集介绍

构建方式
在自然语言处理与模型评估的交叉领域中,该数据集是基于对认知计算团队开发的dolphin-2.9.1-llama-3-70b模型进行自动化评测而生成的。数据集由136个配置组成,每个配置对应一个被评估的具体任务,全面覆盖了模型在不同维度上的表现。整个数据集源自一次完整的运行流程,每次运行的结果均以独立分割的形式存储,分割名称采用运行时间戳进行标识,而‘train’分割则始终指向最新的评测结果。此外,数据集还包含一个名为‘results’的额外配置,用于汇总所有运行的综合指标,从而为研究者提供全局视角。
特点
该数据集的核心特色在于其精细化的任务划分与动态更新机制。136个配置涵盖了从阿拉伯语文化知识到多学科选择题的广泛任务,例如阿拉伯语方言识别、情感分析、以及各类专业知识测试,每个任务都提供了标准化准确率(acc_norm)及其标准误差,确保了评测结果的可比性与统计可靠性。数据集的‘train’分割自动指向最新运行结果,而历史运行则通过时间戳保留,这种设计既支持了模型性能的持续追踪,又便于进行跨时间维度的对比分析,充分满足了迭代式模型评估的需求。
使用方法
使用该数据集时,研究者可通过HuggingFace的datasets库便捷加载。例如,通过load_dataset函数指定数据集名称与对应任务配置(如‘lighteval_xstory_cloze_ar_0’),并选择‘train’分割即可获取最新评测详情。若需访问特定历史运行的结果,则可在配置中选用相应时间戳命名的分割。此外,通过加载‘results’配置,用户能够直接获取所有任务的聚合指标,从而快速评估模型的整体表现。这种灵活的加载方式使得数据集既适用于细粒度的任务分析,也适合宏观的性能概览。
背景与挑战
背景概述
该数据集源自2024年对认知计算团队发布的dolphin-2.9.1-llama-3-70b模型的一次系统性评估运行。作为大语言模型评测领域的重要实践,该数据集由136个独立配置构成,每个配置对应一项被评测的具体任务,旨在多维度衡量模型在阿拉伯语相关知识与推理能力上的表现。数据集创建于2024年5月26日,由OALL机构主导构建,其核心研究问题聚焦于评估模型在涵盖阿拉伯文化、历史、科学、法律、医学等众多领域的综合理解水平。该数据集通过标准化评测流程,为后续大语言模型在低资源语言场景下的性能对比与改进提供了宝贵基准,对推动多语言AI系统的公平性与鲁棒性研究具有显著影响力。
当前挑战
该数据集所解决的核心领域问题在于大语言模型在阿拉伯语这一低资源语言上的多任务评估挑战,具体包括:1)模型在阿拉伯文化特有概念(如阿拉伯书法、伊斯兰法系)上的知识缺口,导致准确性波动剧烈;2)跨领域任务(如医学、数学、历史)的评测难度不均衡,部分任务准确率低至8.25%,凸显模型知识覆盖的碎片化。构建过程中面临的挑战包括:1)136个评测配置的标准化设计,需确保各任务指标的可比性与可复现性;2)评测运行的时间戳管理与结果聚合,要求精确追踪每次运行的最新状态,避免数据版本混乱。
常用场景
经典使用场景
在自然语言处理与多语言大模型评估的交叉领域中,OALL/details_cognitivecomputations__dolphin-2.9.1-llama-3-70b数据集作为模型性能的精细化度量工具而备受关注。其经典使用场景在于对特定大型语言模型——即cognitivecomputations/dolphin-2.9.1-llama-3-70b——进行系统性的多任务评估。该数据集囊括了136个独立配置,每个配置对应一项被评估的任务,覆盖了从阿拉伯文化知识(如acva系列)到阿拉伯语考试(如arabic_exams)等广泛领域。研究者通过加载特定任务配置下的最新评估分割,能够精准获取模型在诸如故事完形填空、方言识别、情感分类等任务上的准确率与标准误,从而对模型的综合语言理解与文化适应能力进行量化刻画。
衍生相关工作
该数据集衍生了一系列具有影响力的相关工作,推动了阿拉伯语NLP评估体系的系统化发展。基于其多任务评估框架,研究者开发了诸如Arabic MMLU(大规模多任务语言理解基准)等扩展评测集,将学科知识从57个领域进一步细化,并引入了标准化考试题型。同时,受其acva子集启发,涌现了一批聚焦阿拉伯文化常识推理的专用数据集与评测任务,如针对伊斯兰法律、历史与哲学的知识图谱构建工作。此外,数据集的高精度评估结果被用于指导模型微调策略的优化,例如通过分析不同方言任务的准确率差异,催生了面向阿拉伯语方言的领域自适应训练方法。这些衍生工作不仅丰富了低资源语言评估的理论工具箱,也为多语言大模型在文化敏感场景下的公平性与鲁棒性研究奠定了基石。
数据集最近研究
最新研究方向
当前,大语言模型在多语言、多文化背景下的评估与对齐成为前沿热点。该数据集围绕dolphin-2.9.1-llama-3-70b模型在阿拉伯语及相关文化领域的性能展开系统性评测,覆盖了从阿拉伯历史、医学、哲学到现代通信、计算机等136项任务。值得注意的是,数据集不仅包含标准阿拉伯语(MSA)任务,还深入探讨了阿拉伯方言、情感分析、事实一致性等挑战性场景,反映了研究界对低资源语言和文化特异性知识理解的重视。这一方向与近期全球AI治理中强调的“文化包容性”和“多语言公平性”紧密相连,为构建更普惠、更少偏见的大模型提供了关键的量化基准。其意义在于,通过细粒度的错误分析,推动模型在非英语语境下的鲁棒性提升,从而促进AI技术在阿拉伯世界教育、医疗等领域的负责任部署。
以上内容由遇见数据集搜集并总结生成



