OALL/details_microsoft__Phi-3-medium-4k-instruct
收藏Hugging Face2024-10-11 更新2025-04-12 收录
下载链接:
https://hf-mirror.com/datasets/OALL/details_microsoft__Phi-3-medium-4k-instruct
下载链接
链接失效反馈官方服务:
资源简介:
---
pretty_name: Evaluation run of microsoft/Phi-3-medium-4k-instruct
dataset_summary: "Dataset automatically created during the evaluation run of model\
\ [microsoft/Phi-3-medium-4k-instruct](https://huggingface.co/microsoft/Phi-3-medium-4k-instruct).\n\
\nThe dataset is composed of 136 configuration, each one coresponding to one of\
\ the evaluated task.\n\nThe dataset has been created from 1 run(s). Each run can\
\ be found as a specific split in each configuration, the split being named using\
\ the timestamp of the run.The \"train\" split is always pointing to the latest\
\ results.\n\nAn additional configuration \"results\" store all the aggregated results\
\ of the run.\n\nTo load the details from a run, you can for instance do the following:\n\
```python\nfrom datasets import load_dataset\ndata = load_dataset(\"OALL/details_microsoft__Phi-3-medium-4k-instruct\"\
,\n\t\"lighteval_xstory_cloze_ar_0\",\n\tsplit=\"train\")\n```\n\n## Latest results\n\
\nThese are the [latest results from run 2024-10-11T17:01:44.350127](https://huggingface.co/datasets/OALL/details_microsoft__Phi-3-medium-4k-instruct/blob/main/results_2024-10-11T17-01-44.350127.json)(note\
\ that their might be results for other tasks in the repos if successive evals didn't\
\ cover the same tasks. You find each in the results and the \"latest\" split for\
\ each eval):\n\n```python\n{\n \"all\": {\n \"acc_norm\": 0.47111505023267153,\n\
\ \"acc_norm_stderr\": 0.03852465552924525,\n \"acc\": 0.5598941098610192,\n\
\ \"acc_stderr\": 0.012774475160716331\n },\n \"community|acva:Algeria|0\"\
: {\n \"acc_norm\": 0.5846153846153846,\n \"acc_norm_stderr\": 0.03538013280575029\n\
\ },\n \"community|acva:Ancient_Egypt|0\": {\n \"acc_norm\": 0.6,\n\
\ \"acc_norm_stderr\": 0.0276465406550454\n },\n \"community|acva:Arab_Empire|0\"\
: {\n \"acc_norm\": 0.35471698113207545,\n \"acc_norm_stderr\": 0.029445175328199593\n\
\ },\n \"community|acva:Arabic_Architecture|0\": {\n \"acc_norm\":\
\ 0.6564102564102564,\n \"acc_norm_stderr\": 0.03409627301409855\n },\n\
\ \"community|acva:Arabic_Art|0\": {\n \"acc_norm\": 0.36923076923076925,\n\
\ \"acc_norm_stderr\": 0.03464841141863756\n },\n \"community|acva:Arabic_Astronomy|0\"\
: {\n \"acc_norm\": 0.48717948717948717,\n \"acc_norm_stderr\": 0.03588610523192216\n\
\ },\n \"community|acva:Arabic_Calligraphy|0\": {\n \"acc_norm\": 0.796078431372549,\n\
\ \"acc_norm_stderr\": 0.025280907058814615\n },\n \"community|acva:Arabic_Ceremony|0\"\
: {\n \"acc_norm\": 0.6,\n \"acc_norm_stderr\": 0.03611575592573071\n\
\ },\n \"community|acva:Arabic_Clothing|0\": {\n \"acc_norm\": 0.4307692307692308,\n\
\ \"acc_norm_stderr\": 0.03555213252058761\n },\n \"community|acva:Arabic_Culture|0\"\
: {\n \"acc_norm\": 0.6615384615384615,\n \"acc_norm_stderr\": 0.03397280032734095\n\
\ },\n \"community|acva:Arabic_Food|0\": {\n \"acc_norm\": 0.6666666666666666,\n\
\ \"acc_norm_stderr\": 0.033844872171120644\n },\n \"community|acva:Arabic_Funeral|0\"\
: {\n \"acc_norm\": 0.4105263157894737,\n \"acc_norm_stderr\": 0.050738635645512106\n\
\ },\n \"community|acva:Arabic_Geography|0\": {\n \"acc_norm\": 0.5586206896551724,\n\
\ \"acc_norm_stderr\": 0.04137931034482758\n },\n \"community|acva:Arabic_History|0\"\
: {\n \"acc_norm\": 0.35384615384615387,\n \"acc_norm_stderr\": 0.03433004254147036\n\
\ },\n \"community|acva:Arabic_Language_Origin|0\": {\n \"acc_norm\"\
: 0.7052631578947368,\n \"acc_norm_stderr\": 0.047025008739248385\n },\n\
\ \"community|acva:Arabic_Literature|0\": {\n \"acc_norm\": 0.6206896551724138,\n\
\ \"acc_norm_stderr\": 0.040434618619167466\n },\n \"community|acva:Arabic_Math|0\"\
: {\n \"acc_norm\": 0.4307692307692308,\n \"acc_norm_stderr\": 0.0355521325205876\n\
\ },\n \"community|acva:Arabic_Medicine|0\": {\n \"acc_norm\": 0.6137931034482759,\n\
\ \"acc_norm_stderr\": 0.04057324734419035\n },\n \"community|acva:Arabic_Music|0\"\
: {\n \"acc_norm\": 0.2517985611510791,\n \"acc_norm_stderr\": 0.03694846055443904\n\
\ },\n \"community|acva:Arabic_Ornament|0\": {\n \"acc_norm\": 0.5794871794871795,\n\
\ \"acc_norm_stderr\": 0.03544138389303483\n },\n \"community|acva:Arabic_Philosophy|0\"\
: {\n \"acc_norm\": 0.593103448275862,\n \"acc_norm_stderr\": 0.04093793981266236\n\
\ },\n \"community|acva:Arabic_Physics_and_Chemistry|0\": {\n \"acc_norm\"\
: 0.558974358974359,\n \"acc_norm_stderr\": 0.03564732931853579\n },\n\
\ \"community|acva:Arabic_Wedding|0\": {\n \"acc_norm\": 0.676923076923077,\n\
\ \"acc_norm_stderr\": 0.03357544396403132\n },\n \"community|acva:Bahrain|0\"\
: {\n \"acc_norm\": 0.5111111111111111,\n \"acc_norm_stderr\": 0.07535922203472523\n\
\ },\n \"community|acva:Comoros|0\": {\n \"acc_norm\": 0.4888888888888889,\n\
\ \"acc_norm_stderr\": 0.07535922203472523\n },\n \"community|acva:Egypt_modern|0\"\
: {\n \"acc_norm\": 0.5894736842105263,\n \"acc_norm_stderr\": 0.05073863564551208\n\
\ },\n \"community|acva:InfluenceFromAncientEgypt|0\": {\n \"acc_norm\"\
: 0.6666666666666666,\n \"acc_norm_stderr\": 0.033844872171120644\n },\n\
\ \"community|acva:InfluenceFromByzantium|0\": {\n \"acc_norm\": 0.7103448275862069,\n\
\ \"acc_norm_stderr\": 0.03780019230438015\n },\n \"community|acva:InfluenceFromChina|0\"\
: {\n \"acc_norm\": 0.27692307692307694,\n \"acc_norm_stderr\": 0.032127058190759304\n\
\ },\n \"community|acva:InfluenceFromGreece|0\": {\n \"acc_norm\":\
\ 0.7794871794871795,\n \"acc_norm_stderr\": 0.029766004661644124\n },\n\
\ \"community|acva:InfluenceFromIslam|0\": {\n \"acc_norm\": 0.7034482758620689,\n\
\ \"acc_norm_stderr\": 0.03806142687309992\n },\n \"community|acva:InfluenceFromPersia|0\"\
: {\n \"acc_norm\": 0.8228571428571428,\n \"acc_norm_stderr\": 0.028943391569621377\n\
\ },\n \"community|acva:InfluenceFromRome|0\": {\n \"acc_norm\": 0.5692307692307692,\n\
\ \"acc_norm_stderr\": 0.035552132520587594\n },\n \"community|acva:Iraq|0\"\
: {\n \"acc_norm\": 0.6,\n \"acc_norm_stderr\": 0.05345224838248487\n\
\ },\n \"community|acva:Islam_Education|0\": {\n \"acc_norm\": 0.558974358974359,\n\
\ \"acc_norm_stderr\": 0.0356473293185358\n },\n \"community|acva:Islam_branches_and_schools|0\"\
: {\n \"acc_norm\": 0.5714285714285714,\n \"acc_norm_stderr\": 0.037516123674206446\n\
\ },\n \"community|acva:Islamic_law_system|0\": {\n \"acc_norm\": 0.6153846153846154,\n\
\ \"acc_norm_stderr\": 0.03492896993742303\n },\n \"community|acva:Jordan|0\"\
: {\n \"acc_norm\": 0.4222222222222222,\n \"acc_norm_stderr\": 0.07446027270295806\n\
\ },\n \"community|acva:Kuwait|0\": {\n \"acc_norm\": 0.6444444444444445,\n\
\ \"acc_norm_stderr\": 0.07216392363431011\n },\n \"community|acva:Lebanon|0\"\
: {\n \"acc_norm\": 0.5333333333333333,\n \"acc_norm_stderr\": 0.0752101433090355\n\
\ },\n \"community|acva:Libya|0\": {\n \"acc_norm\": 0.5777777777777777,\n\
\ \"acc_norm_stderr\": 0.07446027270295806\n },\n \"community|acva:Mauritania|0\"\
: {\n \"acc_norm\": 0.6222222222222222,\n \"acc_norm_stderr\": 0.07309112127323451\n\
\ },\n \"community|acva:Mesopotamia_civilization|0\": {\n \"acc_norm\"\
: 0.632258064516129,\n \"acc_norm_stderr\": 0.03885602832856746\n },\n\
\ \"community|acva:Morocco|0\": {\n \"acc_norm\": 0.5111111111111111,\n\
\ \"acc_norm_stderr\": 0.07535922203472523\n },\n \"community|acva:Oman|0\"\
: {\n \"acc_norm\": 0.6888888888888889,\n \"acc_norm_stderr\": 0.06979205927323111\n\
\ },\n \"community|acva:Palestine|0\": {\n \"acc_norm\": 0.5058823529411764,\n\
\ \"acc_norm_stderr\": 0.05455069703232772\n },\n \"community|acva:Qatar|0\"\
: {\n \"acc_norm\": 0.5777777777777777,\n \"acc_norm_stderr\": 0.07446027270295805\n\
\ },\n \"community|acva:Saudi_Arabia|0\": {\n \"acc_norm\": 0.517948717948718,\n\
\ \"acc_norm_stderr\": 0.03587477098773826\n },\n \"community|acva:Somalia|0\"\
: {\n \"acc_norm\": 0.5555555555555556,\n \"acc_norm_stderr\": 0.07491109582924915\n\
\ },\n \"community|acva:Sudan|0\": {\n \"acc_norm\": 0.4,\n \
\ \"acc_norm_stderr\": 0.07385489458759965\n },\n \"community|acva:Syria|0\"\
: {\n \"acc_norm\": 0.6,\n \"acc_norm_stderr\": 0.07385489458759965\n\
\ },\n \"community|acva:Tunisia|0\": {\n \"acc_norm\": 0.5333333333333333,\n\
\ \"acc_norm_stderr\": 0.0752101433090355\n },\n \"community|acva:United_Arab_Emirates|0\"\
: {\n \"acc_norm\": 0.5176470588235295,\n \"acc_norm_stderr\": 0.05452048340661897\n\
\ },\n \"community|acva:Yemen|0\": {\n \"acc_norm\": 0.4,\n \
\ \"acc_norm_stderr\": 0.16329931618554522\n },\n \"community|acva:communication|0\"\
: {\n \"acc_norm\": 0.48626373626373626,\n \"acc_norm_stderr\": 0.026233288793681565\n\
\ },\n \"community|acva:computer_and_phone|0\": {\n \"acc_norm\": 0.47796610169491527,\n\
\ \"acc_norm_stderr\": 0.029132263908368084\n },\n \"community|acva:daily_life|0\"\
: {\n \"acc_norm\": 0.3560830860534125,\n \"acc_norm_stderr\": 0.02612287368198665\n\
\ },\n \"community|acva:entertainment|0\": {\n \"acc_norm\": 0.45084745762711864,\n\
\ \"acc_norm_stderr\": 0.029019347731871377\n },\n \"community|alghafa:mcq_exams_test_ar|0\"\
: {\n \"acc_norm\": 0.3105924596050269,\n \"acc_norm_stderr\": 0.019624385782512334\n\
\ },\n \"community|alghafa:meta_ar_dialects|0\": {\n \"acc_norm\":\
\ 0.318628359592215,\n \"acc_norm_stderr\": 0.006344227814191393\n },\n\
\ \"community|alghafa:meta_ar_msa|0\": {\n \"acc_norm\": 0.3776536312849162,\n\
\ \"acc_norm_stderr\": 0.016214148752136632\n },\n \"community|alghafa:multiple_choice_facts_truefalse_balanced_task|0\"\
: {\n \"acc_norm\": 0.52,\n \"acc_norm_stderr\": 0.05807730170189531\n\
\ },\n \"community|alghafa:multiple_choice_grounded_statement_soqal_task|0\"\
: {\n \"acc_norm\": 0.6066666666666667,\n \"acc_norm_stderr\": 0.040018638461474625\n\
\ },\n \"community|alghafa:multiple_choice_grounded_statement_xglue_mlqa_task|0\"\
: {\n \"acc_norm\": 0.4,\n \"acc_norm_stderr\": 0.040134003725439044\n\
\ },\n \"community|alghafa:multiple_choice_rating_sentiment_no_neutral_task|0\"\
: {\n \"acc_norm\": 0.8033771106941838,\n \"acc_norm_stderr\": 0.004445234658793058\n\
\ },\n \"community|alghafa:multiple_choice_rating_sentiment_task|0\": {\n\
\ \"acc_norm\": 0.5457881567973311,\n \"acc_norm_stderr\": 0.006431065182552263\n\
\ },\n \"community|alghafa:multiple_choice_sentiment_task|0\": {\n \
\ \"acc_norm\": 0.3744186046511628,\n \"acc_norm_stderr\": 0.011673005337197204\n\
\ },\n \"community|arabic_exams|0\": {\n \"acc_norm\": 0.34823091247672255,\n\
\ \"acc_norm_stderr\": 0.02057776223602678\n },\n \"community|arabic_mmlu:abstract_algebra|0\"\
: {\n \"acc_norm\": 0.36,\n \"acc_norm_stderr\": 0.04824181513244218\n\
\ },\n \"community|arabic_mmlu:anatomy|0\": {\n \"acc_norm\": 0.2814814814814815,\n\
\ \"acc_norm_stderr\": 0.03885004245800254\n },\n \"community|arabic_mmlu:astronomy|0\"\
: {\n \"acc_norm\": 0.42105263157894735,\n \"acc_norm_stderr\": 0.04017901275981749\n\
\ },\n \"community|arabic_mmlu:business_ethics|0\": {\n \"acc_norm\"\
: 0.56,\n \"acc_norm_stderr\": 0.049888765156985884\n },\n \"community|arabic_mmlu:clinical_knowledge|0\"\
: {\n \"acc_norm\": 0.4075471698113208,\n \"acc_norm_stderr\": 0.030242233800854498\n\
\ },\n \"community|arabic_mmlu:college_biology|0\": {\n \"acc_norm\"\
: 0.2916666666666667,\n \"acc_norm_stderr\": 0.038009680605548594\n },\n\
\ \"community|arabic_mmlu:college_chemistry|0\": {\n \"acc_norm\": 0.28,\n\
\ \"acc_norm_stderr\": 0.045126085985421276\n },\n \"community|arabic_mmlu:college_computer_science|0\"\
: {\n \"acc_norm\": 0.32,\n \"acc_norm_stderr\": 0.046882617226215034\n\
\ },\n \"community|arabic_mmlu:college_mathematics|0\": {\n \"acc_norm\"\
: 0.24,\n \"acc_norm_stderr\": 0.04292346959909282\n },\n \"community|arabic_mmlu:college_medicine|0\"\
: {\n \"acc_norm\": 0.3063583815028902,\n \"acc_norm_stderr\": 0.03514942551267437\n\
\ },\n \"community|arabic_mmlu:college_physics|0\": {\n \"acc_norm\"\
: 0.22549019607843138,\n \"acc_norm_stderr\": 0.041583075330832865\n },\n\
\ \"community|arabic_mmlu:computer_security|0\": {\n \"acc_norm\": 0.46,\n\
\ \"acc_norm_stderr\": 0.05009082659620333\n },\n \"community|arabic_mmlu:conceptual_physics|0\"\
: {\n \"acc_norm\": 0.39148936170212767,\n \"acc_norm_stderr\": 0.031907012423268113\n\
\ },\n \"community|arabic_mmlu:econometrics|0\": {\n \"acc_norm\":\
\ 0.2894736842105263,\n \"acc_norm_stderr\": 0.04266339443159394\n },\n\
\ \"community|arabic_mmlu:electrical_engineering|0\": {\n \"acc_norm\"\
: 0.4827586206896552,\n \"acc_norm_stderr\": 0.04164188720169377\n },\n\
\ \"community|arabic_mmlu:elementary_mathematics|0\": {\n \"acc_norm\"\
: 0.4523809523809524,\n \"acc_norm_stderr\": 0.02563425811555496\n },\n\
\ \"community|arabic_mmlu:formal_logic|0\": {\n \"acc_norm\": 0.38095238095238093,\n\
\ \"acc_norm_stderr\": 0.04343525428949098\n },\n \"community|arabic_mmlu:global_facts|0\"\
: {\n \"acc_norm\": 0.35,\n \"acc_norm_stderr\": 0.047937248544110175\n\
\ },\n \"community|arabic_mmlu:high_school_biology|0\": {\n \"acc_norm\"\
: 0.45483870967741935,\n \"acc_norm_stderr\": 0.02832774309156107\n },\n\
\ \"community|arabic_mmlu:high_school_chemistry|0\": {\n \"acc_norm\"\
: 0.4433497536945813,\n \"acc_norm_stderr\": 0.03495334582162934\n },\n\
\ \"community|arabic_mmlu:high_school_computer_science|0\": {\n \"acc_norm\"\
: 0.55,\n \"acc_norm_stderr\": 0.05\n },\n \"community|arabic_mmlu:high_school_european_history|0\"\
: {\n \"acc_norm\": 0.23636363636363636,\n \"acc_norm_stderr\": 0.033175059300091805\n\
\ },\n \"community|arabic_mmlu:high_school_geography|0\": {\n \"acc_norm\"\
: 0.32323232323232326,\n \"acc_norm_stderr\": 0.03332299921070645\n },\n\
\ \"community|arabic_mmlu:high_school_government_and_politics|0\": {\n \
\ \"acc_norm\": 0.40414507772020725,\n \"acc_norm_stderr\": 0.0354150857888402\n\
\ },\n \"community|arabic_mmlu:high_school_macroeconomics|0\": {\n \
\ \"acc_norm\": 0.43333333333333335,\n \"acc_norm_stderr\": 0.025124653525885127\n\
\ },\n \"community|arabic_mmlu:high_school_mathematics|0\": {\n \"\
acc_norm\": 0.2814814814814815,\n \"acc_norm_stderr\": 0.027420019350945277\n\
\ },\n \"community|arabic_mmlu:high_school_microeconomics|0\": {\n \
\ \"acc_norm\": 0.3907563025210084,\n \"acc_norm_stderr\": 0.03169380235712997\n\
\ },\n \"community|arabic_mmlu:high_school_physics|0\": {\n \"acc_norm\"\
: 0.2847682119205298,\n \"acc_norm_stderr\": 0.036848815213890225\n },\n\
\ \"community|arabic_mmlu:high_school_psychology|0\": {\n \"acc_norm\"\
: 0.3651376146788991,\n \"acc_norm_stderr\": 0.020642801454383998\n },\n\
\ \"community|arabic_mmlu:high_school_statistics|0\": {\n \"acc_norm\"\
: 0.2962962962962963,\n \"acc_norm_stderr\": 0.03114144782353603\n },\n\
\ \"community|arabic_mmlu:high_school_us_history|0\": {\n \"acc_norm\"\
: 0.21568627450980393,\n \"acc_norm_stderr\": 0.028867431449849313\n },\n\
\ \"community|arabic_mmlu:high_school_world_history|0\": {\n \"acc_norm\"\
: 0.29957805907172996,\n \"acc_norm_stderr\": 0.029818024749753095\n },\n\
\ \"community|arabic_mmlu:human_aging|0\": {\n \"acc_norm\": 0.42152466367713004,\n\
\ \"acc_norm_stderr\": 0.03314190222110657\n },\n \"community|arabic_mmlu:human_sexuality|0\"\
: {\n \"acc_norm\": 0.4351145038167939,\n \"acc_norm_stderr\": 0.04348208051644858\n\
\ },\n \"community|arabic_mmlu:international_law|0\": {\n \"acc_norm\"\
: 0.6115702479338843,\n \"acc_norm_stderr\": 0.04449270350068382\n },\n\
\ \"community|arabic_mmlu:jurisprudence|0\": {\n \"acc_norm\": 0.5092592592592593,\n\
\ \"acc_norm_stderr\": 0.04832853553437055\n },\n \"community|arabic_mmlu:logical_fallacies|0\"\
: {\n \"acc_norm\": 0.44785276073619634,\n \"acc_norm_stderr\": 0.039069474794566024\n\
\ },\n \"community|arabic_mmlu:machine_learning|0\": {\n \"acc_norm\"\
: 0.33035714285714285,\n \"acc_norm_stderr\": 0.04464285714285712\n },\n\
\ \"community|arabic_mmlu:management|0\": {\n \"acc_norm\": 0.49514563106796117,\n\
\ \"acc_norm_stderr\": 0.049505043821289195\n },\n \"community|arabic_mmlu:marketing|0\"\
: {\n \"acc_norm\": 0.6025641025641025,\n \"acc_norm_stderr\": 0.03205953453789293\n\
\ },\n \"community|arabic_mmlu:medical_genetics|0\": {\n \"acc_norm\"\
: 0.42,\n \"acc_norm_stderr\": 0.04960449637488584\n },\n \"community|arabic_mmlu:miscellaneous|0\"\
: {\n \"acc_norm\": 0.37547892720306514,\n \"acc_norm_stderr\": 0.01731661319718279\n\
\ },\n \"community|arabic_mmlu:moral_disputes|0\": {\n \"acc_norm\"\
: 0.4595375722543353,\n \"acc_norm_stderr\": 0.026830805998952233\n },\n\
\ \"community|arabic_mmlu:moral_scenarios|0\": {\n \"acc_norm\": 0.264804469273743,\n\
\ \"acc_norm_stderr\": 0.014756906483260666\n },\n \"community|arabic_mmlu:nutrition|0\"\
: {\n \"acc_norm\": 0.4673202614379085,\n \"acc_norm_stderr\": 0.02856869975222588\n\
\ },\n \"community|arabic_mmlu:philosophy|0\": {\n \"acc_norm\": 0.48231511254019294,\n\
\ \"acc_norm_stderr\": 0.028380322849077138\n },\n \"community|arabic_mmlu:prehistory|0\"\
: {\n \"acc_norm\": 0.3611111111111111,\n \"acc_norm_stderr\": 0.026725868809100783\n\
\ },\n \"community|arabic_mmlu:professional_accounting|0\": {\n \"\
acc_norm\": 0.31560283687943264,\n \"acc_norm_stderr\": 0.027724989449509314\n\
\ },\n \"community|arabic_mmlu:professional_law|0\": {\n \"acc_norm\"\
: 0.288135593220339,\n \"acc_norm_stderr\": 0.011567140661324561\n },\n\
\ \"community|arabic_mmlu:professional_medicine|0\": {\n \"acc_norm\"\
: 0.1948529411764706,\n \"acc_norm_stderr\": 0.024060599423487417\n },\n\
\ \"community|arabic_mmlu:professional_psychology|0\": {\n \"acc_norm\"\
: 0.3627450980392157,\n \"acc_norm_stderr\": 0.019450768432505518\n },\n\
\ \"community|arabic_mmlu:public_relations|0\": {\n \"acc_norm\": 0.43636363636363634,\n\
\ \"acc_norm_stderr\": 0.04750185058907297\n },\n \"community|arabic_mmlu:security_studies|0\"\
: {\n \"acc_norm\": 0.49387755102040815,\n \"acc_norm_stderr\": 0.03200682020163907\n\
\ },\n \"community|arabic_mmlu:sociology|0\": {\n \"acc_norm\": 0.5472636815920398,\n\
\ \"acc_norm_stderr\": 0.03519702717576915\n },\n \"community|arabic_mmlu:us_foreign_policy|0\"\
: {\n \"acc_norm\": 0.61,\n \"acc_norm_stderr\": 0.04902071300001974\n\
\ },\n \"community|arabic_mmlu:virology|0\": {\n \"acc_norm\": 0.43373493975903615,\n\
\ \"acc_norm_stderr\": 0.03858158940685517\n },\n \"community|arabic_mmlu:world_religions|0\"\
: {\n \"acc_norm\": 0.39766081871345027,\n \"acc_norm_stderr\": 0.0375363895576169\n\
\ },\n \"community|arc_challenge_okapi_ar|0\": {\n \"acc_norm\": 0.3905172413793103,\n\
\ \"acc_norm_stderr\": 0.014330425995124086\n },\n \"community|arc_easy_ar|0\"\
: {\n \"acc_norm\": 0.39382402707275804,\n \"acc_norm_stderr\": 0.010051215921647427\n\
\ },\n \"community|boolq_ar|0\": {\n \"acc_norm\": 0.7447852760736197,\n\
\ \"acc_norm_stderr\": 0.007637060376807818\n },\n \"community|copa_ext_ar|0\"\
: {\n \"acc_norm\": 0.5111111111111111,\n \"acc_norm_stderr\": 0.05298680599073449\n\
\ },\n \"community|hellaswag_okapi_ar|0\": {\n \"acc_norm\": 0.2830661868934685,\n\
\ \"acc_norm_stderr\": 0.004704341723374388\n },\n \"community|openbook_qa_ext_ar|0\"\
: {\n \"acc_norm\": 0.44646464646464645,\n \"acc_norm_stderr\": 0.022366742858015935\n\
\ },\n \"community|piqa_ar|0\": {\n \"acc_norm\": 0.5624659028914348,\n\
\ \"acc_norm_stderr\": 0.011590210326976888\n },\n \"community|race_ar|0\"\
: {\n \"acc_norm\": 0.3901399878271455,\n \"acc_norm_stderr\": 0.006948482411120556\n\
\ },\n \"community|sciq_ar|0\": {\n \"acc_norm\": 0.5547738693467337,\n\
\ \"acc_norm_stderr\": 0.01576358994150297\n },\n \"community|toxigen_ar|0\"\
: {\n \"acc_norm\": 0.4374331550802139,\n \"acc_norm_stderr\": 0.01623190443350122\n\
\ },\n \"lighteval|xstory_cloze:ar|0\": {\n \"acc\": 0.5598941098610192,\n\
\ \"acc_stderr\": 0.012774475160716331\n },\n \"community|acva:_average|0\"\
: {\n \"acc_norm\": 0.5523583277835948,\n \"acc_norm_stderr\": 0.04789632543236864\n\
\ },\n \"community|alghafa:_average|0\": {\n \"acc_norm\": 0.4730138876990558,\n\
\ \"acc_norm_stderr\": 0.0225513346017991\n },\n \"community|arabic_mmlu:_average|0\"\
: {\n \"acc_norm\": 0.3902423063869031,\n \"acc_norm_stderr\": 0.03573142224104745\n\
\ }\n}\n```"
repo_url: https://huggingface.co/microsoft/Phi-3-medium-4k-instruct
configs:
- config_name: community_acva_Algeria_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|acva:Algeria|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|acva:Algeria|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_acva_Ancient_Egypt_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|acva:Ancient_Egypt|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|acva:Ancient_Egypt|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_acva_Arab_Empire_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|acva:Arab_Empire|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|acva:Arab_Empire|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_acva_Arabic_Architecture_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|acva:Arabic_Architecture|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|acva:Arabic_Architecture|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_acva_Arabic_Art_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|acva:Arabic_Art|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|acva:Arabic_Art|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_acva_Arabic_Astronomy_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|acva:Arabic_Astronomy|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|acva:Arabic_Astronomy|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_acva_Arabic_Calligraphy_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|acva:Arabic_Calligraphy|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|acva:Arabic_Calligraphy|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_acva_Arabic_Ceremony_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|acva:Arabic_Ceremony|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|acva:Arabic_Ceremony|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_acva_Arabic_Clothing_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|acva:Arabic_Clothing|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|acva:Arabic_Clothing|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_acva_Arabic_Culture_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|acva:Arabic_Culture|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|acva:Arabic_Culture|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_acva_Arabic_Food_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|acva:Arabic_Food|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|acva:Arabic_Food|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_acva_Arabic_Funeral_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|acva:Arabic_Funeral|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|acva:Arabic_Funeral|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_acva_Arabic_Geography_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|acva:Arabic_Geography|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|acva:Arabic_Geography|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_acva_Arabic_History_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|acva:Arabic_History|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|acva:Arabic_History|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_acva_Arabic_Language_Origin_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|acva:Arabic_Language_Origin|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|acva:Arabic_Language_Origin|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_acva_Arabic_Literature_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|acva:Arabic_Literature|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|acva:Arabic_Literature|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_acva_Arabic_Math_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|acva:Arabic_Math|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|acva:Arabic_Math|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_acva_Arabic_Medicine_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|acva:Arabic_Medicine|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|acva:Arabic_Medicine|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_acva_Arabic_Music_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|acva:Arabic_Music|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|acva:Arabic_Music|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_acva_Arabic_Ornament_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|acva:Arabic_Ornament|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|acva:Arabic_Ornament|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_acva_Arabic_Philosophy_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|acva:Arabic_Philosophy|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|acva:Arabic_Philosophy|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_acva_Arabic_Physics_and_Chemistry_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|acva:Arabic_Physics_and_Chemistry|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|acva:Arabic_Physics_and_Chemistry|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_acva_Arabic_Wedding_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|acva:Arabic_Wedding|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|acva:Arabic_Wedding|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_acva_Bahrain_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|acva:Bahrain|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|acva:Bahrain|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_acva_Comoros_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|acva:Comoros|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|acva:Comoros|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_acva_Egypt_modern_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|acva:Egypt_modern|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|acva:Egypt_modern|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_acva_InfluenceFromAncientEgypt_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|acva:InfluenceFromAncientEgypt|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|acva:InfluenceFromAncientEgypt|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_acva_InfluenceFromByzantium_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|acva:InfluenceFromByzantium|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|acva:InfluenceFromByzantium|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_acva_InfluenceFromChina_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|acva:InfluenceFromChina|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|acva:InfluenceFromChina|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_acva_InfluenceFromGreece_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|acva:InfluenceFromGreece|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|acva:InfluenceFromGreece|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_acva_InfluenceFromIslam_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|acva:InfluenceFromIslam|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|acva:InfluenceFromIslam|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_acva_InfluenceFromPersia_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|acva:InfluenceFromPersia|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|acva:InfluenceFromPersia|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_acva_InfluenceFromRome_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|acva:InfluenceFromRome|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|acva:InfluenceFromRome|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_acva_Iraq_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|acva:Iraq|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|acva:Iraq|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_acva_Islam_Education_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|acva:Islam_Education|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|acva:Islam_Education|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_acva_Islam_branches_and_schools_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|acva:Islam_branches_and_schools|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|acva:Islam_branches_and_schools|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_acva_Islamic_law_system_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|acva:Islamic_law_system|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|acva:Islamic_law_system|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_acva_Jordan_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|acva:Jordan|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|acva:Jordan|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_acva_Kuwait_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|acva:Kuwait|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|acva:Kuwait|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_acva_Lebanon_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|acva:Lebanon|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|acva:Lebanon|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_acva_Libya_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|acva:Libya|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|acva:Libya|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_acva_Mauritania_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|acva:Mauritania|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|acva:Mauritania|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_acva_Mesopotamia_civilization_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|acva:Mesopotamia_civilization|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|acva:Mesopotamia_civilization|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_acva_Morocco_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|acva:Morocco|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|acva:Morocco|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_acva_Oman_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|acva:Oman|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|acva:Oman|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_acva_Palestine_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|acva:Palestine|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|acva:Palestine|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_acva_Qatar_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|acva:Qatar|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|acva:Qatar|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_acva_Saudi_Arabia_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|acva:Saudi_Arabia|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|acva:Saudi_Arabia|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_acva_Somalia_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|acva:Somalia|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|acva:Somalia|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_acva_Sudan_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|acva:Sudan|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|acva:Sudan|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_acva_Syria_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|acva:Syria|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|acva:Syria|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_acva_Tunisia_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|acva:Tunisia|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|acva:Tunisia|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_acva_United_Arab_Emirates_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|acva:United_Arab_Emirates|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|acva:United_Arab_Emirates|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_acva_Yemen_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|acva:Yemen|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|acva:Yemen|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_acva_communication_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|acva:communication|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|acva:communication|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_acva_computer_and_phone_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|acva:computer_and_phone|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|acva:computer_and_phone|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_acva_daily_life_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|acva:daily_life|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|acva:daily_life|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_acva_entertainment_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|acva:entertainment|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|acva:entertainment|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_alghafa_mcq_exams_test_ar_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|alghafa:mcq_exams_test_ar|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|alghafa:mcq_exams_test_ar|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_alghafa_meta_ar_dialects_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|alghafa:meta_ar_dialects|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|alghafa:meta_ar_dialects|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_alghafa_meta_ar_msa_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|alghafa:meta_ar_msa|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|alghafa:meta_ar_msa|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_alghafa_multiple_choice_facts_truefalse_balanced_task_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|alghafa:multiple_choice_facts_truefalse_balanced_task|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|alghafa:multiple_choice_facts_truefalse_balanced_task|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_alghafa_multiple_choice_grounded_statement_soqal_task_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|alghafa:multiple_choice_grounded_statement_soqal_task|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|alghafa:multiple_choice_grounded_statement_soqal_task|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_alghafa_multiple_choice_grounded_statement_xglue_mlqa_task_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|alghafa:multiple_choice_grounded_statement_xglue_mlqa_task|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|alghafa:multiple_choice_grounded_statement_xglue_mlqa_task|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_alghafa_multiple_choice_rating_sentiment_no_neutral_task_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|alghafa:multiple_choice_rating_sentiment_no_neutral_task|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|alghafa:multiple_choice_rating_sentiment_no_neutral_task|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_alghafa_multiple_choice_rating_sentiment_task_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|alghafa:multiple_choice_rating_sentiment_task|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|alghafa:multiple_choice_rating_sentiment_task|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_alghafa_multiple_choice_sentiment_task_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|alghafa:multiple_choice_sentiment_task|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|alghafa:multiple_choice_sentiment_task|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_arabic_exams_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|arabic_exams|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|arabic_exams|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_arabic_mmlu_abstract_algebra_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|arabic_mmlu:abstract_algebra|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|arabic_mmlu:abstract_algebra|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_arabic_mmlu_anatomy_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|arabic_mmlu:anatomy|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|arabic_mmlu:anatomy|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_arabic_mmlu_astronomy_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|arabic_mmlu:astronomy|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|arabic_mmlu:astronomy|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_arabic_mmlu_business_ethics_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|arabic_mmlu:business_ethics|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|arabic_mmlu:business_ethics|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_arabic_mmlu_clinical_knowledge_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|arabic_mmlu:clinical_knowledge|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|arabic_mmlu:clinical_knowledge|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_arabic_mmlu_college_biology_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|arabic_mmlu:college_biology|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|arabic_mmlu:college_biology|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_arabic_mmlu_college_chemistry_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|arabic_mmlu:college_chemistry|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|arabic_mmlu:college_chemistry|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_arabic_mmlu_college_computer_science_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|arabic_mmlu:college_computer_science|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|arabic_mmlu:college_computer_science|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_arabic_mmlu_college_mathematics_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|arabic_mmlu:college_mathematics|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|arabic_mmlu:college_mathematics|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_arabic_mmlu_college_medicine_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|arabic_mmlu:college_medicine|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|arabic_mmlu:college_medicine|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_arabic_mmlu_college_physics_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|arabic_mmlu:college_physics|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|arabic_mmlu:college_physics|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_arabic_mmlu_computer_security_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|arabic_mmlu:computer_security|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|arabic_mmlu:computer_security|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_arabic_mmlu_conceptual_physics_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|arabic_mmlu:conceptual_physics|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|arabic_mmlu:conceptual_physics|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_arabic_mmlu_econometrics_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|arabic_mmlu:econometrics|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|arabic_mmlu:econometrics|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_arabic_mmlu_electrical_engineering_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|arabic_mmlu:electrical_engineering|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|arabic_mmlu:electrical_engineering|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_arabic_mmlu_elementary_mathematics_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|arabic_mmlu:elementary_mathematics|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|arabic_mmlu:elementary_mathematics|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_arabic_mmlu_formal_logic_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|arabic_mmlu:formal_logic|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|arabic_mmlu:formal_logic|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_arabic_mmlu_global_facts_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|arabic_mmlu:global_facts|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|arabic_mmlu:global_facts|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_arabic_mmlu_high_school_biology_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|arabic_mmlu:high_school_biology|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|arabic_mmlu:high_school_biology|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_arabic_mmlu_high_school_chemistry_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|arabic_mmlu:high_school_chemistry|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|arabic_mmlu:high_school_chemistry|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_arabic_mmlu_high_school_computer_science_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|arabic_mmlu:high_school_computer_science|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|arabic_mmlu:high_school_computer_science|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_arabic_mmlu_high_school_european_history_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|arabic_mmlu:high_school_european_history|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|arabic_mmlu:high_school_european_history|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_arabic_mmlu_high_school_geography_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|arabic_mmlu:high_school_geography|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|arabic_mmlu:high_school_geography|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_arabic_mmlu_high_school_government_and_politics_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|arabic_mmlu:high_school_government_and_politics|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|arabic_mmlu:high_school_government_and_politics|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_arabic_mmlu_high_school_macroeconomics_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|arabic_mmlu:high_school_macroeconomics|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|arabic_mmlu:high_school_macroeconomics|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_arabic_mmlu_high_school_mathematics_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|arabic_mmlu:high_school_mathematics|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|arabic_mmlu:high_school_mathematics|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_arabic_mmlu_high_school_microeconomics_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|arabic_mmlu:high_school_microeconomics|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|arabic_mmlu:high_school_microeconomics|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_arabic_mmlu_high_school_physics_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|arabic_mmlu:high_school_physics|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|arabic_mmlu:high_school_physics|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_arabic_mmlu_high_school_psychology_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|arabic_mmlu:high_school_psychology|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|arabic_mmlu:high_school_psychology|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_arabic_mmlu_high_school_statistics_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|arabic_mmlu:high_school_statistics|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|arabic_mmlu:high_school_statistics|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_arabic_mmlu_high_school_us_history_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|arabic_mmlu:high_school_us_history|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|arabic_mmlu:high_school_us_history|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_arabic_mmlu_high_school_world_history_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|arabic_mmlu:high_school_world_history|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|arabic_mmlu:high_school_world_history|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_arabic_mmlu_human_aging_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|arabic_mmlu:human_aging|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|arabic_mmlu:human_aging|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_arabic_mmlu_human_sexuality_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|arabic_mmlu:human_sexuality|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|arabic_mmlu:human_sexuality|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_arabic_mmlu_international_law_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|arabic_mmlu:international_law|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|arabic_mmlu:international_law|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_arabic_mmlu_jurisprudence_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|arabic_mmlu:jurisprudence|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|arabic_mmlu:jurisprudence|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_arabic_mmlu_logical_fallacies_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|arabic_mmlu:logical_fallacies|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|arabic_mmlu:logical_fallacies|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_arabic_mmlu_machine_learning_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|arabic_mmlu:machine_learning|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|arabic_mmlu:machine_learning|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_arabic_mmlu_management_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|arabic_mmlu:management|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|arabic_mmlu:management|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_arabic_mmlu_marketing_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|arabic_mmlu:marketing|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|arabic_mmlu:marketing|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_arabic_mmlu_medical_genetics_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|arabic_mmlu:medical_genetics|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|arabic_mmlu:medical_genetics|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_arabic_mmlu_miscellaneous_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|arabic_mmlu:miscellaneous|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|arabic_mmlu:miscellaneous|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_arabic_mmlu_moral_disputes_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|arabic_mmlu:moral_disputes|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|arabic_mmlu:moral_disputes|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_arabic_mmlu_moral_scenarios_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|arabic_mmlu:moral_scenarios|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|arabic_mmlu:moral_scenarios|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_arabic_mmlu_nutrition_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|arabic_mmlu:nutrition|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|arabic_mmlu:nutrition|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_arabic_mmlu_philosophy_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|arabic_mmlu:philosophy|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|arabic_mmlu:philosophy|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_arabic_mmlu_prehistory_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|arabic_mmlu:prehistory|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|arabic_mmlu:prehistory|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_arabic_mmlu_professional_accounting_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|arabic_mmlu:professional_accounting|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|arabic_mmlu:professional_accounting|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_arabic_mmlu_professional_law_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|arabic_mmlu:professional_law|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|arabic_mmlu:professional_law|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_arabic_mmlu_professional_medicine_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|arabic_mmlu:professional_medicine|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|arabic_mmlu:professional_medicine|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_arabic_mmlu_professional_psychology_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|arabic_mmlu:professional_psychology|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|arabic_mmlu:professional_psychology|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_arabic_mmlu_public_relations_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|arabic_mmlu:public_relations|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|arabic_mmlu:public_relations|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_arabic_mmlu_security_studies_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|arabic_mmlu:security_studies|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|arabic_mmlu:security_studies|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_arabic_mmlu_sociology_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|arabic_mmlu:sociology|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|arabic_mmlu:sociology|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_arabic_mmlu_us_foreign_policy_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|arabic_mmlu:us_foreign_policy|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|arabic_mmlu:us_foreign_policy|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_arabic_mmlu_virology_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|arabic_mmlu:virology|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|arabic_mmlu:virology|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_arabic_mmlu_world_religions_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|arabic_mmlu:world_religions|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|arabic_mmlu:world_religions|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_arc_challenge_okapi_ar_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|arc_challenge_okapi_ar|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|arc_challenge_okapi_ar|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_arc_easy_ar_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|arc_easy_ar|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|arc_easy_ar|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_boolq_ar_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|boolq_ar|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|boolq_ar|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_copa_ext_ar_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|copa_ext_ar|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|copa_ext_ar|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_hellaswag_okapi_ar_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|hellaswag_okapi_ar|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|hellaswag_okapi_ar|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_openbook_qa_ext_ar_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|openbook_qa_ext_ar|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|openbook_qa_ext_ar|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_piqa_ar_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|piqa_ar|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|piqa_ar|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_race_ar_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|race_ar|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|race_ar|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_sciq_ar_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|sciq_ar|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|sciq_ar|0_2024-10-11T17-01-44.350127.parquet'
- config_name: community_toxigen_ar_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_community|toxigen_ar|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_community|toxigen_ar|0_2024-10-11T17-01-44.350127.parquet'
- config_name: lighteval_xstory_cloze_ar_0
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- '**/details_lighteval|xstory_cloze:ar|0_2024-10-11T17-01-44.350127.parquet'
- split: latest
path:
- '**/details_lighteval|xstory_cloze:ar|0_2024-10-11T17-01-44.350127.parquet'
- config_name: results
data_files:
- split: 2024_10_11T17_01_44.350127
path:
- results_2024-10-11T17-01-44.350127.parquet
- split: latest
path:
- results_2024-10-11T17-01-44.350127.parquet
---
# Dataset Card for Evaluation run of microsoft/Phi-3-medium-4k-instruct
<!-- Provide a quick summary of the dataset. -->
Dataset automatically created during the evaluation run of model [microsoft/Phi-3-medium-4k-instruct](https://huggingface.co/microsoft/Phi-3-medium-4k-instruct).
The dataset is composed of 136 configuration, each one coresponding to one of the evaluated task.
The dataset has been created from 1 run(s). Each run can be found as a specific split in each configuration, the split being named using the timestamp of the run.The "train" split is always pointing to the latest results.
An additional configuration "results" store all the aggregated results of the run.
To load the details from a run, you can for instance do the following:
```python
from datasets import load_dataset
data = load_dataset("OALL/details_microsoft__Phi-3-medium-4k-instruct",
"lighteval_xstory_cloze_ar_0",
split="train")
```
## Latest results
These are the [latest results from run 2024-10-11T17:01:44.350127](https://huggingface.co/datasets/OALL/details_microsoft__Phi-3-medium-4k-instruct/blob/main/results_2024-10-11T17-01-44.350127.json)(note that their might be results for other tasks in the repos if successive evals didn't cover the same tasks. You find each in the results and the "latest" split for each eval):
```python
{
"all": {
"acc_norm": 0.47111505023267153,
"acc_norm_stderr": 0.03852465552924525,
"acc": 0.5598941098610192,
"acc_stderr": 0.012774475160716331
},
"community|acva:Algeria|0": {
"acc_norm": 0.5846153846153846,
"acc_norm_stderr": 0.03538013280575029
},
"community|acva:Ancient_Egypt|0": {
"acc_norm": 0.6,
"acc_norm_stderr": 0.0276465406550454
},
"community|acva:Arab_Empire|0": {
"acc_norm": 0.35471698113207545,
"acc_norm_stderr": 0.029445175328199593
},
"community|acva:Arabic_Architecture|0": {
"acc_norm": 0.6564102564102564,
"acc_norm_stderr": 0.03409627301409855
},
"community|acva:Arabic_Art|0": {
"acc_norm": 0.36923076923076925,
"acc_norm_stderr": 0.03464841141863756
},
"community|acva:Arabic_Astronomy|0": {
"acc_norm": 0.48717948717948717,
"acc_norm_stderr": 0.03588610523192216
},
"community|acva:Arabic_Calligraphy|0": {
"acc_norm": 0.796078431372549,
"acc_norm_stderr": 0.025280907058814615
},
"community|acva:Arabic_Ceremony|0": {
"acc_norm": 0.6,
"acc_norm_stderr": 0.03611575592573071
},
"community|acva:Arabic_Clothing|0": {
"acc_norm": 0.4307692307692308,
"acc_norm_stderr": 0.03555213252058761
},
"community|acva:Arabic_Culture|0": {
"acc_norm": 0.6615384615384615,
"acc_norm_stderr": 0.03397280032734095
},
"community|acva:Arabic_Food|0": {
"acc_norm": 0.6666666666666666,
"acc_norm_stderr": 0.033844872171120644
},
"community|acva:Arabic_Funeral|0": {
"acc_norm": 0.4105263157894737,
"acc_norm_stderr": 0.050738635645512106
},
"community|acva:Arabic_Geography|0": {
"acc_norm": 0.5586206896551724,
"acc_norm_stderr": 0.04137931034482758
},
"community|acva:Arabic_History|0": {
"acc_norm": 0.35384615384615387,
"acc_norm_stderr": 0.03433004254147036
},
"community|acva:Arabic_Language_Origin|0": {
"acc_norm": 0.7052631578947368,
"acc_norm_stderr": 0.047025008739248385
},
"community|acva:Arabic_Literature|0": {
"acc_norm": 0.6206896551724138,
"acc_norm_stderr": 0.040434618619167466
},
"community|acva:Arabic_Math|0": {
"acc_norm": 0.4307692307692308,
"acc_norm_stderr": 0.0355521325205876
},
"community|acva:Arabic_Medicine|0": {
"acc_norm": 0.6137931034482759,
"acc_norm_stderr": 0.04057324734419035
},
"community|acva:Arabic_Music|0": {
"acc_norm": 0.2517985611510791,
"acc_norm_stderr": 0.03694846055443904
},
"community|acva:Arabic_Ornament|0": {
"acc_norm": 0.5794871794871795,
"acc_norm_stderr": 0.03544138389303483
},
"community|acva:Arabic_Philosophy|0": {
"acc_norm": 0.593103448275862,
"acc_norm_stderr": 0.04093793981266236
},
"community|acva:Arabic_Physics_and_Chemistry|0": {
"acc_norm": 0.558974358974359,
"acc_norm_stderr": 0.03564732931853579
},
"community|acva:Arabic_Wedding|0": {
"acc_norm": 0.676923076923077,
"acc_norm_stderr": 0.03357544396403132
},
"community|acva:Bahrain|0": {
"acc_norm": 0.5111111111111111,
"acc_norm_stderr": 0.07535922203472523
},
"community|acva:Comoros|0": {
"acc_norm": 0.4888888888888889,
"acc_norm_stderr": 0.07535922203472523
},
"community|acva:Egypt_modern|0": {
"acc_norm": 0.5894736842105263,
"acc_norm_stderr": 0.05073863564551208
},
"community|acva:InfluenceFromAncientEgypt|0": {
"acc_norm": 0.6666666666666666,
"acc_norm_stderr": 0.033844872171120644
},
"community|acva:InfluenceFromByzantium|0": {
"acc_norm": 0.7103448275862069,
"acc_norm_stderr": 0.03780019230438015
},
"community|acva:InfluenceFromChina|0": {
"acc_norm": 0.27692307692307694,
"acc_norm_stderr": 0.032127058190759304
},
"community|acva:InfluenceFromGreece|0": {
"acc_norm": 0.7794871794871795,
"acc_norm_stderr": 0.029766004661644124
},
"community|acva:InfluenceFromIslam|0": {
"acc_norm": 0.7034482758620689,
"acc_norm_stderr": 0.03806142687309992
},
"community|acva:InfluenceFromPersia|0": {
"acc_norm": 0.8228571428571428,
"acc_norm_stderr": 0.028943391569621377
},
"community|acva:InfluenceFromRome|0": {
"acc_norm": 0.5692307692307692,
"acc_norm_stderr": 0.035552132520587594
},
"community|acva:Iraq|0": {
"acc_norm": 0.6,
"acc_norm_stderr": 0.05345224838248487
},
"community|acva:Islam_Education|0": {
"acc_norm": 0.558974358974359,
"acc_norm_stderr": 0.0356473293185358
},
"community|acva:Islam_branches_and_schools|0": {
"acc_norm": 0.5714285714285714,
"acc_norm_stderr": 0.037516123674206446
},
"community|acva:Islamic_law_system|0": {
"acc_norm": 0.6153846153846154,
"acc_norm_stderr": 0.03492896993742303
},
"community|acva:Jordan|0": {
"acc_norm": 0.4222222222222222,
"acc_norm_stderr": 0.07446027270295806
},
"community|acva:Kuwait|0": {
"acc_norm": 0.6444444444444445,
"acc_norm_stderr": 0.07216392363431011
},
"community|acva:Lebanon|0": {
"acc_norm": 0.5333333333333333,
"acc_norm_stderr": 0.0752101433090355
},
"community|acva:Libya|0": {
"acc_norm": 0.5777777777777777,
"acc_norm_stderr": 0.07446027270295806
},
"community|acva:Mauritania|0": {
"acc_norm": 0.6222222222222222,
"acc_norm_stderr": 0.07309112127323451
},
"community|acva:Mesopotamia_civilization|0": {
"acc_norm": 0.632258064516129,
"acc_norm_stderr": 0.03885602832856746
},
"community|acva:Morocco|0": {
"acc_norm": 0.5111111111111111,
"acc_norm_stderr": 0.07535922203472523
},
"community|acva:Oman|0": {
"acc_norm": 0.6888888888888889,
"acc_norm_stderr": 0.06979205927323111
},
"community|acva:Palestine|0": {
"acc_norm": 0.5058823529411764,
"acc_norm_stderr": 0.05455069703232772
},
"community|acva:Qatar|0": {
"acc_norm": 0.5777777777777777,
"acc_norm_stderr": 0.07446027270295805
},
"community|acva:Saudi_Arabia|0": {
"acc_norm": 0.517948717948718,
"acc_norm_stderr": 0.03587477098773826
},
"community|acva:Somalia|0": {
"acc_norm": 0.5555555555555556,
"acc_norm_stderr": 0.07491109582924915
},
"community|acva:Sudan|0": {
"acc_norm": 0.4,
"acc_norm_stderr": 0.07385489458759965
},
"community|acva:Syria|0": {
"acc_norm": 0.6,
"acc_norm_stderr": 0.07385489458759965
},
"community|acva:Tunisia|0": {
"acc_norm": 0.5333333333333333,
"acc_norm_stderr": 0.0752101433090355
},
"community|acva:United_Arab_Emirates|0": {
"acc_norm": 0.5176470588235295,
"acc_norm_stderr": 0.05452048340661897
},
"community|acva:Yemen|0": {
"acc_norm": 0.4,
"acc_norm_stderr": 0.16329931618554522
},
"community|acva:communication|0": {
"acc_norm": 0.48626373626373626,
"acc_norm_stderr": 0.026233288793681565
},
"community|acva:computer_and_phone|0": {
"acc_norm": 0.47796610169491527,
"acc_norm_stderr": 0.029132263908368084
},
"community|acva:daily_life|0": {
"acc_norm": 0.3560830860534125,
"acc_norm_stderr": 0.02612287368198665
},
"community|acva:entertainment|0": {
"acc_norm": 0.45084745762711864,
"acc_norm_stderr": 0.029019347731871377
},
"community|alghafa:mcq_exams_test_ar|0": {
"acc_norm": 0.3105924596050269,
"acc_norm_stderr": 0.019624385782512334
},
"community|alghafa:meta_ar_dialects|0": {
"acc_norm": 0.318628359592215,
"acc_norm_stderr": 0.006344227814191393
},
"community|alghafa:meta_ar_msa|0": {
"acc_norm": 0.3776536312849162,
"acc_norm_stderr": 0.016214148752136632
},
"community|alghafa:multiple_choice_facts_truefalse_balanced_task|0": {
"acc_norm": 0.52,
"acc_norm_stderr": 0.05807730170189531
},
"community|alghafa:multiple_choice_grounded_statement_soqal_task|0": {
"acc_norm": 0.6066666666666667,
"acc_norm_stderr": 0.040018638461474625
},
"community|alghafa:multiple_choice_grounded_statement_xglue_mlqa_task|0": {
"acc_norm": 0.4,
"acc_norm_stderr": 0.040134003725439044
},
"community|alghafa:multiple_choice_rating_sentiment_no_neutral_task|0": {
"acc_norm": 0.8033771106941838,
"acc_norm_stderr": 0.004445234658793058
},
"community|alghafa:multiple_choice_rating_sentiment_task|0": {
"acc_norm": 0.5457881567973311,
"acc_norm_stderr": 0.006431065182552263
},
"community|alghafa:multiple_choice_sentiment_task|0": {
"acc_norm": 0.3744186046511628,
"acc_norm_stderr": 0.011673005337197204
},
"community|arabic_exams|0": {
"acc_norm": 0.34823091247672255,
"acc_norm_stderr": 0.02057776223602678
},
"community|arabic_mmlu:abstract_algebra|0": {
"acc_norm": 0.36,
"acc_norm_stderr": 0.04824181513244218
},
"community|arabic_mmlu:anatomy|0": {
"acc_norm": 0.2814814814814815,
"acc_norm_stderr": 0.03885004245800254
},
"community|arabic_mmlu:astronomy|0": {
"acc_norm": 0.42105263157894735,
"acc_norm_stderr": 0.04017901275981749
},
"community|arabic_mmlu:business_ethics|0": {
"acc_norm": 0.56,
"acc_norm_stderr": 0.049888765156985884
},
"community|arabic_mmlu:clinical_knowledge|0": {
"acc_norm": 0.4075471698113208,
"acc_norm_stderr": 0.030242233800854498
},
"community|arabic_mmlu:college_biology|0": {
"acc_norm": 0.2916666666666667,
"acc_norm_stderr": 0.038009680605548594
},
"community|arabic_mmlu:college_chemistry|0": {
"acc_norm": 0.28,
"acc_norm_stderr": 0.045126085985421276
},
"community|arabic_mmlu:college_computer_science|0": {
"acc_norm": 0.32,
"acc_norm_stderr": 0.046882617226215034
},
"community|arabic_mmlu:college_mathematics|0": {
"acc_norm": 0.24,
"acc_norm_stderr": 0.04292346959909282
},
"community|arabic_mmlu:college_medicine|0": {
"acc_norm": 0.3063583815028902,
"acc_norm_stderr": 0.03514942551267437
},
"community|arabic_mmlu:college_physics|0": {
"acc_norm": 0.22549019607843138,
"acc_norm_stderr": 0.041583075330832865
},
"community|arabic_mmlu:computer_security|0": {
"acc_norm": 0.46,
"acc_norm_stderr": 0.05009082659620333
},
"community|arabic_mmlu:conceptual_physics|0": {
"acc_norm": 0.39148936170212767,
"acc_norm_stderr": 0.031907012423268113
},
"community|arabic_mmlu:econometrics|0": {
"acc_norm": 0.2894736842105263,
"acc_norm_stderr": 0.04266339443159394
},
"community|arabic_mmlu:electrical_engineering|0": {
"acc_norm": 0.4827586206896552,
"acc_norm_stderr": 0.04164188720169377
},
"community|arabic_mmlu:elementary_mathematics|0": {
"acc_norm": 0.4523809523809524,
"acc_norm_stderr": 0.02563425811555496
},
"community|arabic_mmlu:formal_logic|0": {
"acc_norm": 0.38095238095238093,
"acc_norm_stderr": 0.04343525428949098
},
"community|arabic_mmlu:global_facts|0": {
"acc_norm": 0.35,
"acc_norm_stderr": 0.047937248544110175
},
"community|arabic_mmlu:high_school_biology|0": {
"acc_norm": 0.45483870967741935,
"acc_norm_stderr": 0.02832774309156107
},
"community|arabic_mmlu:high_school_chemistry|0": {
"acc_norm": 0.4433497536945813,
"acc_norm_stderr": 0.03495334582162934
},
"community|arabic_mmlu:high_school_computer_science|0": {
"acc_norm": 0.55,
"acc_norm_stderr": 0.05
},
"community|arabic_mmlu:high_school_european_history|0": {
"acc_norm": 0.23636363636363636,
"acc_norm_stderr": 0.033175059300091805
},
"community|arabic_mmlu:high_school_geography|0": {
"acc_norm": 0.32323232323232326,
"acc_norm_stderr": 0.03332299921070645
},
"community|arabic_mmlu:high_school_government_and_politics|0": {
"acc_norm": 0.40414507772020725,
"acc_norm_stderr": 0.0354150857888402
},
"community|arabic_mmlu:high_school_macroeconomics|0": {
"acc_norm": 0.43333333333333335,
"acc_norm_stderr": 0.025124653525885127
},
"community|arabic_mmlu:high_school_mathematics|0": {
"acc_norm": 0.2814814814814815,
"acc_norm_stderr": 0.027420019350945277
},
"community|arabic_mmlu:high_school_microeconomics|0": {
"acc_norm": 0.3907563025210084,
"acc_norm_stderr": 0.03169380235712997
},
"community|arabic_mmlu:high_school_physics|0": {
"acc_norm": 0.2847682119205298,
"acc_norm_stderr": 0.036848815213890225
},
"community|arabic_mmlu:high_school_psychology|0": {
"acc_norm": 0.3651376146788991,
"acc_norm_stderr": 0.020642801454383998
},
"community|arabic_mmlu:high_school_statistics|0": {
"acc_norm": 0.2962962962962963,
"acc_norm_stderr": 0.03114144782353603
},
"community|arabic_mmlu:high_school_us_history|0": {
"acc_norm": 0.21568627450980393,
"acc_norm_stderr": 0.028867431449849313
},
"community|arabic_mmlu:high_school_world_history|0": {
"acc_norm": 0.29957805907172996,
"acc_norm_stderr": 0.029818024749753095
},
"community|arabic_mmlu:human_aging|0": {
"acc_norm": 0.42152466367713004,
"acc_norm_stderr": 0.03314190222110657
},
"community|arabic_mmlu:human_sexuality|0": {
"acc_norm": 0.4351145038167939,
"acc_norm_stderr": 0.04348208051644858
},
"community|arabic_mmlu:international_law|0": {
"acc_norm": 0.6115702479338843,
"acc_norm_stderr": 0.04449270350068382
},
"community|arabic_mmlu:jurisprudence|0": {
"acc_norm": 0.5092592592592593,
"acc_norm_stderr": 0.04832853553437055
},
"community|arabic_mmlu:logical_fallacies|0": {
"acc_norm": 0.44785276073619634,
"acc_norm_stderr": 0.039069474794566024
},
"community|arabic_mmlu:machine_learning|0": {
"acc_norm": 0.33035714285714285,
"acc_norm_stderr": 0.04464285714285712
},
"community|arabic_mmlu:management|0": {
"acc_norm": 0.49514563106796117,
"acc_norm_stderr": 0.049505043821289195
},
"community|arabic_mmlu:marketing|0": {
"acc_norm": 0.6025641025641025,
"acc_norm_stderr": 0.03205953453789293
},
"community|arabic_mmlu:medical_genetics|0": {
"acc_norm": 0.42,
"acc_norm_stderr": 0.04960449637488584
},
"community|arabic_mmlu:miscellaneous|0": {
"acc_norm": 0.37547892720306514,
"acc_norm_stderr": 0.01731661319718279
},
"community|arabic_mmlu:moral_disputes|0": {
"acc_norm": 0.4595375722543353,
"acc_norm_stderr": 0.026830805998952233
},
"community|arabic_mmlu:moral_scenarios|0": {
"acc_norm": 0.264804469273743,
"acc_norm_stderr": 0.014756906483260666
},
"community|arabic_mmlu:nutrition|0": {
"acc_norm": 0.4673202614379085,
"acc_norm_stderr": 0.02856869975222588
},
"community|arabic_mmlu:philosophy|0": {
"acc_norm": 0.48231511254019294,
"acc_norm_stderr": 0.028380322849077138
},
"community|arabic_mmlu:prehistory|0": {
"acc_norm": 0.3611111111111111,
"acc_norm_stderr": 0.026725868809100783
},
"community|arabic_mmlu:professional_accounting|0": {
"acc_norm": 0.31560283687943264,
"acc_norm_stderr": 0.027724989449509314
},
"community|arabic_mmlu:professional_law|0": {
"acc_norm": 0.288135593220339,
"acc_norm_stderr": 0.011567140661324561
},
"community|arabic_mmlu:professional_medicine|0": {
"acc_norm": 0.1948529411764706,
"acc_norm_stderr": 0.024060599423487417
},
"community|arabic_mmlu:professional_psychology|0": {
"acc_norm": 0.3627450980392157,
"acc_norm_stderr": 0.019450768432505518
},
"community|arabic_mmlu:public_relations|0": {
"acc_norm": 0.43636363636363634,
"acc_norm_stderr": 0.04750185058907297
},
"community|arabic_mmlu:security_studies|0": {
"acc_norm": 0.49387755102040815,
"acc_norm_stderr": 0.03200682020163907
},
"community|arabic_mmlu:sociology|0": {
"acc_norm": 0.5472636815920398,
"acc_norm_stderr": 0.03519702717576915
},
"community|arabic_mmlu:us_foreign_policy|0": {
"acc_norm": 0.61,
"acc_norm_stderr": 0.04902071300001974
},
"community|arabic_mmlu:virology|0": {
"acc_norm": 0.43373493975903615,
"acc_norm_stderr": 0.03858158940685517
},
"community|arabic_mmlu:world_religions|0": {
"acc_norm": 0.39766081871345027,
"acc_norm_stderr": 0.0375363895576169
},
"community|arc_challenge_okapi_ar|0": {
"acc_norm": 0.3905172413793103,
"acc_norm_stderr": 0.014330425995124086
},
"community|arc_easy_ar|0": {
"acc_norm": 0.39382402707275804,
"acc_norm_stderr": 0.010051215921647427
},
"community|boolq_ar|0": {
"acc_norm": 0.7447852760736197,
"acc_norm_stderr": 0.007637060376807818
},
"community|copa_ext_ar|0": {
"acc_norm": 0.5111111111111111,
"acc_norm_stderr": 0.05298680599073449
},
"community|hellaswag_okapi_ar|0": {
"acc_norm": 0.2830661868934685,
"acc_norm_stderr": 0.004704341723374388
},
"community|openbook_qa_ext_ar|0": {
"acc_norm": 0.44646464646464645,
"acc_norm_stderr": 0.022366742858015935
},
"community|piqa_ar|0": {
"acc_norm": 0.5624659028914348,
"acc_norm_stderr": 0.011590210326976888
},
"community|race_ar|0": {
"acc_norm": 0.3901399878271455,
"acc_norm_stderr": 0.006948482411120556
},
"community|sciq_ar|0": {
"acc_norm": 0.5547738693467337,
"acc_norm_stderr": 0.01576358994150297
},
"community|toxigen_ar|0": {
"acc_norm": 0.4374331550802139,
"acc_norm_stderr": 0.01623190443350122
},
"lighteval|xstory_cloze:ar|0": {
"acc": 0.5598941098610192,
"acc_stderr": 0.012774475160716331
},
"community|acva:_average|0": {
"acc_norm": 0.5523583277835948,
"acc_norm_stderr": 0.04789632543236864
},
"community|alghafa:_average|0": {
"acc_norm": 0.4730138876990558,
"acc_norm_stderr": 0.0225513346017991
},
"community|arabic_mmlu:_average|0": {
"acc_norm": 0.3902423063869031,
"acc_norm_stderr": 0.03573142224104745
}
}
```
## Dataset Details
### Dataset Description
<!-- Provide a longer summary of what this dataset is. -->
- **Curated by:** [More Information Needed]
- **Funded by [optional]:** [More Information Needed]
- **Shared by [optional]:** [More Information Needed]
- **Language(s) (NLP):** [More Information Needed]
- **License:** [More Information Needed]
### Dataset Sources [optional]
<!-- Provide the basic links for the dataset. -->
- **Repository:** [More Information Needed]
- **Paper [optional]:** [More Information Needed]
- **Demo [optional]:** [More Information Needed]
## Uses
<!-- Address questions around how the dataset is intended to be used. -->
### Direct Use
<!-- This section describes suitable use cases for the dataset. -->
[More Information Needed]
### Out-of-Scope Use
<!-- This section addresses misuse, malicious use, and uses that the dataset will not work well for. -->
[More Information Needed]
## Dataset Structure
<!-- This section provides a description of the dataset fields, and additional information about the dataset structure such as criteria used to create the splits, relationships between data points, etc. -->
[More Information Needed]
## Dataset Creation
### Curation Rationale
<!-- Motivation for the creation of this dataset. -->
[More Information Needed]
### Source Data
<!-- This section describes the source data (e.g. news text and headlines, social media posts, translated sentences, ...). -->
#### Data Collection and Processing
<!-- This section describes the data collection and processing process such as data selection criteria, filtering and normalization methods, tools and libraries used, etc. -->
[More Information Needed]
#### Who are the source data producers?
<!-- This section describes the people or systems who originally created the data. It should also include self-reported demographic or identity information for the source data creators if this information is available. -->
[More Information Needed]
### Annotations [optional]
<!-- If the dataset contains annotations which are not part of the initial data collection, use this section to describe them. -->
#### Annotation process
<!-- This section describes the annotation process such as annotation tools used in the process, the amount of data annotated, annotation guidelines provided to the annotators, interannotator statistics, annotation validation, etc. -->
[More Information Needed]
#### Who are the annotators?
<!-- This section describes the people or systems who created the annotations. -->
[More Information Needed]
#### Personal and Sensitive Information
<!-- State whether the dataset contains data that might be considered personal, sensitive, or private (e.g., data that reveals addresses, uniquely identifiable names or aliases, racial or ethnic origins, sexual orientations, religious beliefs, political opinions, financial or health data, etc.). If efforts were made to anonymize the data, describe the anonymization process. -->
[More Information Needed]
## Bias, Risks, and Limitations
<!-- This section is meant to convey both technical and sociotechnical limitations. -->
[More Information Needed]
### Recommendations
<!-- This section is meant to convey recommendations with respect to the bias, risk, and technical limitations. -->
Users should be made aware of the risks, biases and limitations of the dataset. More information needed for further recommendations.
## Citation [optional]
<!-- If there is a paper or blog post introducing the dataset, the APA and Bibtex information for that should go in this section. -->
**BibTeX:**
[More Information Needed]
**APA:**
[More Information Needed]
## Glossary [optional]
<!-- If relevant, include terms and calculations in this section that can help readers understand the dataset or dataset card. -->
[More Information Needed]
## More Information [optional]
[More Information Needed]
## Dataset Card Authors [optional]
[More Information Needed]
## Dataset Card Contact
[More Information Needed]
---
pretty_name: 评估运行结果:microsoft/Phi-3-medium-4k-instruct
repo_url: https://huggingface.co/microsoft/Phi-3-medium-4k-instruct
configs:
- 配置名称:community_acva_Algeria_0
数据文件:
- 划分(split):2024_10_11T17_01_44.350127
路径:
- '**/details_community|acva:Algeria|0_2024-10-11T17-01-44.350127.parquet'
- 划分(split):latest
路径:
- '**/details_community|acva:Algeria|0_2024-10-11T17-01-44.350127.parquet'
- 配置名称:community_acva_Ancient_Egypt_0
数据文件:
- 划分(split):2024_10_11T17_01_44.350127
路径:
- '**/details_community|acva:Ancient_Egypt|0_2024-10-11T17-01-44.350127.parquet'
- 划分(split):latest
路径:
- '**/details_community|acva:Ancient_Egypt|0_2024-10-11T17-01-44.350127.parquet'
- 配置名称:community_acva_Arab_Empire_0
数据文件:
- 划分(split):2024_10_11T17_01_44.350127
路径:
- '**/details_community|acva:Arab_Empire|0_2024-10-11T17-01-44.350127.parquet'
- 划分(split):latest
路径:
- '**/details_community|acva:Arab_Empire|0_2024-10-11T17-01-44.350127.parquet'
...(其余配置项格式与上述一致,完整保留原文配置结构)
- 配置名称:results
数据文件:
- 划分(split):2024_10_11T17_01_44.350127
路径:
- results_2024-10-11T17-01-44.350127.parquet
- 划分(split):latest
路径:
- results_2024-10-11T17-01-44.350127.parquet
---
## 数据集卡片:microsoft/Phi-3-medium-4k-instruct模型评估运行结果
<!-- 请简要描述本数据集 -->
本数据集系针对模型[microsoft/Phi-3-medium-4k-instruct](https://huggingface.co/microsoft/Phi-3-medium-4k-instruct)开展评估运行时自动生成的数据集。
该数据集包含136个配置项,每个配置项对应一个被评估的任务。
本数据集仅由1次评估运行生成,每次运行的结果会以划分(split)的形式存储在对应配置中,划分名称采用运行的时间戳(timestamp)命名。其中`train`划分始终指向最新的评估结果。
此外,额外配置项`results`用于存储本次运行的所有聚合评估结果。
若需加载某次运行的详细结果,可参考如下示例代码:
python
from datasets import load_dataset
data = load_dataset("OALL/details_microsoft__Phi-3-medium-4k-instruct",
"lighteval_xstory_cloze_ar_0",
split="train")
## 最新评估结果
以下为[2024-10-11T17:01:44.350127运行的最新结果](https://huggingface.co/datasets/OALL/details_microsoft__Phi-3-medium-4k-instruct/blob/main/results_2024-10-11T17-01-44.350127.json)(注:若多次连续评估未覆盖完全相同的任务,仓库中可能存在其他任务的评估结果,你可以在`results`配置项以及每个评估的`latest`划分中找到对应结果):
python
{
"all": {
"acc_norm": 0.47111505023267153,
"acc_norm_stderr": 0.03852465552924525,
"acc": 0.5598941098610192,
"acc_stderr": 0.012774475160716331
},
"community|acva:Algeria|0": {
"acc_norm": 0.5846153846153846,
"acc_norm_stderr": 0.03538013280575029
},
...(其余评估结果内容与原文完全一致)
}
## 数据集详情
### 数据集描述
<!-- 请详细描述本数据集 -->
- **整理者:** [需补充更多信息]
- **资助方 [可选]:** [需补充更多信息]
- **分享者 [可选]:** [需补充更多信息]
- **自然语言(NLP):** [需补充更多信息]
- **许可证:** [需补充更多信息]
### 数据集来源 [可选]
<!-- 请提供数据集的基本链接 -->
- **仓库:** [需补充更多信息]
- **论文 [可选]:** [需补充更多信息]
- **演示 [可选]:** [需补充更多信息]
## 数据集用途
<!-- 请说明本数据集的预期使用场景 -->
### 直接使用
<!-- 本节描述本数据集的适用场景 -->
[需补充更多信息]
### 不适用场景
<!-- 本节描述误用、恶意使用以及本数据集无法良好适配的使用场景 -->
[需补充更多信息]
## 数据集结构
<!-- 本节描述数据集的字段信息,以及其他相关结构信息,例如划分标准、数据点间的关系等 -->
[需补充更多信息]
## 数据集构建
### 构建逻辑
<!-- 描述创建本数据集的动机 -->
[需补充更多信息]
### 源数据
<!-- 本节描述源数据(例如新闻文本与标题、社交媒体帖子、翻译后的句子等) -->
#### 数据收集与处理
<!-- 本节描述数据收集与处理流程,例如数据选择标准、过滤与归一化方法、使用的工具与库等 -->
[需补充更多信息]
#### 源数据生产者
<!-- 本节描述最初创建数据的个人或系统。若源数据创建者有自我报告的人口统计或身份信息,也应在此处说明 -->
[需补充更多信息]
### 标注 [可选]
<!-- 若数据集包含非初始数据收集阶段的标注,请使用本节描述标注信息 -->
#### 标注流程
<!-- 本节描述标注流程,例如标注中使用的工具、标注的数据量、提供给标注者的标注指南、标注者间统计数据、标注验证等 -->
[需补充更多信息]
#### 标注人员
<!-- 本节描述创建标注的个人或系统 -->
[需补充更多信息]
#### 个人与敏感信息
<!-- 说明数据集是否包含可能被视为个人、敏感或隐私的数据(例如显示地址、唯一可识别的姓名或别名、种族或族裔起源、性取向、宗教信仰、政治观点、财务或健康数据等)。如果对数据进行了匿名处理,请描述匿名化流程 -->
[需补充更多信息]
## 偏差、风险与局限性
<!-- 本节旨在传达技术与社会技术层面的局限性 -->
[需补充更多信息]
### 建议
<!-- 本节旨在传达关于偏差、风险和技术局限性的建议 -->
用户应了解本数据集的风险、偏差和局限性。需补充更多信息以提供进一步建议。
## 引用 [可选]
<!-- 如果有介绍本数据集的论文或博客文章,应在此处给出对应的APA和BibTeX格式引用信息 -->
**BibTeX:**
[需补充更多信息]
**APA:**
[需补充更多信息]
## 术语表 [可选]
<!-- 若相关,请在此处包含可帮助读者理解本数据集或数据集卡片的术语和计算方法 -->
[需补充更多信息]
## 更多信息 [可选]
[需补充更多信息]
## 数据集卡片撰写者 [可选]
[需补充更多信息]
## 数据集卡片联系人
[需补充更多信息]
提供机构:
OALL



