five

OALL/details_zhengr__MixTAO-7Bx2-MoE-v8.1

收藏
Hugging Face2024-05-20 更新2024-06-12 收录
下载链接:
https://hf-mirror.com/datasets/OALL/details_zhengr__MixTAO-7Bx2-MoE-v8.1
下载链接
链接失效反馈
官方服务:
资源简介:
--- pretty_name: Evaluation run of zhengr/MixTAO-7Bx2-MoE-v8.1 dataset_summary: "Dataset automatically created during the evaluation run of model\ \ [zhengr/MixTAO-7Bx2-MoE-v8.1](https://huggingface.co/zhengr/MixTAO-7Bx2-MoE-v8.1).\n\ \nThe dataset is composed of 136 configuration, each one coresponding to one of\ \ the evaluated task.\n\nThe dataset has been created from 1 run(s). Each run can\ \ be found as a specific split in each configuration, the split being named using\ \ the timestamp of the run.The \"train\" split is always pointing to the latest\ \ results.\n\nAn additional configuration \"results\" store all the aggregated results\ \ of the run.\n\nTo load the details from a run, you can for instance do the following:\n\ ```python\nfrom datasets import load_dataset\ndata = load_dataset(\"OALL/details_zhengr__MixTAO-7Bx2-MoE-v8.1\"\ ,\n\t\"lighteval_xstory_cloze_ar_0\",\n\tsplit=\"train\")\n```\n\n## Latest results\n\ \nThese are the [latest results from run 2024-05-20T17:39:11.123569](https://huggingface.co/datasets/OALL/details_zhengr__MixTAO-7Bx2-MoE-v8.1/blob/main/results_2024-05-20T17-39-11.123569.json)(note\ \ that their might be results for other tasks in the repos if successive evals didn't\ \ cover the same tasks. You find each in the results and the \"latest\" split for\ \ each eval):\n\n```python\n{\n \"all\": {\n \"acc_norm\": 0.47749705985404584,\n\ \ \"acc_norm_stderr\": 0.03795069261216331,\n \"acc\": 0.5784248841826605,\n\ \ \"acc_stderr\": 0.012707862131801898\n },\n \"community|acva:Algeria|0\"\ : {\n \"acc_norm\": 0.5794871794871795,\n \"acc_norm_stderr\": 0.035441383893034833\n\ \ },\n \"community|acva:Ancient_Egypt|0\": {\n \"acc_norm\": 0.5714285714285714,\n\ \ \"acc_norm_stderr\": 0.02792722339076032\n },\n \"community|acva:Arab_Empire|0\"\ : {\n \"acc_norm\": 0.3660377358490566,\n \"acc_norm_stderr\": 0.029647813539365252\n\ \ },\n \"community|acva:Arabic_Architecture|0\": {\n \"acc_norm\":\ \ 0.5846153846153846,\n \"acc_norm_stderr\": 0.035380132805750295\n },\n\ \ \"community|acva:Arabic_Art|0\": {\n \"acc_norm\": 0.558974358974359,\n\ \ \"acc_norm_stderr\": 0.035647329318535786\n },\n \"community|acva:Arabic_Astronomy|0\"\ : {\n \"acc_norm\": 0.47692307692307695,\n \"acc_norm_stderr\": 0.0358596530894741\n\ \ },\n \"community|acva:Arabic_Calligraphy|0\": {\n \"acc_norm\": 0.6862745098039216,\n\ \ \"acc_norm_stderr\": 0.02911434198875567\n },\n \"community|acva:Arabic_Ceremony|0\"\ : {\n \"acc_norm\": 0.6486486486486487,\n \"acc_norm_stderr\": 0.03519384049793635\n\ \ },\n \"community|acva:Arabic_Clothing|0\": {\n \"acc_norm\": 0.5333333333333333,\n\ \ \"acc_norm_stderr\": 0.035818045967822315\n },\n \"community|acva:Arabic_Culture|0\"\ : {\n \"acc_norm\": 0.6256410256410256,\n \"acc_norm_stderr\": 0.03474608430626236\n\ \ },\n \"community|acva:Arabic_Food|0\": {\n \"acc_norm\": 0.5846153846153846,\n\ \ \"acc_norm_stderr\": 0.035380132805750295\n },\n \"community|acva:Arabic_Funeral|0\"\ : {\n \"acc_norm\": 0.7578947368421053,\n \"acc_norm_stderr\": 0.04418172153936914\n\ \ },\n \"community|acva:Arabic_Geography|0\": {\n \"acc_norm\": 0.593103448275862,\n\ \ \"acc_norm_stderr\": 0.04093793981266236\n },\n \"community|acva:Arabic_History|0\"\ : {\n \"acc_norm\": 0.49230769230769234,\n \"acc_norm_stderr\": 0.03589365940635213\n\ \ },\n \"community|acva:Arabic_Language_Origin|0\": {\n \"acc_norm\"\ : 0.6947368421052632,\n \"acc_norm_stderr\": 0.047498887145627784\n },\n\ \ \"community|acva:Arabic_Literature|0\": {\n \"acc_norm\": 0.7034482758620689,\n\ \ \"acc_norm_stderr\": 0.03806142687309993\n },\n \"community|acva:Arabic_Math|0\"\ : {\n \"acc_norm\": 0.31794871794871793,\n \"acc_norm_stderr\": 0.03343383454355787\n\ \ },\n \"community|acva:Arabic_Medicine|0\": {\n \"acc_norm\": 0.6689655172413793,\n\ \ \"acc_norm_stderr\": 0.03921545312467122\n },\n \"community|acva:Arabic_Music|0\"\ : {\n \"acc_norm\": 0.7266187050359713,\n \"acc_norm_stderr\": 0.0379400712153362\n\ \ },\n \"community|acva:Arabic_Ornament|0\": {\n \"acc_norm\": 0.7743589743589744,\n\ \ \"acc_norm_stderr\": 0.030010921825357008\n },\n \"community|acva:Arabic_Philosophy|0\"\ : {\n \"acc_norm\": 0.6689655172413793,\n \"acc_norm_stderr\": 0.03921545312467122\n\ \ },\n \"community|acva:Arabic_Physics_and_Chemistry|0\": {\n \"acc_norm\"\ : 0.6307692307692307,\n \"acc_norm_stderr\": 0.034648411418637566\n },\n\ \ \"community|acva:Arabic_Wedding|0\": {\n \"acc_norm\": 0.5846153846153846,\n\ \ \"acc_norm_stderr\": 0.03538013280575031\n },\n \"community|acva:Bahrain|0\"\ : {\n \"acc_norm\": 0.6222222222222222,\n \"acc_norm_stderr\": 0.07309112127323451\n\ \ },\n \"community|acva:Comoros|0\": {\n \"acc_norm\": 0.4666666666666667,\n\ \ \"acc_norm_stderr\": 0.0752101433090355\n },\n \"community|acva:Egypt_modern|0\"\ : {\n \"acc_norm\": 0.6421052631578947,\n \"acc_norm_stderr\": 0.04944436957628254\n\ \ },\n \"community|acva:InfluenceFromAncientEgypt|0\": {\n \"acc_norm\"\ : 0.7538461538461538,\n \"acc_norm_stderr\": 0.030927428371225685\n },\n\ \ \"community|acva:InfluenceFromByzantium|0\": {\n \"acc_norm\": 0.8,\n\ \ \"acc_norm_stderr\": 0.0333333333333333\n },\n \"community|acva:InfluenceFromChina|0\"\ : {\n \"acc_norm\": 0.28205128205128205,\n \"acc_norm_stderr\": 0.032307986017991154\n\ \ },\n \"community|acva:InfluenceFromGreece|0\": {\n \"acc_norm\":\ \ 0.8153846153846154,\n \"acc_norm_stderr\": 0.027855716655754165\n },\n\ \ \"community|acva:InfluenceFromIslam|0\": {\n \"acc_norm\": 0.7517241379310344,\n\ \ \"acc_norm_stderr\": 0.036001056927277716\n },\n \"community|acva:InfluenceFromPersia|0\"\ : {\n \"acc_norm\": 0.7885714285714286,\n \"acc_norm_stderr\": 0.03095478075830146\n\ \ },\n \"community|acva:InfluenceFromRome|0\": {\n \"acc_norm\": 0.6666666666666666,\n\ \ \"acc_norm_stderr\": 0.03384487217112065\n },\n \"community|acva:Iraq|0\"\ : {\n \"acc_norm\": 0.6,\n \"acc_norm_stderr\": 0.05345224838248487\n\ \ },\n \"community|acva:Islam_Education|0\": {\n \"acc_norm\": 0.6871794871794872,\n\ \ \"acc_norm_stderr\": 0.033287550657248546\n },\n \"community|acva:Islam_branches_and_schools|0\"\ : {\n \"acc_norm\": 0.5657142857142857,\n \"acc_norm_stderr\": 0.037576101528126626\n\ \ },\n \"community|acva:Islamic_law_system|0\": {\n \"acc_norm\": 0.7076923076923077,\n\ \ \"acc_norm_stderr\": 0.032654383937495104\n },\n \"community|acva:Jordan|0\"\ : {\n \"acc_norm\": 0.4666666666666667,\n \"acc_norm_stderr\": 0.0752101433090355\n\ \ },\n \"community|acva:Kuwait|0\": {\n \"acc_norm\": 0.7777777777777778,\n\ \ \"acc_norm_stderr\": 0.06267511942419628\n },\n \"community|acva:Lebanon|0\"\ : {\n \"acc_norm\": 0.5555555555555556,\n \"acc_norm_stderr\": 0.07491109582924914\n\ \ },\n \"community|acva:Libya|0\": {\n \"acc_norm\": 0.6222222222222222,\n\ \ \"acc_norm_stderr\": 0.07309112127323451\n },\n \"community|acva:Mauritania|0\"\ : {\n \"acc_norm\": 0.6,\n \"acc_norm_stderr\": 0.07385489458759965\n\ \ },\n \"community|acva:Mesopotamia_civilization|0\": {\n \"acc_norm\"\ : 0.6709677419354839,\n \"acc_norm_stderr\": 0.037862535985883836\n },\n\ \ \"community|acva:Morocco|0\": {\n \"acc_norm\": 0.6222222222222222,\n\ \ \"acc_norm_stderr\": 0.07309112127323451\n },\n \"community|acva:Oman|0\"\ : {\n \"acc_norm\": 0.6,\n \"acc_norm_stderr\": 0.07385489458759965\n\ \ },\n \"community|acva:Palestine|0\": {\n \"acc_norm\": 0.5411764705882353,\n\ \ \"acc_norm_stderr\": 0.0543691634273002\n },\n \"community|acva:Qatar|0\"\ : {\n \"acc_norm\": 0.6444444444444445,\n \"acc_norm_stderr\": 0.07216392363431012\n\ \ },\n \"community|acva:Saudi_Arabia|0\": {\n \"acc_norm\": 0.6205128205128205,\n\ \ \"acc_norm_stderr\": 0.03483959266365358\n },\n \"community|acva:Somalia|0\"\ : {\n \"acc_norm\": 0.6222222222222222,\n \"acc_norm_stderr\": 0.07309112127323451\n\ \ },\n \"community|acva:Sudan|0\": {\n \"acc_norm\": 0.5777777777777777,\n\ \ \"acc_norm_stderr\": 0.07446027270295806\n },\n \"community|acva:Syria|0\"\ : {\n \"acc_norm\": 0.6444444444444445,\n \"acc_norm_stderr\": 0.07216392363431012\n\ \ },\n \"community|acva:Tunisia|0\": {\n \"acc_norm\": 0.35555555555555557,\n\ \ \"acc_norm_stderr\": 0.07216392363431012\n },\n \"community|acva:United_Arab_Emirates|0\"\ : {\n \"acc_norm\": 0.5647058823529412,\n \"acc_norm_stderr\": 0.054095720804810316\n\ \ },\n \"community|acva:Yemen|0\": {\n \"acc_norm\": 0.4,\n \ \ \"acc_norm_stderr\": 0.1632993161855452\n },\n \"community|acva:communication|0\"\ : {\n \"acc_norm\": 0.5302197802197802,\n \"acc_norm_stderr\": 0.026195217787616888\n\ \ },\n \"community|acva:computer_and_phone|0\": {\n \"acc_norm\": 0.6067796610169491,\n\ \ \"acc_norm_stderr\": 0.02848786016617071\n },\n \"community|acva:daily_life|0\"\ : {\n \"acc_norm\": 0.5400593471810089,\n \"acc_norm_stderr\": 0.027189548976070146\n\ \ },\n \"community|acva:entertainment|0\": {\n \"acc_norm\": 0.6372881355932203,\n\ \ \"acc_norm_stderr\": 0.028039814248303797\n },\n \"community|alghafa:mcq_exams_test_ar|0\"\ : {\n \"acc_norm\": 0.3267504488330341,\n \"acc_norm_stderr\": 0.0198910970748856\n\ \ },\n \"community|alghafa:meta_ar_dialects|0\": {\n \"acc_norm\":\ \ 0.3241890639481001,\n \"acc_norm_stderr\": 0.006373181940508726\n },\n\ \ \"community|alghafa:meta_ar_msa|0\": {\n \"acc_norm\": 0.3754189944134078,\n\ \ \"acc_norm_stderr\": 0.01619510424846353\n },\n \"community|alghafa:multiple_choice_facts_truefalse_balanced_task|0\"\ : {\n \"acc_norm\": 0.52,\n \"acc_norm_stderr\": 0.05807730170189531\n\ \ },\n \"community|alghafa:multiple_choice_grounded_statement_soqal_task|0\"\ : {\n \"acc_norm\": 0.6266666666666667,\n \"acc_norm_stderr\": 0.03962538976206637\n\ \ },\n \"community|alghafa:multiple_choice_grounded_statement_xglue_mlqa_task|0\"\ : {\n \"acc_norm\": 0.5266666666666666,\n \"acc_norm_stderr\": 0.040903298047964325\n\ \ },\n \"community|alghafa:multiple_choice_rating_sentiment_no_neutral_task|0\"\ : {\n \"acc_norm\": 0.8440275171982489,\n \"acc_norm_stderr\": 0.004058076442677078\n\ \ },\n \"community|alghafa:multiple_choice_rating_sentiment_task|0\": {\n\ \ \"acc_norm\": 0.51976647206005,\n \"acc_norm_stderr\": 0.006453153566642388\n\ \ },\n \"community|alghafa:multiple_choice_sentiment_task|0\": {\n \ \ \"acc_norm\": 0.39651162790697675,\n \"acc_norm_stderr\": 0.011798437025916928\n\ \ },\n \"community|arabic_exams|0\": {\n \"acc_norm\": 0.329608938547486,\n\ \ \"acc_norm_stderr\": 0.02030398121835852\n },\n \"community|arabic_mmlu:abstract_algebra|0\"\ : {\n \"acc_norm\": 0.27,\n \"acc_norm_stderr\": 0.044619604333847394\n\ \ },\n \"community|arabic_mmlu:anatomy|0\": {\n \"acc_norm\": 0.3333333333333333,\n\ \ \"acc_norm_stderr\": 0.04072314811876837\n },\n \"community|arabic_mmlu:astronomy|0\"\ : {\n \"acc_norm\": 0.34868421052631576,\n \"acc_norm_stderr\": 0.0387813988879761\n\ \ },\n \"community|arabic_mmlu:business_ethics|0\": {\n \"acc_norm\"\ : 0.47,\n \"acc_norm_stderr\": 0.050161355804659205\n },\n \"community|arabic_mmlu:clinical_knowledge|0\"\ : {\n \"acc_norm\": 0.4037735849056604,\n \"acc_norm_stderr\": 0.03019761160019795\n\ \ },\n \"community|arabic_mmlu:college_biology|0\": {\n \"acc_norm\"\ : 0.3333333333333333,\n \"acc_norm_stderr\": 0.039420826399272135\n },\n\ \ \"community|arabic_mmlu:college_chemistry|0\": {\n \"acc_norm\": 0.31,\n\ \ \"acc_norm_stderr\": 0.04648231987117316\n },\n \"community|arabic_mmlu:college_computer_science|0\"\ : {\n \"acc_norm\": 0.26,\n \"acc_norm_stderr\": 0.04408440022768078\n\ \ },\n \"community|arabic_mmlu:college_mathematics|0\": {\n \"acc_norm\"\ : 0.24,\n \"acc_norm_stderr\": 0.04292346959909282\n },\n \"community|arabic_mmlu:college_medicine|0\"\ : {\n \"acc_norm\": 0.3179190751445087,\n \"acc_norm_stderr\": 0.0355068398916558\n\ \ },\n \"community|arabic_mmlu:college_physics|0\": {\n \"acc_norm\"\ : 0.21568627450980393,\n \"acc_norm_stderr\": 0.04092563958237655\n },\n\ \ \"community|arabic_mmlu:computer_security|0\": {\n \"acc_norm\": 0.49,\n\ \ \"acc_norm_stderr\": 0.05024183937956911\n },\n \"community|arabic_mmlu:conceptual_physics|0\"\ : {\n \"acc_norm\": 0.32340425531914896,\n \"acc_norm_stderr\": 0.030579442773610337\n\ \ },\n \"community|arabic_mmlu:econometrics|0\": {\n \"acc_norm\":\ \ 0.2719298245614035,\n \"acc_norm_stderr\": 0.04185774424022056\n },\n\ \ \"community|arabic_mmlu:electrical_engineering|0\": {\n \"acc_norm\"\ : 0.4,\n \"acc_norm_stderr\": 0.040824829046386284\n },\n \"community|arabic_mmlu:elementary_mathematics|0\"\ : {\n \"acc_norm\": 0.30423280423280424,\n \"acc_norm_stderr\": 0.023695415009463087\n\ \ },\n \"community|arabic_mmlu:formal_logic|0\": {\n \"acc_norm\":\ \ 0.30952380952380953,\n \"acc_norm_stderr\": 0.041349130183033156\n },\n\ \ \"community|arabic_mmlu:global_facts|0\": {\n \"acc_norm\": 0.29,\n\ \ \"acc_norm_stderr\": 0.045604802157206845\n },\n \"community|arabic_mmlu:high_school_biology|0\"\ : {\n \"acc_norm\": 0.3967741935483871,\n \"acc_norm_stderr\": 0.027831231605767944\n\ \ },\n \"community|arabic_mmlu:high_school_chemistry|0\": {\n \"acc_norm\"\ : 0.3497536945812808,\n \"acc_norm_stderr\": 0.03355400904969565\n },\n\ \ \"community|arabic_mmlu:high_school_computer_science|0\": {\n \"acc_norm\"\ : 0.36,\n \"acc_norm_stderr\": 0.04824181513244218\n },\n \"community|arabic_mmlu:high_school_european_history|0\"\ : {\n \"acc_norm\": 0.23636363636363636,\n \"acc_norm_stderr\": 0.03317505930009179\n\ \ },\n \"community|arabic_mmlu:high_school_geography|0\": {\n \"acc_norm\"\ : 0.3383838383838384,\n \"acc_norm_stderr\": 0.03371124142626303\n },\n\ \ \"community|arabic_mmlu:high_school_government_and_politics|0\": {\n \ \ \"acc_norm\": 0.3316062176165803,\n \"acc_norm_stderr\": 0.03397636541089116\n\ \ },\n \"community|arabic_mmlu:high_school_macroeconomics|0\": {\n \ \ \"acc_norm\": 0.3333333333333333,\n \"acc_norm_stderr\": 0.02390115797940254\n\ \ },\n \"community|arabic_mmlu:high_school_mathematics|0\": {\n \"\ acc_norm\": 0.31851851851851853,\n \"acc_norm_stderr\": 0.028406533090608456\n\ \ },\n \"community|arabic_mmlu:high_school_microeconomics|0\": {\n \ \ \"acc_norm\": 0.2773109243697479,\n \"acc_norm_stderr\": 0.02907937453948001\n\ \ },\n \"community|arabic_mmlu:high_school_physics|0\": {\n \"acc_norm\"\ : 0.2913907284768212,\n \"acc_norm_stderr\": 0.037101857261199946\n },\n\ \ \"community|arabic_mmlu:high_school_psychology|0\": {\n \"acc_norm\"\ : 0.3247706422018349,\n \"acc_norm_stderr\": 0.02007772910931033\n },\n\ \ \"community|arabic_mmlu:high_school_statistics|0\": {\n \"acc_norm\"\ : 0.3472222222222222,\n \"acc_norm_stderr\": 0.032468872436376486\n },\n\ \ \"community|arabic_mmlu:high_school_us_history|0\": {\n \"acc_norm\"\ : 0.2549019607843137,\n \"acc_norm_stderr\": 0.030587591351604246\n },\n\ \ \"community|arabic_mmlu:high_school_world_history|0\": {\n \"acc_norm\"\ : 0.32489451476793246,\n \"acc_norm_stderr\": 0.030486039389105303\n },\n\ \ \"community|arabic_mmlu:human_aging|0\": {\n \"acc_norm\": 0.34080717488789236,\n\ \ \"acc_norm_stderr\": 0.03181149747055359\n },\n \"community|arabic_mmlu:human_sexuality|0\"\ : {\n \"acc_norm\": 0.37404580152671757,\n \"acc_norm_stderr\": 0.04243869242230524\n\ \ },\n \"community|arabic_mmlu:international_law|0\": {\n \"acc_norm\"\ : 0.48760330578512395,\n \"acc_norm_stderr\": 0.045629515481807666\n },\n\ \ \"community|arabic_mmlu:jurisprudence|0\": {\n \"acc_norm\": 0.48148148148148145,\n\ \ \"acc_norm_stderr\": 0.04830366024635331\n },\n \"community|arabic_mmlu:logical_fallacies|0\"\ : {\n \"acc_norm\": 0.3496932515337423,\n \"acc_norm_stderr\": 0.03746668325470022\n\ \ },\n \"community|arabic_mmlu:machine_learning|0\": {\n \"acc_norm\"\ : 0.29464285714285715,\n \"acc_norm_stderr\": 0.0432704093257873\n },\n\ \ \"community|arabic_mmlu:management|0\": {\n \"acc_norm\": 0.42718446601941745,\n\ \ \"acc_norm_stderr\": 0.04897957737781168\n },\n \"community|arabic_mmlu:marketing|0\"\ : {\n \"acc_norm\": 0.45726495726495725,\n \"acc_norm_stderr\": 0.03263622596380688\n\ \ },\n \"community|arabic_mmlu:medical_genetics|0\": {\n \"acc_norm\"\ : 0.28,\n \"acc_norm_stderr\": 0.04512608598542127\n },\n \"community|arabic_mmlu:miscellaneous|0\"\ : {\n \"acc_norm\": 0.384418901660281,\n \"acc_norm_stderr\": 0.01739568874281962\n\ \ },\n \"community|arabic_mmlu:moral_disputes|0\": {\n \"acc_norm\"\ : 0.407514450867052,\n \"acc_norm_stderr\": 0.026454578146931498\n },\n\ \ \"community|arabic_mmlu:moral_scenarios|0\": {\n \"acc_norm\": 0.2547486033519553,\n\ \ \"acc_norm_stderr\": 0.01457265038340916\n },\n \"community|arabic_mmlu:nutrition|0\"\ : {\n \"acc_norm\": 0.43137254901960786,\n \"acc_norm_stderr\": 0.028358956313423556\n\ \ },\n \"community|arabic_mmlu:philosophy|0\": {\n \"acc_norm\": 0.40192926045016075,\n\ \ \"acc_norm_stderr\": 0.02784647600593048\n },\n \"community|arabic_mmlu:prehistory|0\"\ : {\n \"acc_norm\": 0.33024691358024694,\n \"acc_norm_stderr\": 0.026168298456732846\n\ \ },\n \"community|arabic_mmlu:professional_accounting|0\": {\n \"\ acc_norm\": 0.25886524822695034,\n \"acc_norm_stderr\": 0.026129572527180844\n\ \ },\n \"community|arabic_mmlu:professional_law|0\": {\n \"acc_norm\"\ : 0.29726205997392435,\n \"acc_norm_stderr\": 0.01167334617308604\n },\n\ \ \"community|arabic_mmlu:professional_medicine|0\": {\n \"acc_norm\"\ : 0.2647058823529412,\n \"acc_norm_stderr\": 0.026799562024887657\n },\n\ \ \"community|arabic_mmlu:professional_psychology|0\": {\n \"acc_norm\"\ : 0.2761437908496732,\n \"acc_norm_stderr\": 0.018087276935663137\n },\n\ \ \"community|arabic_mmlu:public_relations|0\": {\n \"acc_norm\": 0.4090909090909091,\n\ \ \"acc_norm_stderr\": 0.04709306978661896\n },\n \"community|arabic_mmlu:security_studies|0\"\ : {\n \"acc_norm\": 0.42857142857142855,\n \"acc_norm_stderr\": 0.031680911612338825\n\ \ },\n \"community|arabic_mmlu:sociology|0\": {\n \"acc_norm\": 0.44776119402985076,\n\ \ \"acc_norm_stderr\": 0.03516184772952167\n },\n \"community|arabic_mmlu:us_foreign_policy|0\"\ : {\n \"acc_norm\": 0.47,\n \"acc_norm_stderr\": 0.05016135580465919\n\ \ },\n \"community|arabic_mmlu:virology|0\": {\n \"acc_norm\": 0.3373493975903614,\n\ \ \"acc_norm_stderr\": 0.036807836907275814\n },\n \"community|arabic_mmlu:world_religions|0\"\ : {\n \"acc_norm\": 0.29239766081871343,\n \"acc_norm_stderr\": 0.03488647713457922\n\ \ },\n \"community|arc_challenge_okapi_ar|0\": {\n \"acc_norm\": 0.375,\n\ \ \"acc_norm_stderr\": 0.014220469151254982\n },\n \"community|arc_easy_ar|0\"\ : {\n \"acc_norm\": 0.383248730964467,\n \"acc_norm_stderr\": 0.010001462888736162\n\ \ },\n \"community|boolq_ar|0\": {\n \"acc_norm\": 0.7095092024539877,\n\ \ \"acc_norm_stderr\": 0.007952488057502703\n },\n \"community|copa_ext_ar|0\"\ : {\n \"acc_norm\": 0.5333333333333333,\n \"acc_norm_stderr\": 0.05288198530254015\n\ \ },\n \"community|hellaswag_okapi_ar|0\": {\n \"acc_norm\": 0.299094973285356,\n\ \ \"acc_norm_stderr\": 0.004781338339681938\n },\n \"community|openbook_qa_ext_ar|0\"\ : {\n \"acc_norm\": 0.4484848484848485,\n \"acc_norm_stderr\": 0.022376344379324554\n\ \ },\n \"community|piqa_ar|0\": {\n \"acc_norm\": 0.5935624659028914,\n\ \ \"acc_norm_stderr\": 0.011475388153907532\n },\n \"community|race_ar|0\"\ : {\n \"acc_norm\": 0.4286873605193751,\n \"acc_norm_stderr\": 0.007049720616100983\n\ \ },\n \"community|sciq_ar|0\": {\n \"acc_norm\": 0.6010050251256281,\n\ \ \"acc_norm_stderr\": 0.015532078342747238\n },\n \"community|toxigen_ar|0\"\ : {\n \"acc_norm\": 0.558288770053476,\n \"acc_norm_stderr\": 0.016248947232761816\n\ \ },\n \"lighteval|xstory_cloze:ar|0\": {\n \"acc\": 0.5784248841826605,\n\ \ \"acc_stderr\": 0.012707862131801898\n },\n \"community|acva:_average|0\"\ : {\n \"acc_norm\": 0.6065540602982304,\n \"acc_norm_stderr\": 0.04709698815079447\n\ \ },\n \"community|alghafa:_average|0\": {\n \"acc_norm\": 0.49555527307701674,\n\ \ \"acc_norm_stderr\": 0.02259722664566892\n },\n \"community|arabic_mmlu:_average|0\"\ : {\n \"acc_norm\": 0.3431955522216634,\n \"acc_norm_stderr\": 0.03518454291933395\n\ \ }\n}\n```" repo_url: https://huggingface.co/zhengr/MixTAO-7Bx2-MoE-v8.1 configs: - config_name: community_acva_Algeria_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|acva:Algeria|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|acva:Algeria|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_acva_Ancient_Egypt_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|acva:Ancient_Egypt|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|acva:Ancient_Egypt|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_acva_Arab_Empire_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|acva:Arab_Empire|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|acva:Arab_Empire|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_acva_Arabic_Architecture_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|acva:Arabic_Architecture|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|acva:Arabic_Architecture|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_acva_Arabic_Art_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|acva:Arabic_Art|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|acva:Arabic_Art|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_acva_Arabic_Astronomy_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|acva:Arabic_Astronomy|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|acva:Arabic_Astronomy|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_acva_Arabic_Calligraphy_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|acva:Arabic_Calligraphy|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|acva:Arabic_Calligraphy|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_acva_Arabic_Ceremony_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|acva:Arabic_Ceremony|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|acva:Arabic_Ceremony|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_acva_Arabic_Clothing_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|acva:Arabic_Clothing|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|acva:Arabic_Clothing|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_acva_Arabic_Culture_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|acva:Arabic_Culture|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|acva:Arabic_Culture|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_acva_Arabic_Food_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|acva:Arabic_Food|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|acva:Arabic_Food|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_acva_Arabic_Funeral_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|acva:Arabic_Funeral|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|acva:Arabic_Funeral|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_acva_Arabic_Geography_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|acva:Arabic_Geography|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|acva:Arabic_Geography|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_acva_Arabic_History_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|acva:Arabic_History|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|acva:Arabic_History|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_acva_Arabic_Language_Origin_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|acva:Arabic_Language_Origin|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|acva:Arabic_Language_Origin|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_acva_Arabic_Literature_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|acva:Arabic_Literature|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|acva:Arabic_Literature|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_acva_Arabic_Math_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|acva:Arabic_Math|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|acva:Arabic_Math|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_acva_Arabic_Medicine_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|acva:Arabic_Medicine|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|acva:Arabic_Medicine|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_acva_Arabic_Music_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|acva:Arabic_Music|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|acva:Arabic_Music|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_acva_Arabic_Ornament_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|acva:Arabic_Ornament|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|acva:Arabic_Ornament|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_acva_Arabic_Philosophy_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|acva:Arabic_Philosophy|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|acva:Arabic_Philosophy|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_acva_Arabic_Physics_and_Chemistry_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|acva:Arabic_Physics_and_Chemistry|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|acva:Arabic_Physics_and_Chemistry|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_acva_Arabic_Wedding_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|acva:Arabic_Wedding|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|acva:Arabic_Wedding|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_acva_Bahrain_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|acva:Bahrain|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|acva:Bahrain|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_acva_Comoros_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|acva:Comoros|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|acva:Comoros|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_acva_Egypt_modern_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|acva:Egypt_modern|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|acva:Egypt_modern|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_acva_InfluenceFromAncientEgypt_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|acva:InfluenceFromAncientEgypt|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|acva:InfluenceFromAncientEgypt|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_acva_InfluenceFromByzantium_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|acva:InfluenceFromByzantium|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|acva:InfluenceFromByzantium|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_acva_InfluenceFromChina_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|acva:InfluenceFromChina|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|acva:InfluenceFromChina|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_acva_InfluenceFromGreece_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|acva:InfluenceFromGreece|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|acva:InfluenceFromGreece|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_acva_InfluenceFromIslam_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|acva:InfluenceFromIslam|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|acva:InfluenceFromIslam|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_acva_InfluenceFromPersia_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|acva:InfluenceFromPersia|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|acva:InfluenceFromPersia|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_acva_InfluenceFromRome_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|acva:InfluenceFromRome|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|acva:InfluenceFromRome|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_acva_Iraq_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|acva:Iraq|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|acva:Iraq|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_acva_Islam_Education_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|acva:Islam_Education|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|acva:Islam_Education|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_acva_Islam_branches_and_schools_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|acva:Islam_branches_and_schools|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|acva:Islam_branches_and_schools|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_acva_Islamic_law_system_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|acva:Islamic_law_system|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|acva:Islamic_law_system|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_acva_Jordan_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|acva:Jordan|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|acva:Jordan|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_acva_Kuwait_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|acva:Kuwait|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|acva:Kuwait|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_acva_Lebanon_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|acva:Lebanon|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|acva:Lebanon|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_acva_Libya_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|acva:Libya|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|acva:Libya|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_acva_Mauritania_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|acva:Mauritania|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|acva:Mauritania|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_acva_Mesopotamia_civilization_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|acva:Mesopotamia_civilization|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|acva:Mesopotamia_civilization|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_acva_Morocco_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|acva:Morocco|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|acva:Morocco|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_acva_Oman_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|acva:Oman|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|acva:Oman|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_acva_Palestine_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|acva:Palestine|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|acva:Palestine|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_acva_Qatar_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|acva:Qatar|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|acva:Qatar|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_acva_Saudi_Arabia_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|acva:Saudi_Arabia|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|acva:Saudi_Arabia|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_acva_Somalia_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|acva:Somalia|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|acva:Somalia|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_acva_Sudan_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|acva:Sudan|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|acva:Sudan|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_acva_Syria_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|acva:Syria|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|acva:Syria|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_acva_Tunisia_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|acva:Tunisia|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|acva:Tunisia|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_acva_United_Arab_Emirates_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|acva:United_Arab_Emirates|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|acva:United_Arab_Emirates|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_acva_Yemen_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|acva:Yemen|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|acva:Yemen|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_acva_communication_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|acva:communication|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|acva:communication|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_acva_computer_and_phone_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|acva:computer_and_phone|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|acva:computer_and_phone|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_acva_daily_life_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|acva:daily_life|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|acva:daily_life|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_acva_entertainment_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|acva:entertainment|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|acva:entertainment|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_alghafa_mcq_exams_test_ar_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|alghafa:mcq_exams_test_ar|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|alghafa:mcq_exams_test_ar|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_alghafa_meta_ar_dialects_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|alghafa:meta_ar_dialects|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|alghafa:meta_ar_dialects|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_alghafa_meta_ar_msa_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|alghafa:meta_ar_msa|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|alghafa:meta_ar_msa|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_alghafa_multiple_choice_facts_truefalse_balanced_task_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|alghafa:multiple_choice_facts_truefalse_balanced_task|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|alghafa:multiple_choice_facts_truefalse_balanced_task|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_alghafa_multiple_choice_grounded_statement_soqal_task_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|alghafa:multiple_choice_grounded_statement_soqal_task|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|alghafa:multiple_choice_grounded_statement_soqal_task|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_alghafa_multiple_choice_grounded_statement_xglue_mlqa_task_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|alghafa:multiple_choice_grounded_statement_xglue_mlqa_task|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|alghafa:multiple_choice_grounded_statement_xglue_mlqa_task|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_alghafa_multiple_choice_rating_sentiment_no_neutral_task_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|alghafa:multiple_choice_rating_sentiment_no_neutral_task|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|alghafa:multiple_choice_rating_sentiment_no_neutral_task|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_alghafa_multiple_choice_rating_sentiment_task_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|alghafa:multiple_choice_rating_sentiment_task|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|alghafa:multiple_choice_rating_sentiment_task|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_alghafa_multiple_choice_sentiment_task_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|alghafa:multiple_choice_sentiment_task|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|alghafa:multiple_choice_sentiment_task|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_arabic_exams_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|arabic_exams|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|arabic_exams|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_arabic_mmlu_abstract_algebra_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|arabic_mmlu:abstract_algebra|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|arabic_mmlu:abstract_algebra|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_arabic_mmlu_anatomy_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|arabic_mmlu:anatomy|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|arabic_mmlu:anatomy|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_arabic_mmlu_astronomy_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|arabic_mmlu:astronomy|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|arabic_mmlu:astronomy|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_arabic_mmlu_business_ethics_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|arabic_mmlu:business_ethics|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|arabic_mmlu:business_ethics|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_arabic_mmlu_clinical_knowledge_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|arabic_mmlu:clinical_knowledge|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|arabic_mmlu:clinical_knowledge|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_arabic_mmlu_college_biology_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|arabic_mmlu:college_biology|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|arabic_mmlu:college_biology|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_arabic_mmlu_college_chemistry_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|arabic_mmlu:college_chemistry|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|arabic_mmlu:college_chemistry|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_arabic_mmlu_college_computer_science_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|arabic_mmlu:college_computer_science|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|arabic_mmlu:college_computer_science|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_arabic_mmlu_college_mathematics_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|arabic_mmlu:college_mathematics|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|arabic_mmlu:college_mathematics|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_arabic_mmlu_college_medicine_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|arabic_mmlu:college_medicine|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|arabic_mmlu:college_medicine|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_arabic_mmlu_college_physics_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|arabic_mmlu:college_physics|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|arabic_mmlu:college_physics|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_arabic_mmlu_computer_security_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|arabic_mmlu:computer_security|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|arabic_mmlu:computer_security|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_arabic_mmlu_conceptual_physics_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|arabic_mmlu:conceptual_physics|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|arabic_mmlu:conceptual_physics|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_arabic_mmlu_econometrics_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|arabic_mmlu:econometrics|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|arabic_mmlu:econometrics|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_arabic_mmlu_electrical_engineering_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|arabic_mmlu:electrical_engineering|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|arabic_mmlu:electrical_engineering|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_arabic_mmlu_elementary_mathematics_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|arabic_mmlu:elementary_mathematics|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|arabic_mmlu:elementary_mathematics|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_arabic_mmlu_formal_logic_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|arabic_mmlu:formal_logic|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|arabic_mmlu:formal_logic|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_arabic_mmlu_global_facts_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|arabic_mmlu:global_facts|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|arabic_mmlu:global_facts|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_arabic_mmlu_high_school_biology_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|arabic_mmlu:high_school_biology|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|arabic_mmlu:high_school_biology|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_arabic_mmlu_high_school_chemistry_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|arabic_mmlu:high_school_chemistry|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|arabic_mmlu:high_school_chemistry|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_arabic_mmlu_high_school_computer_science_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|arabic_mmlu:high_school_computer_science|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|arabic_mmlu:high_school_computer_science|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_arabic_mmlu_high_school_european_history_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|arabic_mmlu:high_school_european_history|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|arabic_mmlu:high_school_european_history|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_arabic_mmlu_high_school_geography_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|arabic_mmlu:high_school_geography|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|arabic_mmlu:high_school_geography|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_arabic_mmlu_high_school_government_and_politics_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|arabic_mmlu:high_school_government_and_politics|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|arabic_mmlu:high_school_government_and_politics|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_arabic_mmlu_high_school_macroeconomics_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|arabic_mmlu:high_school_macroeconomics|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|arabic_mmlu:high_school_macroeconomics|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_arabic_mmlu_high_school_mathematics_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|arabic_mmlu:high_school_mathematics|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|arabic_mmlu:high_school_mathematics|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_arabic_mmlu_high_school_microeconomics_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|arabic_mmlu:high_school_microeconomics|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|arabic_mmlu:high_school_microeconomics|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_arabic_mmlu_high_school_physics_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|arabic_mmlu:high_school_physics|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|arabic_mmlu:high_school_physics|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_arabic_mmlu_high_school_psychology_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|arabic_mmlu:high_school_psychology|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|arabic_mmlu:high_school_psychology|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_arabic_mmlu_high_school_statistics_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|arabic_mmlu:high_school_statistics|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|arabic_mmlu:high_school_statistics|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_arabic_mmlu_high_school_us_history_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|arabic_mmlu:high_school_us_history|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|arabic_mmlu:high_school_us_history|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_arabic_mmlu_high_school_world_history_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|arabic_mmlu:high_school_world_history|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|arabic_mmlu:high_school_world_history|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_arabic_mmlu_human_aging_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|arabic_mmlu:human_aging|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|arabic_mmlu:human_aging|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_arabic_mmlu_human_sexuality_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|arabic_mmlu:human_sexuality|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|arabic_mmlu:human_sexuality|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_arabic_mmlu_international_law_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|arabic_mmlu:international_law|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|arabic_mmlu:international_law|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_arabic_mmlu_jurisprudence_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|arabic_mmlu:jurisprudence|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|arabic_mmlu:jurisprudence|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_arabic_mmlu_logical_fallacies_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|arabic_mmlu:logical_fallacies|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|arabic_mmlu:logical_fallacies|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_arabic_mmlu_machine_learning_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|arabic_mmlu:machine_learning|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|arabic_mmlu:machine_learning|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_arabic_mmlu_management_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|arabic_mmlu:management|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|arabic_mmlu:management|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_arabic_mmlu_marketing_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|arabic_mmlu:marketing|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|arabic_mmlu:marketing|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_arabic_mmlu_medical_genetics_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|arabic_mmlu:medical_genetics|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|arabic_mmlu:medical_genetics|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_arabic_mmlu_miscellaneous_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|arabic_mmlu:miscellaneous|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|arabic_mmlu:miscellaneous|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_arabic_mmlu_moral_disputes_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|arabic_mmlu:moral_disputes|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|arabic_mmlu:moral_disputes|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_arabic_mmlu_moral_scenarios_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|arabic_mmlu:moral_scenarios|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|arabic_mmlu:moral_scenarios|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_arabic_mmlu_nutrition_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|arabic_mmlu:nutrition|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|arabic_mmlu:nutrition|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_arabic_mmlu_philosophy_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|arabic_mmlu:philosophy|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|arabic_mmlu:philosophy|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_arabic_mmlu_prehistory_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|arabic_mmlu:prehistory|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|arabic_mmlu:prehistory|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_arabic_mmlu_professional_accounting_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|arabic_mmlu:professional_accounting|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|arabic_mmlu:professional_accounting|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_arabic_mmlu_professional_law_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|arabic_mmlu:professional_law|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|arabic_mmlu:professional_law|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_arabic_mmlu_professional_medicine_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|arabic_mmlu:professional_medicine|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|arabic_mmlu:professional_medicine|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_arabic_mmlu_professional_psychology_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|arabic_mmlu:professional_psychology|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|arabic_mmlu:professional_psychology|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_arabic_mmlu_public_relations_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|arabic_mmlu:public_relations|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|arabic_mmlu:public_relations|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_arabic_mmlu_security_studies_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|arabic_mmlu:security_studies|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|arabic_mmlu:security_studies|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_arabic_mmlu_sociology_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|arabic_mmlu:sociology|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|arabic_mmlu:sociology|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_arabic_mmlu_us_foreign_policy_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|arabic_mmlu:us_foreign_policy|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|arabic_mmlu:us_foreign_policy|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_arabic_mmlu_virology_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|arabic_mmlu:virology|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|arabic_mmlu:virology|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_arabic_mmlu_world_religions_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|arabic_mmlu:world_religions|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|arabic_mmlu:world_religions|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_arc_challenge_okapi_ar_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|arc_challenge_okapi_ar|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|arc_challenge_okapi_ar|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_arc_easy_ar_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|arc_easy_ar|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|arc_easy_ar|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_boolq_ar_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|boolq_ar|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|boolq_ar|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_copa_ext_ar_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|copa_ext_ar|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|copa_ext_ar|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_hellaswag_okapi_ar_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|hellaswag_okapi_ar|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|hellaswag_okapi_ar|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_openbook_qa_ext_ar_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|openbook_qa_ext_ar|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|openbook_qa_ext_ar|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_piqa_ar_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|piqa_ar|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|piqa_ar|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_race_ar_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|race_ar|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|race_ar|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_sciq_ar_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|sciq_ar|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|sciq_ar|0_2024-05-20T17-39-11.123569.parquet' - config_name: community_toxigen_ar_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_community|toxigen_ar|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_community|toxigen_ar|0_2024-05-20T17-39-11.123569.parquet' - config_name: lighteval_xstory_cloze_ar_0 data_files: - split: 2024_05_20T17_39_11.123569 path: - '**/details_lighteval|xstory_cloze:ar|0_2024-05-20T17-39-11.123569.parquet' - split: latest path: - '**/details_lighteval|xstory_cloze:ar|0_2024-05-20T17-39-11.123569.parquet' - config_name: results data_files: - split: 2024_05_20T17_39_11.123569 path: - results_2024-05-20T17-39-11.123569.parquet - split: latest path: - results_2024-05-20T17-39-11.123569.parquet --- # Dataset Card for Evaluation run of zhengr/MixTAO-7Bx2-MoE-v8.1 <!-- Provide a quick summary of the dataset. --> Dataset automatically created during the evaluation run of model [zhengr/MixTAO-7Bx2-MoE-v8.1](https://huggingface.co/zhengr/MixTAO-7Bx2-MoE-v8.1). The dataset is composed of 136 configuration, each one coresponding to one of the evaluated task. The dataset has been created from 1 run(s). Each run can be found as a specific split in each configuration, the split being named using the timestamp of the run.The "train" split is always pointing to the latest results. An additional configuration "results" store all the aggregated results of the run. To load the details from a run, you can for instance do the following: ```python from datasets import load_dataset data = load_dataset("OALL/details_zhengr__MixTAO-7Bx2-MoE-v8.1", "lighteval_xstory_cloze_ar_0", split="train") ``` ## Latest results These are the [latest results from run 2024-05-20T17:39:11.123569](https://huggingface.co/datasets/OALL/details_zhengr__MixTAO-7Bx2-MoE-v8.1/blob/main/results_2024-05-20T17-39-11.123569.json)(note that their might be results for other tasks in the repos if successive evals didn't cover the same tasks. You find each in the results and the "latest" split for each eval): ```python { "all": { "acc_norm": 0.47749705985404584, "acc_norm_stderr": 0.03795069261216331, "acc": 0.5784248841826605, "acc_stderr": 0.012707862131801898 }, "community|acva:Algeria|0": { "acc_norm": 0.5794871794871795, "acc_norm_stderr": 0.035441383893034833 }, "community|acva:Ancient_Egypt|0": { "acc_norm": 0.5714285714285714, "acc_norm_stderr": 0.02792722339076032 }, "community|acva:Arab_Empire|0": { "acc_norm": 0.3660377358490566, "acc_norm_stderr": 0.029647813539365252 }, "community|acva:Arabic_Architecture|0": { "acc_norm": 0.5846153846153846, "acc_norm_stderr": 0.035380132805750295 }, "community|acva:Arabic_Art|0": { "acc_norm": 0.558974358974359, "acc_norm_stderr": 0.035647329318535786 }, "community|acva:Arabic_Astronomy|0": { "acc_norm": 0.47692307692307695, "acc_norm_stderr": 0.0358596530894741 }, "community|acva:Arabic_Calligraphy|0": { "acc_norm": 0.6862745098039216, "acc_norm_stderr": 0.02911434198875567 }, "community|acva:Arabic_Ceremony|0": { "acc_norm": 0.6486486486486487, "acc_norm_stderr": 0.03519384049793635 }, "community|acva:Arabic_Clothing|0": { "acc_norm": 0.5333333333333333, "acc_norm_stderr": 0.035818045967822315 }, "community|acva:Arabic_Culture|0": { "acc_norm": 0.6256410256410256, "acc_norm_stderr": 0.03474608430626236 }, "community|acva:Arabic_Food|0": { "acc_norm": 0.5846153846153846, "acc_norm_stderr": 0.035380132805750295 }, "community|acva:Arabic_Funeral|0": { "acc_norm": 0.7578947368421053, "acc_norm_stderr": 0.04418172153936914 }, "community|acva:Arabic_Geography|0": { "acc_norm": 0.593103448275862, "acc_norm_stderr": 0.04093793981266236 }, "community|acva:Arabic_History|0": { "acc_norm": 0.49230769230769234, "acc_norm_stderr": 0.03589365940635213 }, "community|acva:Arabic_Language_Origin|0": { "acc_norm": 0.6947368421052632, "acc_norm_stderr": 0.047498887145627784 }, "community|acva:Arabic_Literature|0": { "acc_norm": 0.7034482758620689, "acc_norm_stderr": 0.03806142687309993 }, "community|acva:Arabic_Math|0": { "acc_norm": 0.31794871794871793, "acc_norm_stderr": 0.03343383454355787 }, "community|acva:Arabic_Medicine|0": { "acc_norm": 0.6689655172413793, "acc_norm_stderr": 0.03921545312467122 }, "community|acva:Arabic_Music|0": { "acc_norm": 0.7266187050359713, "acc_norm_stderr": 0.0379400712153362 }, "community|acva:Arabic_Ornament|0": { "acc_norm": 0.7743589743589744, "acc_norm_stderr": 0.030010921825357008 }, "community|acva:Arabic_Philosophy|0": { "acc_norm": 0.6689655172413793, "acc_norm_stderr": 0.03921545312467122 }, "community|acva:Arabic_Physics_and_Chemistry|0": { "acc_norm": 0.6307692307692307, "acc_norm_stderr": 0.034648411418637566 }, "community|acva:Arabic_Wedding|0": { "acc_norm": 0.5846153846153846, "acc_norm_stderr": 0.03538013280575031 }, "community|acva:Bahrain|0": { "acc_norm": 0.6222222222222222, "acc_norm_stderr": 0.07309112127323451 }, "community|acva:Comoros|0": { "acc_norm": 0.4666666666666667, "acc_norm_stderr": 0.0752101433090355 }, "community|acva:Egypt_modern|0": { "acc_norm": 0.6421052631578947, "acc_norm_stderr": 0.04944436957628254 }, "community|acva:InfluenceFromAncientEgypt|0": { "acc_norm": 0.7538461538461538, "acc_norm_stderr": 0.030927428371225685 }, "community|acva:InfluenceFromByzantium|0": { "acc_norm": 0.8, "acc_norm_stderr": 0.0333333333333333 }, "community|acva:InfluenceFromChina|0": { "acc_norm": 0.28205128205128205, "acc_norm_stderr": 0.032307986017991154 }, "community|acva:InfluenceFromGreece|0": { "acc_norm": 0.8153846153846154, "acc_norm_stderr": 0.027855716655754165 }, "community|acva:InfluenceFromIslam|0": { "acc_norm": 0.7517241379310344, "acc_norm_stderr": 0.036001056927277716 }, "community|acva:InfluenceFromPersia|0": { "acc_norm": 0.7885714285714286, "acc_norm_stderr": 0.03095478075830146 }, "community|acva:InfluenceFromRome|0": { "acc_norm": 0.6666666666666666, "acc_norm_stderr": 0.03384487217112065 }, "community|acva:Iraq|0": { "acc_norm": 0.6, "acc_norm_stderr": 0.05345224838248487 }, "community|acva:Islam_Education|0": { "acc_norm": 0.6871794871794872, "acc_norm_stderr": 0.033287550657248546 }, "community|acva:Islam_branches_and_schools|0": { "acc_norm": 0.5657142857142857, "acc_norm_stderr": 0.037576101528126626 }, "community|acva:Islamic_law_system|0": { "acc_norm": 0.7076923076923077, "acc_norm_stderr": 0.032654383937495104 }, "community|acva:Jordan|0": { "acc_norm": 0.4666666666666667, "acc_norm_stderr": 0.0752101433090355 }, "community|acva:Kuwait|0": { "acc_norm": 0.7777777777777778, "acc_norm_stderr": 0.06267511942419628 }, "community|acva:Lebanon|0": { "acc_norm": 0.5555555555555556, "acc_norm_stderr": 0.07491109582924914 }, "community|acva:Libya|0": { "acc_norm": 0.6222222222222222, "acc_norm_stderr": 0.07309112127323451 }, "community|acva:Mauritania|0": { "acc_norm": 0.6, "acc_norm_stderr": 0.07385489458759965 }, "community|acva:Mesopotamia_civilization|0": { "acc_norm": 0.6709677419354839, "acc_norm_stderr": 0.037862535985883836 }, "community|acva:Morocco|0": { "acc_norm": 0.6222222222222222, "acc_norm_stderr": 0.07309112127323451 }, "community|acva:Oman|0": { "acc_norm": 0.6, "acc_norm_stderr": 0.07385489458759965 }, "community|acva:Palestine|0": { "acc_norm": 0.5411764705882353, "acc_norm_stderr": 0.0543691634273002 }, "community|acva:Qatar|0": { "acc_norm": 0.6444444444444445, "acc_norm_stderr": 0.07216392363431012 }, "community|acva:Saudi_Arabia|0": { "acc_norm": 0.6205128205128205, "acc_norm_stderr": 0.03483959266365358 }, "community|acva:Somalia|0": { "acc_norm": 0.6222222222222222, "acc_norm_stderr": 0.07309112127323451 }, "community|acva:Sudan|0": { "acc_norm": 0.5777777777777777, "acc_norm_stderr": 0.07446027270295806 }, "community|acva:Syria|0": { "acc_norm": 0.6444444444444445, "acc_norm_stderr": 0.07216392363431012 }, "community|acva:Tunisia|0": { "acc_norm": 0.35555555555555557, "acc_norm_stderr": 0.07216392363431012 }, "community|acva:United_Arab_Emirates|0": { "acc_norm": 0.5647058823529412, "acc_norm_stderr": 0.054095720804810316 }, "community|acva:Yemen|0": { "acc_norm": 0.4, "acc_norm_stderr": 0.1632993161855452 }, "community|acva:communication|0": { "acc_norm": 0.5302197802197802, "acc_norm_stderr": 0.026195217787616888 }, "community|acva:computer_and_phone|0": { "acc_norm": 0.6067796610169491, "acc_norm_stderr": 0.02848786016617071 }, "community|acva:daily_life|0": { "acc_norm": 0.5400593471810089, "acc_norm_stderr": 0.027189548976070146 }, "community|acva:entertainment|0": { "acc_norm": 0.6372881355932203, "acc_norm_stderr": 0.028039814248303797 }, "community|alghafa:mcq_exams_test_ar|0": { "acc_norm": 0.3267504488330341, "acc_norm_stderr": 0.0198910970748856 }, "community|alghafa:meta_ar_dialects|0": { "acc_norm": 0.3241890639481001, "acc_norm_stderr": 0.006373181940508726 }, "community|alghafa:meta_ar_msa|0": { "acc_norm": 0.3754189944134078, "acc_norm_stderr": 0.01619510424846353 }, "community|alghafa:multiple_choice_facts_truefalse_balanced_task|0": { "acc_norm": 0.52, "acc_norm_stderr": 0.05807730170189531 }, "community|alghafa:multiple_choice_grounded_statement_soqal_task|0": { "acc_norm": 0.6266666666666667, "acc_norm_stderr": 0.03962538976206637 }, "community|alghafa:multiple_choice_grounded_statement_xglue_mlqa_task|0": { "acc_norm": 0.5266666666666666, "acc_norm_stderr": 0.040903298047964325 }, "community|alghafa:multiple_choice_rating_sentiment_no_neutral_task|0": { "acc_norm": 0.8440275171982489, "acc_norm_stderr": 0.004058076442677078 }, "community|alghafa:multiple_choice_rating_sentiment_task|0": { "acc_norm": 0.51976647206005, "acc_norm_stderr": 0.006453153566642388 }, "community|alghafa:multiple_choice_sentiment_task|0": { "acc_norm": 0.39651162790697675, "acc_norm_stderr": 0.011798437025916928 }, "community|arabic_exams|0": { "acc_norm": 0.329608938547486, "acc_norm_stderr": 0.02030398121835852 }, "community|arabic_mmlu:abstract_algebra|0": { "acc_norm": 0.27, "acc_norm_stderr": 0.044619604333847394 }, "community|arabic_mmlu:anatomy|0": { "acc_norm": 0.3333333333333333, "acc_norm_stderr": 0.04072314811876837 }, "community|arabic_mmlu:astronomy|0": { "acc_norm": 0.34868421052631576, "acc_norm_stderr": 0.0387813988879761 }, "community|arabic_mmlu:business_ethics|0": { "acc_norm": 0.47, "acc_norm_stderr": 0.050161355804659205 }, "community|arabic_mmlu:clinical_knowledge|0": { "acc_norm": 0.4037735849056604, "acc_norm_stderr": 0.03019761160019795 }, "community|arabic_mmlu:college_biology|0": { "acc_norm": 0.3333333333333333, "acc_norm_stderr": 0.039420826399272135 }, "community|arabic_mmlu:college_chemistry|0": { "acc_norm": 0.31, "acc_norm_stderr": 0.04648231987117316 }, "community|arabic_mmlu:college_computer_science|0": { "acc_norm": 0.26, "acc_norm_stderr": 0.04408440022768078 }, "community|arabic_mmlu:college_mathematics|0": { "acc_norm": 0.24, "acc_norm_stderr": 0.04292346959909282 }, "community|arabic_mmlu:college_medicine|0": { "acc_norm": 0.3179190751445087, "acc_norm_stderr": 0.0355068398916558 }, "community|arabic_mmlu:college_physics|0": { "acc_norm": 0.21568627450980393, "acc_norm_stderr": 0.04092563958237655 }, "community|arabic_mmlu:computer_security|0": { "acc_norm": 0.49, "acc_norm_stderr": 0.05024183937956911 }, "community|arabic_mmlu:conceptual_physics|0": { "acc_norm": 0.32340425531914896, "acc_norm_stderr": 0.030579442773610337 }, "community|arabic_mmlu:econometrics|0": { "acc_norm": 0.2719298245614035, "acc_norm_stderr": 0.04185774424022056 }, "community|arabic_mmlu:electrical_engineering|0": { "acc_norm": 0.4, "acc_norm_stderr": 0.040824829046386284 }, "community|arabic_mmlu:elementary_mathematics|0": { "acc_norm": 0.30423280423280424, "acc_norm_stderr": 0.023695415009463087 }, "community|arabic_mmlu:formal_logic|0": { "acc_norm": 0.30952380952380953, "acc_norm_stderr": 0.041349130183033156 }, "community|arabic_mmlu:global_facts|0": { "acc_norm": 0.29, "acc_norm_stderr": 0.045604802157206845 }, "community|arabic_mmlu:high_school_biology|0": { "acc_norm": 0.3967741935483871, "acc_norm_stderr": 0.027831231605767944 }, "community|arabic_mmlu:high_school_chemistry|0": { "acc_norm": 0.3497536945812808, "acc_norm_stderr": 0.03355400904969565 }, "community|arabic_mmlu:high_school_computer_science|0": { "acc_norm": 0.36, "acc_norm_stderr": 0.04824181513244218 }, "community|arabic_mmlu:high_school_european_history|0": { "acc_norm": 0.23636363636363636, "acc_norm_stderr": 0.03317505930009179 }, "community|arabic_mmlu:high_school_geography|0": { "acc_norm": 0.3383838383838384, "acc_norm_stderr": 0.03371124142626303 }, "community|arabic_mmlu:high_school_government_and_politics|0": { "acc_norm": 0.3316062176165803, "acc_norm_stderr": 0.03397636541089116 }, "community|arabic_mmlu:high_school_macroeconomics|0": { "acc_norm": 0.3333333333333333, "acc_norm_stderr": 0.02390115797940254 }, "community|arabic_mmlu:high_school_mathematics|0": { "acc_norm": 0.31851851851851853, "acc_norm_stderr": 0.028406533090608456 }, "community|arabic_mmlu:high_school_microeconomics|0": { "acc_norm": 0.2773109243697479, "acc_norm_stderr": 0.02907937453948001 }, "community|arabic_mmlu:high_school_physics|0": { "acc_norm": 0.2913907284768212, "acc_norm_stderr": 0.037101857261199946 }, "community|arabic_mmlu:high_school_psychology|0": { "acc_norm": 0.3247706422018349, "acc_norm_stderr": 0.02007772910931033 }, "community|arabic_mmlu:high_school_statistics|0": { "acc_norm": 0.3472222222222222, "acc_norm_stderr": 0.032468872436376486 }, "community|arabic_mmlu:high_school_us_history|0": { "acc_norm": 0.2549019607843137, "acc_norm_stderr": 0.030587591351604246 }, "community|arabic_mmlu:high_school_world_history|0": { "acc_norm": 0.32489451476793246, "acc_norm_stderr": 0.030486039389105303 }, "community|arabic_mmlu:human_aging|0": { "acc_norm": 0.34080717488789236, "acc_norm_stderr": 0.03181149747055359 }, "community|arabic_mmlu:human_sexuality|0": { "acc_norm": 0.37404580152671757, "acc_norm_stderr": 0.04243869242230524 }, "community|arabic_mmlu:international_law|0": { "acc_norm": 0.48760330578512395, "acc_norm_stderr": 0.045629515481807666 }, "community|arabic_mmlu:jurisprudence|0": { "acc_norm": 0.48148148148148145, "acc_norm_stderr": 0.04830366024635331 }, "community|arabic_mmlu:logical_fallacies|0": { "acc_norm": 0.3496932515337423, "acc_norm_stderr": 0.03746668325470022 }, "community|arabic_mmlu:machine_learning|0": { "acc_norm": 0.29464285714285715, "acc_norm_stderr": 0.0432704093257873 }, "community|arabic_mmlu:management|0": { "acc_norm": 0.42718446601941745, "acc_norm_stderr": 0.04897957737781168 }, "community|arabic_mmlu:marketing|0": { "acc_norm": 0.45726495726495725, "acc_norm_stderr": 0.03263622596380688 }, "community|arabic_mmlu:medical_genetics|0": { "acc_norm": 0.28, "acc_norm_stderr": 0.04512608598542127 }, "community|arabic_mmlu:miscellaneous|0": { "acc_norm": 0.384418901660281, "acc_norm_stderr": 0.01739568874281962 }, "community|arabic_mmlu:moral_disputes|0": { "acc_norm": 0.407514450867052, "acc_norm_stderr": 0.026454578146931498 }, "community|arabic_mmlu:moral_scenarios|0": { "acc_norm": 0.2547486033519553, "acc_norm_stderr": 0.01457265038340916 }, "community|arabic_mmlu:nutrition|0": { "acc_norm": 0.43137254901960786, "acc_norm_stderr": 0.028358956313423556 }, "community|arabic_mmlu:philosophy|0": { "acc_norm": 0.40192926045016075, "acc_norm_stderr": 0.02784647600593048 }, "community|arabic_mmlu:prehistory|0": { "acc_norm": 0.33024691358024694, "acc_norm_stderr": 0.026168298456732846 }, "community|arabic_mmlu:professional_accounting|0": { "acc_norm": 0.25886524822695034, "acc_norm_stderr": 0.026129572527180844 }, "community|arabic_mmlu:professional_law|0": { "acc_norm": 0.29726205997392435, "acc_norm_stderr": 0.01167334617308604 }, "community|arabic_mmlu:professional_medicine|0": { "acc_norm": 0.2647058823529412, "acc_norm_stderr": 0.026799562024887657 }, "community|arabic_mmlu:professional_psychology|0": { "acc_norm": 0.2761437908496732, "acc_norm_stderr": 0.018087276935663137 }, "community|arabic_mmlu:public_relations|0": { "acc_norm": 0.4090909090909091, "acc_norm_stderr": 0.04709306978661896 }, "community|arabic_mmlu:security_studies|0": { "acc_norm": 0.42857142857142855, "acc_norm_stderr": 0.031680911612338825 }, "community|arabic_mmlu:sociology|0": { "acc_norm": 0.44776119402985076, "acc_norm_stderr": 0.03516184772952167 }, "community|arabic_mmlu:us_foreign_policy|0": { "acc_norm": 0.47, "acc_norm_stderr": 0.05016135580465919 }, "community|arabic_mmlu:virology|0": { "acc_norm": 0.3373493975903614, "acc_norm_stderr": 0.036807836907275814 }, "community|arabic_mmlu:world_religions|0": { "acc_norm": 0.29239766081871343, "acc_norm_stderr": 0.03488647713457922 }, "community|arc_challenge_okapi_ar|0": { "acc_norm": 0.375, "acc_norm_stderr": 0.014220469151254982 }, "community|arc_easy_ar|0": { "acc_norm": 0.383248730964467, "acc_norm_stderr": 0.010001462888736162 }, "community|boolq_ar|0": { "acc_norm": 0.7095092024539877, "acc_norm_stderr": 0.007952488057502703 }, "community|copa_ext_ar|0": { "acc_norm": 0.5333333333333333, "acc_norm_stderr": 0.05288198530254015 }, "community|hellaswag_okapi_ar|0": { "acc_norm": 0.299094973285356, "acc_norm_stderr": 0.004781338339681938 }, "community|openbook_qa_ext_ar|0": { "acc_norm": 0.4484848484848485, "acc_norm_stderr": 0.022376344379324554 }, "community|piqa_ar|0": { "acc_norm": 0.5935624659028914, "acc_norm_stderr": 0.011475388153907532 }, "community|race_ar|0": { "acc_norm": 0.4286873605193751, "acc_norm_stderr": 0.007049720616100983 }, "community|sciq_ar|0": { "acc_norm": 0.6010050251256281, "acc_norm_stderr": 0.015532078342747238 }, "community|toxigen_ar|0": { "acc_norm": 0.558288770053476, "acc_norm_stderr": 0.016248947232761816 }, "lighteval|xstory_cloze:ar|0": { "acc": 0.5784248841826605, "acc_stderr": 0.012707862131801898 }, "community|acva:_average|0": { "acc_norm": 0.6065540602982304, "acc_norm_stderr": 0.04709698815079447 }, "community|alghafa:_average|0": { "acc_norm": 0.49555527307701674, "acc_norm_stderr": 0.02259722664566892 }, "community|arabic_mmlu:_average|0": { "acc_norm": 0.3431955522216634, "acc_norm_stderr": 0.03518454291933395 } } ``` ## Dataset Details ### Dataset Description <!-- Provide a longer summary of what this dataset is. --> - **Curated by:** [More Information Needed] - **Funded by [optional]:** [More Information Needed] - **Shared by [optional]:** [More Information Needed] - **Language(s) (NLP):** [More Information Needed] - **License:** [More Information Needed] ### Dataset Sources [optional] <!-- Provide the basic links for the dataset. --> - **Repository:** [More Information Needed] - **Paper [optional]:** [More Information Needed] - **Demo [optional]:** [More Information Needed] ## Uses <!-- Address questions around how the dataset is intended to be used. --> ### Direct Use <!-- This section describes suitable use cases for the dataset. --> [More Information Needed] ### Out-of-Scope Use <!-- This section addresses misuse, malicious use, and uses that the dataset will not work well for. --> [More Information Needed] ## Dataset Structure <!-- This section provides a description of the dataset fields, and additional information about the dataset structure such as criteria used to create the splits, relationships between data points, etc. --> [More Information Needed] ## Dataset Creation ### Curation Rationale <!-- Motivation for the creation of this dataset. --> [More Information Needed] ### Source Data <!-- This section describes the source data (e.g. news text and headlines, social media posts, translated sentences, ...). --> #### Data Collection and Processing <!-- This section describes the data collection and processing process such as data selection criteria, filtering and normalization methods, tools and libraries used, etc. --> [More Information Needed] #### Who are the source data producers? <!-- This section describes the people or systems who originally created the data. It should also include self-reported demographic or identity information for the source data creators if this information is available. --> [More Information Needed] ### Annotations [optional] <!-- If the dataset contains annotations which are not part of the initial data collection, use this section to describe them. --> #### Annotation process <!-- This section describes the annotation process such as annotation tools used in the process, the amount of data annotated, annotation guidelines provided to the annotators, interannotator statistics, annotation validation, etc. --> [More Information Needed] #### Who are the annotators? <!-- This section describes the people or systems who created the annotations. --> [More Information Needed] #### Personal and Sensitive Information <!-- State whether the dataset contains data that might be considered personal, sensitive, or private (e.g., data that reveals addresses, uniquely identifiable names or aliases, racial or ethnic origins, sexual orientations, religious beliefs, political opinions, financial or health data, etc.). If efforts were made to anonymize the data, describe the anonymization process. --> [More Information Needed] ## Bias, Risks, and Limitations <!-- This section is meant to convey both technical and sociotechnical limitations. --> [More Information Needed] ### Recommendations <!-- This section is meant to convey recommendations with respect to the bias, risk, and technical limitations. --> Users should be made aware of the risks, biases and limitations of the dataset. More information needed for further recommendations. ## Citation [optional] <!-- If there is a paper or blog post introducing the dataset, the APA and Bibtex information for that should go in this section. --> **BibTeX:** [More Information Needed] **APA:** [More Information Needed] ## Glossary [optional] <!-- If relevant, include terms and calculations in this section that can help readers understand the dataset or dataset card. --> [More Information Needed] ## More Information [optional] [More Information Needed] ## Dataset Card Authors [optional] [More Information Needed] ## Dataset Card Contact [More Information Needed]
提供机构:
OALL
原始信息汇总

数据集概述

数据集简介

该数据集是在评估模型zhengr/MixTAO-7Bx2-MoE-v8.1的过程中自动创建的。数据集包含136个配置,每个配置对应一个评估任务。

数据集结构

  • 数据集由1次运行创建,每个运行可以在每个配置中找到特定的分割,分割名称使用运行的时间戳。
  • "train"分割始终指向最新的结果。
  • 额外的配置"results"存储所有运行的聚合结果。

数据加载示例

以下是加载数据集详细信息的示例代码: python from datasets import load_dataset data = load_dataset("OALL/details_zhengr__MixTAO-7Bx2-MoE-v8.1", "lighteval_xstory_cloze_ar_0", split="train")

最新结果

以下是2024-05-20T17:39:11.123569运行的最新结果: python { "all": { "acc_norm": 0.47749705985404584, "acc_norm_stderr": 0.03795069261216331, "acc": 0.5784248841826605, "acc_stderr": 0.012707862131801898 }, "community|acva:Algeria|0": { "acc_norm": 0.5794871794871795, "acc_norm_stderr": 0.035441383893034833 }, "community|acva:Ancient_Egypt|0": { "acc_norm": 0.5714285714285714, "acc_norm_stderr": 0.02792722339076032 }, "community|acva:Arab_Empire|0": { "acc_norm": 0.3660377358490566, "acc_norm_stderr": 0.029647813539365252 }, "community|acva:Arabic_Architecture|0": { "acc_norm": 0.5846153846153846, "acc_norm_stderr": 0.035380132805750295 }, "community|acva:Arabic_Art|0": { "acc_norm": 0.558974358974359, "acc_norm_stderr": 0.035647329318535786 }, "community|acva:Arabic_Astronomy|0": { "acc_norm": 0.47692307692307695, "acc_norm_stderr": 0.0358596530894741 }, "community|acva:Arabic_Calligraphy|0": { "acc_norm": 0.6862745098039216, "acc_norm_stderr": 0.02911434198875567 }, "community|acva:Arabic_Ceremony|0": { "acc_norm": 0.6486486486486487, "acc_norm_stderr": 0.03519384049793635 }, "community|acva:Arabic_Clothing|0": { "acc_norm": 0.5333333333333333, "acc_norm_stderr": 0.035818045967822315 }, "community|acva:Arabic_Culture|0": { "acc_norm": 0.6256410256410256, "acc_norm_stderr": 0.03474608430626236 }, "community|acva:Arabic_Food|0": { "acc_norm": 0.5846153846153846, "acc_norm_stderr": 0.035380132805750295 }, "community|acva:Arabic_Funeral|0": { "acc_norm": 0.7578947368421053, "acc_norm_stderr": 0.04418172153936914 }, "community|acva:Arabic_Geography|0": { "acc_norm": 0.593103448275862, "acc_norm_stderr": 0.04093793981266236 }, "community|acva:Arabic_History|0": { "acc_norm": 0.49230769230769234, "acc_norm_stderr": 0.03589365940635213 }, "community|acva:Arabic_Language_Origin|0": { "acc_norm": 0.6947368421052632, "acc_norm_stderr": 0.047498887145627784 }, "community|acva:Arabic_Literature|0": { "acc_norm": 0.7034482758620689, "acc_norm_stderr": 0.03806142687309993 }, "community|acva:Arabic_Math|0": { "acc_norm": 0.31794871794871793, "acc_norm_stderr": 0.03343383454355787 }, "community|acva:Arabic_Medicine|0": { "acc_norm": 0.6689655172413793, "acc_norm_stderr": 0.03921545312467122 }, "community|acva:Arabic_Music|0": { "acc_norm": 0.7266187050359713, "acc_norm_stderr": 0.0379400712153362 }, "community|acva:Arabic_Ornament|0": { "acc_norm": 0.7743589743589744, "acc_norm_stderr": 0.030010921825357008 }, "community|acva:Arabic_Philosophy|0": { "acc_norm": 0.6689655172413793, "acc_norm_stderr": 0.03921545312467122 }, "community|acva:Arabic_Physics_and_Chemistry|0": { "acc_norm": 0.6307692307692307, "acc_norm_stderr": 0.034648411418637566 }, "community|acva:Arabic_Wedding|0": { "acc_norm": 0.5846153846153846, "acc_norm_stderr": 0.03538013280575031 }, "community|acva:Bahrain|0": { "acc_norm": 0.6222222222222222, "acc_norm_stderr": 0.07309112127323451 }, "community|acva:Comoros|0": { "acc_norm": 0.4666666666666667, "acc_norm_stderr": 0.0752101433090355 }, "community|acva:Egypt_modern|0": { "acc_norm": 0.6421052631578947, "acc_norm_stderr": 0.04944436957628254 }, "community|acva:InfluenceFromAncientEgypt|0": { "acc_norm": 0.7538461538461538, "acc_norm_stderr": 0.030927428371225685 }, "community|acva:InfluenceFromByzantium|0": { "acc_norm": 0.8, "acc_norm_stderr": 0.0333333333333333 }, "community|acva:InfluenceFromChina|0": { "acc_norm": 0.28205128205128205, "acc_norm_stderr": 0.032307986017991154 }, "community|acva:InfluenceFromGreece|0": { "acc_norm": 0.8153846153846154, "acc_norm_stderr": 0.027855716655754165 }, "community|acva:InfluenceFromIslam|0": { "acc_norm": 0.7517241379310344, "acc_norm_stderr": 0.036001056927277716 }, "community|acva:InfluenceFromPersia|0": { "acc_norm": 0.7885714285714286, "acc_norm_stderr": 0.03095478075830146 }, "community|acva:InfluenceFromRome|0": { "acc_norm": 0.6666666666666666, "acc_norm_stderr": 0.03384487217112065 }, "community|acva:Iraq|0": { "acc_norm": 0.6, "acc_norm_stderr": 0.05345224838248487 }, "community|acva:Islam_Education|0": { "acc_norm": 0.6871794871794872, "acc_norm_stderr": 0.033287550657248546 }, "community|acva:Islam_branches_and_schools|0": { "acc_norm": 0.5657142857142857, "acc_norm_stderr": 0.037576101528126626 }, "community|acva:Islamic_law_system|0": { "acc_norm": 0.7076923076923077, "acc_norm_stderr": 0.032654383937495104 }, "community|acva:Jordan|0": { "acc_norm": 0.4666666666666667, "acc_norm_stderr": 0.0752101433090355 }, "community|acva:Kuwait|0": { "acc_norm": 0.7777777777777778, "acc_norm_stderr": 0.06267511942419628 }, "community|acva:Lebanon|0": { "acc_norm": 0.5555555555555556, "acc_norm_stderr": 0.07491109582924914 }, "community|acva:Libya|0": { "acc_norm": 0.6222222222222222, "acc_norm_stderr": 0.07309112127323451 }, "community|acva:Mauritania|0": { "acc_norm": 0.6, "acc_norm_stderr": 0.07385489458759965 }, "community|acva:Mesopotamia_civilization|0": { "acc_norm": 0.6709677419354839, "acc_norm_stderr": 0.037862535985883836 }, "community|acva:Morocco|0": { "acc_norm": 0.6222222222222222, "acc_norm_stderr": 0.07309112127323451 }, "community|acva:Oman|0": { "acc_norm": 0.6, "acc_norm_stderr": 0.07385489458759965 }, "community|acva:Palestine|0": { "acc_norm": 0.5411764705882353, "acc_norm_stderr": 0.0543691634273002 }, "community|acva:Qatar|0": { "acc_norm": 0.6444444444444445, "acc_norm_stderr": 0.07216392363431012 }, "community|acva:Saudi_Arabia|0": { "acc_norm": 0.6205128205128205, "acc_norm_stderr": 0.03483959266365358 }, "community|acva:Somalia|0": { "acc_norm": 0.6222222222222222, "acc_norm_stderr": 0.0730

搜集汇总
数据集介绍
main_image_url
构建方式
该数据集是在对模型 zhengr/MixTAO-7Bx2-MoE-v8.1 进行评估的过程中自动生成的。它涵盖了 136 个配置,每个配置对应一个被评估的任务。数据集基于单次运行创建,每次运行在配置中作为一个独立的分割存在,分割采用时间戳命名。其中,“train”分割始终指向最新的评估结果。此外,还包含一个名为“results”的额外配置,用于存储所有运行的综合结果。
特点
数据集结构精巧,将每个评估任务独立配置,便于按需访问。其分割机制灵活,时间戳命名确保了结果的可追溯性,而“train”分割自动指向最新数据,简化了迭代分析。结果配置汇总了全部任务的聚合指标,如准确率及其标准误差,为模型性能的宏观审视提供了便利。整体上,该数据集兼具细粒度与整体性,特别适合多任务评估场景。
使用方法
用户可通过 Hugging Face 的 datasets 库便捷加载数据。例如,使用 load_dataset 函数指定数据集名称和任务配置(如 lighteval_xstory_cloze_ar_0),并设置分割为“train”即可获取最新结果。对于历史运行,可依据时间戳分割进行回溯。此外,通过访问“results”配置,能够直接获取所有任务的聚合性能数据,适用于对比分析和报告生成。
背景与挑战
背景概述
在自然语言处理领域,多语言模型评估一直是衡量模型泛化能力的关键环节。OALL/details_zhengr__MixTAO-7Bx2-MoE-v8.1数据集由Open Arabic LLM Leaderboard于2024年5月创建,旨在系统评估混合专家模型zhengr/MixTAO-7Bx2-MoE-v8.1在多种阿拉伯语任务上的表现。该数据集涵盖136个配置,对应136项评估任务,涉及阿拉伯文化知识、方言理解、情感分析、学术考试等多个维度,为阿拉伯语大语言模型的性能评测提供了标准化基准。其核心研究问题在于:混合专家架构能否在阿拉伯语这一低资源语言上实现知识的高效融合与迁移。该数据集的发布推动了阿拉伯语NLP评测体系的完善,为后续多语言模型优化提供了重要参考。
当前挑战
该数据集所解决的领域问题在于,阿拉伯语大语言模型的性能评估长期缺乏统一、多维度的基准,现有评测多聚焦于英语或高资源语言,导致阿拉伯语模型在文化理解、方言识别等任务上的能力难以量化。构建过程中面临的挑战尤为突出:其一,阿拉伯语方言差异显著,从现代标准阿拉伯语到各地口语变体,需设计覆盖广泛且平衡的评测任务;其二,数据集需整合136个异构任务,包括文化知识、学术考试、情感分析等,确保各任务间评估标准的一致性;其三,混合专家模型的评估涉及多轮运行结果的管理,需在数据集中实现时间戳分片与最新结果自动指向,增加了数据存储与版本控制的复杂性。
常用场景
经典使用场景
在自然语言处理与多语言模型评估的交叉领域中,OALL/details_zhengr__MixTAO-7Bx2-MoE-v8.1数据集为研究者提供了对混合专家模型(MoE)在阿拉伯语及多元文化场景下性能的精细化评测平台。该数据集囊括了136项任务配置,覆盖从阿拉伯历史、文学、医学到现代科技与社会生活等广泛主题,特别适用于评估模型在多语言、多文化语境中的知识推理与语言理解能力。其经典用法在于通过标准化评测框架(如lighteval)加载各任务的最新结果,从而系统性地分析模型在阿拉伯语特定领域(如阿拉伯书法、伊斯兰法律体系)与通用知识基准(如Arabic MMLU)上的表现,为多语言MoE模型的优化提供实证依据。
衍生相关工作
该数据集衍生了一系列关于混合专家模型评估与优化的前沿工作。研究者基于其136个任务的细粒度结果,分析了MoE路由策略在不同文化主题上的稀疏激活模式,催生了针对阿拉伯语特定领域的专家网络微调方法。同时,数据集中的Arabic MMLU子集(覆盖57个学科)被广泛用作跨语言知识迁移的测试床,推动了多语言LLM在低资源场景下知识蒸馏与持续预训练策略的研究。此外,该数据集与lighteval框架的深度整合,激励了社区开发自动化评测流水线,使得后续模型(如更大规模的MoE变体)能够无缝接入同一评估体系,形成了模型迭代与性能追踪的闭环生态。
数据集最近研究
最新研究方向
该数据集聚焦于对混合专家模型(MoE)在阿拉伯语及多元文化语境下的评估,是当前多语言自然语言处理前沿方向的重要实践。随着大规模语言模型在低资源语言与跨文化场景中的部署需求激增,MixTAO-7Bx2-MoE-v8.1的评测结果揭示了模型在阿拉伯语方言、历史文明(如古埃及、美索不达米亚)及现代学科(如医学、法律)上的表现差异。其意义在于为细粒度文化适应性评估提供了基准,尤其呼应了近期阿拉伯语AI社区对模型在宗教、地域知识及情感理解中偏差问题的关注,推动了更具包容性的多模态评估体系的构建。
以上内容由遇见数据集搜集并总结生成
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作