five

OALL/details_01-ai__Yi-1.5-9B-Chat

收藏
Hugging Face2024-05-17 更新2024-06-12 收录
下载链接:
https://hf-mirror.com/datasets/OALL/details_01-ai__Yi-1.5-9B-Chat
下载链接
链接失效反馈
官方服务:
资源简介:
--- pretty_name: Evaluation run of 01-ai/Yi-1.5-9B-Chat dataset_summary: "Dataset automatically created during the evaluation run of model\ \ [01-ai/Yi-1.5-9B-Chat](https://huggingface.co/01-ai/Yi-1.5-9B-Chat).\n\nThe dataset\ \ is composed of 136 configuration, each one coresponding to one of the evaluated\ \ task.\n\nThe dataset has been created from 1 run(s). Each run can be found as\ \ a specific split in each configuration, the split being named using the timestamp\ \ of the run.The \"train\" split is always pointing to the latest results.\n\nAn\ \ additional configuration \"results\" store all the aggregated results of the run.\n\ \nTo load the details from a run, you can for instance do the following:\n```python\n\ from datasets import load_dataset\ndata = load_dataset(\"OALL/details_01-ai__Yi-1.5-9B-Chat\"\ ,\n\t\"lighteval_xstory_cloze_ar_0\",\n\tsplit=\"train\")\n```\n\n## Latest results\n\ \nThese are the [latest results from run 2024-05-17T21:28:15.479048](https://huggingface.co/datasets/OALL/details_01-ai__Yi-1.5-9B-Chat/blob/main/results_2024-05-17T21-28-15.479048.json)(note\ \ that their might be results for other tasks in the repos if successive evals didn't\ \ cover the same tasks. You find each in the results and the \"latest\" split for\ \ each eval):\n\n```python\n{\n \"all\": {\n \"acc_norm\": 0.397794446745894,\n\ \ \"acc_norm_stderr\": 0.03764570531373537,\n \"acc\": 0.5413633355393779,\n\ \ \"acc_stderr\": 0.01282302034016982\n },\n \"community|acva:Algeria|0\"\ : {\n \"acc_norm\": 0.5333333333333333,\n \"acc_norm_stderr\": 0.03581804596782232\n\ \ },\n \"community|acva:Ancient_Egypt|0\": {\n \"acc_norm\": 0.24761904761904763,\n\ \ \"acc_norm_stderr\": 0.0243582507291411\n },\n \"community|acva:Arab_Empire|0\"\ : {\n \"acc_norm\": 0.5660377358490566,\n \"acc_norm_stderr\": 0.030503292013342596\n\ \ },\n \"community|acva:Arabic_Architecture|0\": {\n \"acc_norm\":\ \ 0.5435897435897435,\n \"acc_norm_stderr\": 0.03576123096991215\n },\n\ \ \"community|acva:Arabic_Art|0\": {\n \"acc_norm\": 0.47692307692307695,\n\ \ \"acc_norm_stderr\": 0.0358596530894741\n },\n \"community|acva:Arabic_Astronomy|0\"\ : {\n \"acc_norm\": 0.4666666666666667,\n \"acc_norm_stderr\": 0.03581804596782233\n\ \ },\n \"community|acva:Arabic_Calligraphy|0\": {\n \"acc_norm\": 0.6235294117647059,\n\ \ \"acc_norm_stderr\": 0.030400248938906704\n },\n \"community|acva:Arabic_Ceremony|0\"\ : {\n \"acc_norm\": 0.5945945945945946,\n \"acc_norm_stderr\": 0.03619481276442171\n\ \ },\n \"community|acva:Arabic_Clothing|0\": {\n \"acc_norm\": 0.5333333333333333,\n\ \ \"acc_norm_stderr\": 0.03581804596782232\n },\n \"community|acva:Arabic_Culture|0\"\ : {\n \"acc_norm\": 0.49230769230769234,\n \"acc_norm_stderr\": 0.03589365940635213\n\ \ },\n \"community|acva:Arabic_Food|0\": {\n \"acc_norm\": 0.5692307692307692,\n\ \ \"acc_norm_stderr\": 0.03555213252058761\n },\n \"community|acva:Arabic_Funeral|0\"\ : {\n \"acc_norm\": 0.4,\n \"acc_norm_stderr\": 0.050529115263991134\n\ \ },\n \"community|acva:Arabic_Geography|0\": {\n \"acc_norm\": 0.6206896551724138,\n\ \ \"acc_norm_stderr\": 0.04043461861916747\n },\n \"community|acva:Arabic_History|0\"\ : {\n \"acc_norm\": 0.2923076923076923,\n \"acc_norm_stderr\": 0.032654383937495125\n\ \ },\n \"community|acva:Arabic_Language_Origin|0\": {\n \"acc_norm\"\ : 0.6105263157894737,\n \"acc_norm_stderr\": 0.05029529117145395\n },\n\ \ \"community|acva:Arabic_Literature|0\": {\n \"acc_norm\": 0.5379310344827586,\n\ \ \"acc_norm_stderr\": 0.041546596717075474\n },\n \"community|acva:Arabic_Math|0\"\ : {\n \"acc_norm\": 0.3230769230769231,\n \"acc_norm_stderr\": 0.03357544396403133\n\ \ },\n \"community|acva:Arabic_Medicine|0\": {\n \"acc_norm\": 0.6758620689655173,\n\ \ \"acc_norm_stderr\": 0.03900432069185555\n },\n \"community|acva:Arabic_Music|0\"\ : {\n \"acc_norm\": 0.302158273381295,\n \"acc_norm_stderr\": 0.03908914479291562\n\ \ },\n \"community|acva:Arabic_Ornament|0\": {\n \"acc_norm\": 0.7384615384615385,\n\ \ \"acc_norm_stderr\": 0.0315522880274276\n },\n \"community|acva:Arabic_Philosophy|0\"\ : {\n \"acc_norm\": 0.5793103448275863,\n \"acc_norm_stderr\": 0.0411391498118926\n\ \ },\n \"community|acva:Arabic_Physics_and_Chemistry|0\": {\n \"acc_norm\"\ : 0.5384615384615384,\n \"acc_norm_stderr\": 0.03579154352544572\n },\n\ \ \"community|acva:Arabic_Wedding|0\": {\n \"acc_norm\": 0.48205128205128206,\n\ \ \"acc_norm_stderr\": 0.035874770987738294\n },\n \"community|acva:Bahrain|0\"\ : {\n \"acc_norm\": 0.5777777777777777,\n \"acc_norm_stderr\": 0.07446027270295806\n\ \ },\n \"community|acva:Comoros|0\": {\n \"acc_norm\": 0.4,\n \ \ \"acc_norm_stderr\": 0.07385489458759965\n },\n \"community|acva:Egypt_modern|0\"\ : {\n \"acc_norm\": 0.5263157894736842,\n \"acc_norm_stderr\": 0.05149958471474543\n\ \ },\n \"community|acva:InfluenceFromAncientEgypt|0\": {\n \"acc_norm\"\ : 0.7128205128205128,\n \"acc_norm_stderr\": 0.03248373338539887\n },\n\ \ \"community|acva:InfluenceFromByzantium|0\": {\n \"acc_norm\": 0.7241379310344828,\n\ \ \"acc_norm_stderr\": 0.03724563619774632\n },\n \"community|acva:InfluenceFromChina|0\"\ : {\n \"acc_norm\": 0.3333333333333333,\n \"acc_norm_stderr\": 0.033844872171120644\n\ \ },\n \"community|acva:InfluenceFromGreece|0\": {\n \"acc_norm\":\ \ 0.6717948717948717,\n \"acc_norm_stderr\": 0.033712437824137076\n },\n\ \ \"community|acva:InfluenceFromIslam|0\": {\n \"acc_norm\": 0.4482758620689655,\n\ \ \"acc_norm_stderr\": 0.04144311810878151\n },\n \"community|acva:InfluenceFromPersia|0\"\ : {\n \"acc_norm\": 0.8114285714285714,\n \"acc_norm_stderr\": 0.029654354112075433\n\ \ },\n \"community|acva:InfluenceFromRome|0\": {\n \"acc_norm\": 0.5846153846153846,\n\ \ \"acc_norm_stderr\": 0.035380132805750295\n },\n \"community|acva:Iraq|0\"\ : {\n \"acc_norm\": 0.5411764705882353,\n \"acc_norm_stderr\": 0.0543691634273002\n\ \ },\n \"community|acva:Islam_Education|0\": {\n \"acc_norm\": 0.4512820512820513,\n\ \ \"acc_norm_stderr\": 0.03572709860318392\n },\n \"community|acva:Islam_branches_and_schools|0\"\ : {\n \"acc_norm\": 0.4514285714285714,\n \"acc_norm_stderr\": 0.03772562898529836\n\ \ },\n \"community|acva:Islamic_law_system|0\": {\n \"acc_norm\": 0.4461538461538462,\n\ \ \"acc_norm_stderr\": 0.03568913546569232\n },\n \"community|acva:Jordan|0\"\ : {\n \"acc_norm\": 0.35555555555555557,\n \"acc_norm_stderr\": 0.07216392363431012\n\ \ },\n \"community|acva:Kuwait|0\": {\n \"acc_norm\": 0.28888888888888886,\n\ \ \"acc_norm_stderr\": 0.06832943242540508\n },\n \"community|acva:Lebanon|0\"\ : {\n \"acc_norm\": 0.4444444444444444,\n \"acc_norm_stderr\": 0.07491109582924915\n\ \ },\n \"community|acva:Libya|0\": {\n \"acc_norm\": 0.5777777777777777,\n\ \ \"acc_norm_stderr\": 0.07446027270295806\n },\n \"community|acva:Mauritania|0\"\ : {\n \"acc_norm\": 0.5111111111111111,\n \"acc_norm_stderr\": 0.07535922203472523\n\ \ },\n \"community|acva:Mesopotamia_civilization|0\": {\n \"acc_norm\"\ : 0.5612903225806452,\n \"acc_norm_stderr\": 0.03998729476451436\n },\n\ \ \"community|acva:Morocco|0\": {\n \"acc_norm\": 0.28888888888888886,\n\ \ \"acc_norm_stderr\": 0.06832943242540507\n },\n \"community|acva:Oman|0\"\ : {\n \"acc_norm\": 0.28888888888888886,\n \"acc_norm_stderr\": 0.06832943242540508\n\ \ },\n \"community|acva:Palestine|0\": {\n \"acc_norm\": 0.3411764705882353,\n\ \ \"acc_norm_stderr\": 0.051729042973619264\n },\n \"community|acva:Qatar|0\"\ : {\n \"acc_norm\": 0.4444444444444444,\n \"acc_norm_stderr\": 0.07491109582924915\n\ \ },\n \"community|acva:Saudi_Arabia|0\": {\n \"acc_norm\": 0.3435897435897436,\n\ \ \"acc_norm_stderr\": 0.03409627301409855\n },\n \"community|acva:Somalia|0\"\ : {\n \"acc_norm\": 0.26666666666666666,\n \"acc_norm_stderr\": 0.06666666666666665\n\ \ },\n \"community|acva:Sudan|0\": {\n \"acc_norm\": 0.4222222222222222,\n\ \ \"acc_norm_stderr\": 0.07446027270295806\n },\n \"community|acva:Syria|0\"\ : {\n \"acc_norm\": 0.37777777777777777,\n \"acc_norm_stderr\": 0.07309112127323451\n\ \ },\n \"community|acva:Tunisia|0\": {\n \"acc_norm\": 0.35555555555555557,\n\ \ \"acc_norm_stderr\": 0.07216392363431012\n },\n \"community|acva:United_Arab_Emirates|0\"\ : {\n \"acc_norm\": 0.4235294117647059,\n \"acc_norm_stderr\": 0.05391265523477461\n\ \ },\n \"community|acva:Yemen|0\": {\n \"acc_norm\": 0.3,\n \ \ \"acc_norm_stderr\": 0.15275252316519464\n },\n \"community|acva:communication|0\"\ : {\n \"acc_norm\": 0.49175824175824173,\n \"acc_norm_stderr\": 0.026239628591083888\n\ \ },\n \"community|acva:computer_and_phone|0\": {\n \"acc_norm\": 0.5559322033898305,\n\ \ \"acc_norm_stderr\": 0.02897756513294154\n },\n \"community|acva:daily_life|0\"\ : {\n \"acc_norm\": 0.2878338278931751,\n \"acc_norm_stderr\": 0.024699715357282315\n\ \ },\n \"community|acva:entertainment|0\": {\n \"acc_norm\": 0.3559322033898305,\n\ \ \"acc_norm_stderr\": 0.027923880374505525\n },\n \"community|alghafa:mcq_exams_test_ar|0\"\ : {\n \"acc_norm\": 0.2800718132854578,\n \"acc_norm_stderr\": 0.019043286203795345\n\ \ },\n \"community|alghafa:meta_ar_dialects|0\": {\n \"acc_norm\":\ \ 0.2895273401297498,\n \"acc_norm_stderr\": 0.006175370293841651\n },\n\ \ \"community|alghafa:meta_ar_msa|0\": {\n \"acc_norm\": 0.32849162011173183,\n\ \ \"acc_norm_stderr\": 0.015707935398496457\n },\n \"community|alghafa:multiple_choice_facts_truefalse_balanced_task|0\"\ : {\n \"acc_norm\": 0.5333333333333333,\n \"acc_norm_stderr\": 0.05799451149344531\n\ \ },\n \"community|alghafa:multiple_choice_grounded_statement_soqal_task|0\"\ : {\n \"acc_norm\": 0.56,\n \"acc_norm_stderr\": 0.040665603096078445\n\ \ },\n \"community|alghafa:multiple_choice_grounded_statement_xglue_mlqa_task|0\"\ : {\n \"acc_norm\": 0.38,\n \"acc_norm_stderr\": 0.039764406869602295\n\ \ },\n \"community|alghafa:multiple_choice_rating_sentiment_no_neutral_task|0\"\ : {\n \"acc_norm\": 0.7972482801751094,\n \"acc_norm_stderr\": 0.004496731917745599\n\ \ },\n \"community|alghafa:multiple_choice_rating_sentiment_task|0\": {\n\ \ \"acc_norm\": 0.5279399499582986,\n \"acc_norm_stderr\": 0.006448111196626818\n\ \ },\n \"community|alghafa:multiple_choice_sentiment_task|0\": {\n \ \ \"acc_norm\": 0.3843023255813954,\n \"acc_norm_stderr\": 0.011732277442725819\n\ \ },\n \"community|arabic_exams|0\": {\n \"acc_norm\": 0.2811918063314711,\n\ \ \"acc_norm_stderr\": 0.019418936671758837\n },\n \"community|arabic_mmlu:abstract_algebra|0\"\ : {\n \"acc_norm\": 0.34,\n \"acc_norm_stderr\": 0.047609522856952365\n\ \ },\n \"community|arabic_mmlu:anatomy|0\": {\n \"acc_norm\": 0.28888888888888886,\n\ \ \"acc_norm_stderr\": 0.03915450630414251\n },\n \"community|arabic_mmlu:astronomy|0\"\ : {\n \"acc_norm\": 0.3223684210526316,\n \"acc_norm_stderr\": 0.038035102483515854\n\ \ },\n \"community|arabic_mmlu:business_ethics|0\": {\n \"acc_norm\"\ : 0.36,\n \"acc_norm_stderr\": 0.04824181513244218\n },\n \"community|arabic_mmlu:clinical_knowledge|0\"\ : {\n \"acc_norm\": 0.2830188679245283,\n \"acc_norm_stderr\": 0.0277242364927009\n\ \ },\n \"community|arabic_mmlu:college_biology|0\": {\n \"acc_norm\"\ : 0.2916666666666667,\n \"acc_norm_stderr\": 0.03800968060554857\n },\n\ \ \"community|arabic_mmlu:college_chemistry|0\": {\n \"acc_norm\": 0.2,\n\ \ \"acc_norm_stderr\": 0.040201512610368445\n },\n \"community|arabic_mmlu:college_computer_science|0\"\ : {\n \"acc_norm\": 0.2,\n \"acc_norm_stderr\": 0.04020151261036843\n\ \ },\n \"community|arabic_mmlu:college_mathematics|0\": {\n \"acc_norm\"\ : 0.27,\n \"acc_norm_stderr\": 0.044619604333847415\n },\n \"community|arabic_mmlu:college_medicine|0\"\ : {\n \"acc_norm\": 0.23699421965317918,\n \"acc_norm_stderr\": 0.03242414757483099\n\ \ },\n \"community|arabic_mmlu:college_physics|0\": {\n \"acc_norm\"\ : 0.24509803921568626,\n \"acc_norm_stderr\": 0.04280105837364396\n },\n\ \ \"community|arabic_mmlu:computer_security|0\": {\n \"acc_norm\": 0.45,\n\ \ \"acc_norm_stderr\": 0.05\n },\n \"community|arabic_mmlu:conceptual_physics|0\"\ : {\n \"acc_norm\": 0.2680851063829787,\n \"acc_norm_stderr\": 0.028957342788342347\n\ \ },\n \"community|arabic_mmlu:econometrics|0\": {\n \"acc_norm\":\ \ 0.30701754385964913,\n \"acc_norm_stderr\": 0.04339138322579861\n },\n\ \ \"community|arabic_mmlu:electrical_engineering|0\": {\n \"acc_norm\"\ : 0.41379310344827586,\n \"acc_norm_stderr\": 0.04104269211806232\n },\n\ \ \"community|arabic_mmlu:elementary_mathematics|0\": {\n \"acc_norm\"\ : 0.36772486772486773,\n \"acc_norm_stderr\": 0.02483383982556242\n },\n\ \ \"community|arabic_mmlu:formal_logic|0\": {\n \"acc_norm\": 0.2619047619047619,\n\ \ \"acc_norm_stderr\": 0.03932537680392871\n },\n \"community|arabic_mmlu:global_facts|0\"\ : {\n \"acc_norm\": 0.31,\n \"acc_norm_stderr\": 0.04648231987117316\n\ \ },\n \"community|arabic_mmlu:high_school_biology|0\": {\n \"acc_norm\"\ : 0.3258064516129032,\n \"acc_norm_stderr\": 0.026662010578567104\n },\n\ \ \"community|arabic_mmlu:high_school_chemistry|0\": {\n \"acc_norm\"\ : 0.2955665024630542,\n \"acc_norm_stderr\": 0.032104944337514575\n },\n\ \ \"community|arabic_mmlu:high_school_computer_science|0\": {\n \"acc_norm\"\ : 0.42,\n \"acc_norm_stderr\": 0.049604496374885836\n },\n \"community|arabic_mmlu:high_school_european_history|0\"\ : {\n \"acc_norm\": 0.23636363636363636,\n \"acc_norm_stderr\": 0.033175059300091805\n\ \ },\n \"community|arabic_mmlu:high_school_geography|0\": {\n \"acc_norm\"\ : 0.32323232323232326,\n \"acc_norm_stderr\": 0.03332299921070644\n },\n\ \ \"community|arabic_mmlu:high_school_government_and_politics|0\": {\n \ \ \"acc_norm\": 0.30569948186528495,\n \"acc_norm_stderr\": 0.033248379397581594\n\ \ },\n \"community|arabic_mmlu:high_school_macroeconomics|0\": {\n \ \ \"acc_norm\": 0.3230769230769231,\n \"acc_norm_stderr\": 0.023710888501970562\n\ \ },\n \"community|arabic_mmlu:high_school_mathematics|0\": {\n \"\ acc_norm\": 0.3074074074074074,\n \"acc_norm_stderr\": 0.028133252578815632\n\ \ },\n \"community|arabic_mmlu:high_school_microeconomics|0\": {\n \ \ \"acc_norm\": 0.2647058823529412,\n \"acc_norm_stderr\": 0.028657491285071963\n\ \ },\n \"community|arabic_mmlu:high_school_physics|0\": {\n \"acc_norm\"\ : 0.2847682119205298,\n \"acc_norm_stderr\": 0.03684881521389023\n },\n\ \ \"community|arabic_mmlu:high_school_psychology|0\": {\n \"acc_norm\"\ : 0.25688073394495414,\n \"acc_norm_stderr\": 0.018732492928342462\n },\n\ \ \"community|arabic_mmlu:high_school_statistics|0\": {\n \"acc_norm\"\ : 0.24074074074074073,\n \"acc_norm_stderr\": 0.02915752218460562\n },\n\ \ \"community|arabic_mmlu:high_school_us_history|0\": {\n \"acc_norm\"\ : 0.22058823529411764,\n \"acc_norm_stderr\": 0.029102254389674065\n },\n\ \ \"community|arabic_mmlu:high_school_world_history|0\": {\n \"acc_norm\"\ : 0.26582278481012656,\n \"acc_norm_stderr\": 0.02875679962965834\n },\n\ \ \"community|arabic_mmlu:human_aging|0\": {\n \"acc_norm\": 0.29596412556053814,\n\ \ \"acc_norm_stderr\": 0.0306365913486998\n },\n \"community|arabic_mmlu:human_sexuality|0\"\ : {\n \"acc_norm\": 0.2824427480916031,\n \"acc_norm_stderr\": 0.03948406125768362\n\ \ },\n \"community|arabic_mmlu:international_law|0\": {\n \"acc_norm\"\ : 0.4214876033057851,\n \"acc_norm_stderr\": 0.04507732278775094\n },\n\ \ \"community|arabic_mmlu:jurisprudence|0\": {\n \"acc_norm\": 0.3148148148148148,\n\ \ \"acc_norm_stderr\": 0.04489931073591312\n },\n \"community|arabic_mmlu:logical_fallacies|0\"\ : {\n \"acc_norm\": 0.3067484662576687,\n \"acc_norm_stderr\": 0.036230899157241474\n\ \ },\n \"community|arabic_mmlu:machine_learning|0\": {\n \"acc_norm\"\ : 0.26785714285714285,\n \"acc_norm_stderr\": 0.04203277291467762\n },\n\ \ \"community|arabic_mmlu:management|0\": {\n \"acc_norm\": 0.23300970873786409,\n\ \ \"acc_norm_stderr\": 0.041858325989283136\n },\n \"community|arabic_mmlu:marketing|0\"\ : {\n \"acc_norm\": 0.3888888888888889,\n \"acc_norm_stderr\": 0.031937057262002924\n\ \ },\n \"community|arabic_mmlu:medical_genetics|0\": {\n \"acc_norm\"\ : 0.28,\n \"acc_norm_stderr\": 0.04512608598542128\n },\n \"community|arabic_mmlu:miscellaneous|0\"\ : {\n \"acc_norm\": 0.32567049808429116,\n \"acc_norm_stderr\": 0.016757989458549682\n\ \ },\n \"community|arabic_mmlu:moral_disputes|0\": {\n \"acc_norm\"\ : 0.3959537572254335,\n \"acc_norm_stderr\": 0.02632981334194624\n },\n\ \ \"community|arabic_mmlu:moral_scenarios|0\": {\n \"acc_norm\": 0.23798882681564246,\n\ \ \"acc_norm_stderr\": 0.014242630070574901\n },\n \"community|arabic_mmlu:nutrition|0\"\ : {\n \"acc_norm\": 0.3300653594771242,\n \"acc_norm_stderr\": 0.026925654653615693\n\ \ },\n \"community|arabic_mmlu:philosophy|0\": {\n \"acc_norm\": 0.34726688102893893,\n\ \ \"acc_norm_stderr\": 0.027040745502307336\n },\n \"community|arabic_mmlu:prehistory|0\"\ : {\n \"acc_norm\": 0.3425925925925926,\n \"acc_norm_stderr\": 0.026406145973625665\n\ \ },\n \"community|arabic_mmlu:professional_accounting|0\": {\n \"\ acc_norm\": 0.30851063829787234,\n \"acc_norm_stderr\": 0.027553366165101376\n\ \ },\n \"community|arabic_mmlu:professional_law|0\": {\n \"acc_norm\"\ : 0.27249022164276404,\n \"acc_norm_stderr\": 0.011371658294311535\n },\n\ \ \"community|arabic_mmlu:professional_medicine|0\": {\n \"acc_norm\"\ : 0.1875,\n \"acc_norm_stderr\": 0.023709788253811766\n },\n \"community|arabic_mmlu:professional_psychology|0\"\ : {\n \"acc_norm\": 0.30718954248366015,\n \"acc_norm_stderr\": 0.018663359671463667\n\ \ },\n \"community|arabic_mmlu:public_relations|0\": {\n \"acc_norm\"\ : 0.24545454545454545,\n \"acc_norm_stderr\": 0.041220665028782834\n },\n\ \ \"community|arabic_mmlu:security_studies|0\": {\n \"acc_norm\": 0.2979591836734694,\n\ \ \"acc_norm_stderr\": 0.029279567411065684\n },\n \"community|arabic_mmlu:sociology|0\"\ : {\n \"acc_norm\": 0.3880597014925373,\n \"acc_norm_stderr\": 0.03445789964362749\n\ \ },\n \"community|arabic_mmlu:us_foreign_policy|0\": {\n \"acc_norm\"\ : 0.42,\n \"acc_norm_stderr\": 0.049604496374885836\n },\n \"community|arabic_mmlu:virology|0\"\ : {\n \"acc_norm\": 0.3132530120481928,\n \"acc_norm_stderr\": 0.03610805018031024\n\ \ },\n \"community|arabic_mmlu:world_religions|0\": {\n \"acc_norm\"\ : 0.27485380116959063,\n \"acc_norm_stderr\": 0.034240429246915824\n },\n\ \ \"community|arc_challenge_okapi_ar|0\": {\n \"acc_norm\": 0.3293103448275862,\n\ \ \"acc_norm_stderr\": 0.013804534699579278\n },\n \"community|arc_easy_ar|0\"\ : {\n \"acc_norm\": 0.3236040609137056,\n \"acc_norm_stderr\": 0.009624443258161308\n\ \ },\n \"community|boolq_ar|0\": {\n \"acc_norm\": 0.7085889570552147,\n\ \ \"acc_norm_stderr\": 0.007959907341375319\n },\n \"community|copa_ext_ar|0\"\ : {\n \"acc_norm\": 0.5222222222222223,\n \"acc_norm_stderr\": 0.05294752255076824\n\ \ },\n \"community|hellaswag_okapi_ar|0\": {\n \"acc_norm\": 0.26529277069021917,\n\ \ \"acc_norm_stderr\": 0.004610363799431674\n },\n \"community|openbook_qa_ext_ar|0\"\ : {\n \"acc_norm\": 0.36767676767676766,\n \"acc_norm_stderr\": 0.02169397769879489\n\ \ },\n \"community|piqa_ar|0\": {\n \"acc_norm\": 0.5368248772504092,\n\ \ \"acc_norm_stderr\": 0.01165000722527945\n },\n \"community|race_ar|0\"\ : {\n \"acc_norm\": 0.3327246906066139,\n \"acc_norm_stderr\": 0.006712119702934689\n\ \ },\n \"community|sciq_ar|0\": {\n \"acc_norm\": 0.48542713567839196,\n\ \ \"acc_norm_stderr\": 0.01585229964546976\n },\n \"community|toxigen_ar|0\"\ : {\n \"acc_norm\": 0.4834224598930481,\n \"acc_norm_stderr\": 0.016351505086413663\n\ \ },\n \"lighteval|xstory_cloze:ar|0\": {\n \"acc\": 0.5413633355393779,\n\ \ \"acc_stderr\": 0.01282302034016982\n },\n \"community|acva:_average|0\"\ : {\n \"acc_norm\": 0.47778978781322307,\n \"acc_norm_stderr\": 0.047483596881547824\n\ \ },\n \"community|alghafa:_average|0\": {\n \"acc_norm\": 0.4534349625083418,\n\ \ \"acc_norm_stderr\": 0.022447581545817528\n },\n \"community|arabic_mmlu:_average|0\"\ : {\n \"acc_norm\": 0.30303933090891266,\n \"acc_norm_stderr\": 0.03413101309881009\n\ \ }\n}\n```" repo_url: https://huggingface.co/01-ai/Yi-1.5-9B-Chat configs: - config_name: community_acva_Algeria_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|acva:Algeria|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|acva:Algeria|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_acva_Ancient_Egypt_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|acva:Ancient_Egypt|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|acva:Ancient_Egypt|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_acva_Arab_Empire_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|acva:Arab_Empire|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|acva:Arab_Empire|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_acva_Arabic_Architecture_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|acva:Arabic_Architecture|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|acva:Arabic_Architecture|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_acva_Arabic_Art_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|acva:Arabic_Art|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|acva:Arabic_Art|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_acva_Arabic_Astronomy_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|acva:Arabic_Astronomy|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|acva:Arabic_Astronomy|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_acva_Arabic_Calligraphy_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|acva:Arabic_Calligraphy|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|acva:Arabic_Calligraphy|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_acva_Arabic_Ceremony_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|acva:Arabic_Ceremony|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|acva:Arabic_Ceremony|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_acva_Arabic_Clothing_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|acva:Arabic_Clothing|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|acva:Arabic_Clothing|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_acva_Arabic_Culture_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|acva:Arabic_Culture|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|acva:Arabic_Culture|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_acva_Arabic_Food_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|acva:Arabic_Food|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|acva:Arabic_Food|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_acva_Arabic_Funeral_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|acva:Arabic_Funeral|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|acva:Arabic_Funeral|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_acva_Arabic_Geography_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|acva:Arabic_Geography|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|acva:Arabic_Geography|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_acva_Arabic_History_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|acva:Arabic_History|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|acva:Arabic_History|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_acva_Arabic_Language_Origin_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|acva:Arabic_Language_Origin|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|acva:Arabic_Language_Origin|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_acva_Arabic_Literature_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|acva:Arabic_Literature|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|acva:Arabic_Literature|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_acva_Arabic_Math_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|acva:Arabic_Math|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|acva:Arabic_Math|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_acva_Arabic_Medicine_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|acva:Arabic_Medicine|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|acva:Arabic_Medicine|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_acva_Arabic_Music_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|acva:Arabic_Music|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|acva:Arabic_Music|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_acva_Arabic_Ornament_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|acva:Arabic_Ornament|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|acva:Arabic_Ornament|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_acva_Arabic_Philosophy_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|acva:Arabic_Philosophy|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|acva:Arabic_Philosophy|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_acva_Arabic_Physics_and_Chemistry_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|acva:Arabic_Physics_and_Chemistry|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|acva:Arabic_Physics_and_Chemistry|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_acva_Arabic_Wedding_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|acva:Arabic_Wedding|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|acva:Arabic_Wedding|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_acva_Bahrain_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|acva:Bahrain|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|acva:Bahrain|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_acva_Comoros_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|acva:Comoros|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|acva:Comoros|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_acva_Egypt_modern_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|acva:Egypt_modern|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|acva:Egypt_modern|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_acva_InfluenceFromAncientEgypt_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|acva:InfluenceFromAncientEgypt|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|acva:InfluenceFromAncientEgypt|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_acva_InfluenceFromByzantium_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|acva:InfluenceFromByzantium|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|acva:InfluenceFromByzantium|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_acva_InfluenceFromChina_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|acva:InfluenceFromChina|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|acva:InfluenceFromChina|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_acva_InfluenceFromGreece_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|acva:InfluenceFromGreece|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|acva:InfluenceFromGreece|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_acva_InfluenceFromIslam_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|acva:InfluenceFromIslam|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|acva:InfluenceFromIslam|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_acva_InfluenceFromPersia_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|acva:InfluenceFromPersia|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|acva:InfluenceFromPersia|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_acva_InfluenceFromRome_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|acva:InfluenceFromRome|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|acva:InfluenceFromRome|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_acva_Iraq_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|acva:Iraq|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|acva:Iraq|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_acva_Islam_Education_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|acva:Islam_Education|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|acva:Islam_Education|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_acva_Islam_branches_and_schools_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|acva:Islam_branches_and_schools|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|acva:Islam_branches_and_schools|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_acva_Islamic_law_system_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|acva:Islamic_law_system|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|acva:Islamic_law_system|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_acva_Jordan_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|acva:Jordan|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|acva:Jordan|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_acva_Kuwait_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|acva:Kuwait|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|acva:Kuwait|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_acva_Lebanon_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|acva:Lebanon|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|acva:Lebanon|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_acva_Libya_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|acva:Libya|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|acva:Libya|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_acva_Mauritania_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|acva:Mauritania|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|acva:Mauritania|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_acva_Mesopotamia_civilization_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|acva:Mesopotamia_civilization|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|acva:Mesopotamia_civilization|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_acva_Morocco_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|acva:Morocco|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|acva:Morocco|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_acva_Oman_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|acva:Oman|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|acva:Oman|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_acva_Palestine_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|acva:Palestine|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|acva:Palestine|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_acva_Qatar_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|acva:Qatar|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|acva:Qatar|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_acva_Saudi_Arabia_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|acva:Saudi_Arabia|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|acva:Saudi_Arabia|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_acva_Somalia_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|acva:Somalia|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|acva:Somalia|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_acva_Sudan_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|acva:Sudan|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|acva:Sudan|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_acva_Syria_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|acva:Syria|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|acva:Syria|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_acva_Tunisia_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|acva:Tunisia|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|acva:Tunisia|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_acva_United_Arab_Emirates_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|acva:United_Arab_Emirates|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|acva:United_Arab_Emirates|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_acva_Yemen_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|acva:Yemen|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|acva:Yemen|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_acva_communication_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|acva:communication|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|acva:communication|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_acva_computer_and_phone_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|acva:computer_and_phone|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|acva:computer_and_phone|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_acva_daily_life_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|acva:daily_life|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|acva:daily_life|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_acva_entertainment_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|acva:entertainment|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|acva:entertainment|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_alghafa_mcq_exams_test_ar_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|alghafa:mcq_exams_test_ar|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|alghafa:mcq_exams_test_ar|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_alghafa_meta_ar_dialects_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|alghafa:meta_ar_dialects|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|alghafa:meta_ar_dialects|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_alghafa_meta_ar_msa_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|alghafa:meta_ar_msa|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|alghafa:meta_ar_msa|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_alghafa_multiple_choice_facts_truefalse_balanced_task_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|alghafa:multiple_choice_facts_truefalse_balanced_task|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|alghafa:multiple_choice_facts_truefalse_balanced_task|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_alghafa_multiple_choice_grounded_statement_soqal_task_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|alghafa:multiple_choice_grounded_statement_soqal_task|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|alghafa:multiple_choice_grounded_statement_soqal_task|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_alghafa_multiple_choice_grounded_statement_xglue_mlqa_task_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|alghafa:multiple_choice_grounded_statement_xglue_mlqa_task|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|alghafa:multiple_choice_grounded_statement_xglue_mlqa_task|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_alghafa_multiple_choice_rating_sentiment_no_neutral_task_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|alghafa:multiple_choice_rating_sentiment_no_neutral_task|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|alghafa:multiple_choice_rating_sentiment_no_neutral_task|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_alghafa_multiple_choice_rating_sentiment_task_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|alghafa:multiple_choice_rating_sentiment_task|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|alghafa:multiple_choice_rating_sentiment_task|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_alghafa_multiple_choice_sentiment_task_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|alghafa:multiple_choice_sentiment_task|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|alghafa:multiple_choice_sentiment_task|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_arabic_exams_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|arabic_exams|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|arabic_exams|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_arabic_mmlu_abstract_algebra_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|arabic_mmlu:abstract_algebra|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|arabic_mmlu:abstract_algebra|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_arabic_mmlu_anatomy_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|arabic_mmlu:anatomy|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|arabic_mmlu:anatomy|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_arabic_mmlu_astronomy_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|arabic_mmlu:astronomy|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|arabic_mmlu:astronomy|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_arabic_mmlu_business_ethics_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|arabic_mmlu:business_ethics|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|arabic_mmlu:business_ethics|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_arabic_mmlu_clinical_knowledge_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|arabic_mmlu:clinical_knowledge|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|arabic_mmlu:clinical_knowledge|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_arabic_mmlu_college_biology_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|arabic_mmlu:college_biology|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|arabic_mmlu:college_biology|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_arabic_mmlu_college_chemistry_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|arabic_mmlu:college_chemistry|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|arabic_mmlu:college_chemistry|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_arabic_mmlu_college_computer_science_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|arabic_mmlu:college_computer_science|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|arabic_mmlu:college_computer_science|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_arabic_mmlu_college_mathematics_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|arabic_mmlu:college_mathematics|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|arabic_mmlu:college_mathematics|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_arabic_mmlu_college_medicine_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|arabic_mmlu:college_medicine|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|arabic_mmlu:college_medicine|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_arabic_mmlu_college_physics_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|arabic_mmlu:college_physics|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|arabic_mmlu:college_physics|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_arabic_mmlu_computer_security_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|arabic_mmlu:computer_security|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|arabic_mmlu:computer_security|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_arabic_mmlu_conceptual_physics_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|arabic_mmlu:conceptual_physics|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|arabic_mmlu:conceptual_physics|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_arabic_mmlu_econometrics_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|arabic_mmlu:econometrics|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|arabic_mmlu:econometrics|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_arabic_mmlu_electrical_engineering_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|arabic_mmlu:electrical_engineering|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|arabic_mmlu:electrical_engineering|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_arabic_mmlu_elementary_mathematics_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|arabic_mmlu:elementary_mathematics|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|arabic_mmlu:elementary_mathematics|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_arabic_mmlu_formal_logic_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|arabic_mmlu:formal_logic|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|arabic_mmlu:formal_logic|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_arabic_mmlu_global_facts_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|arabic_mmlu:global_facts|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|arabic_mmlu:global_facts|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_arabic_mmlu_high_school_biology_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|arabic_mmlu:high_school_biology|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|arabic_mmlu:high_school_biology|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_arabic_mmlu_high_school_chemistry_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|arabic_mmlu:high_school_chemistry|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|arabic_mmlu:high_school_chemistry|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_arabic_mmlu_high_school_computer_science_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|arabic_mmlu:high_school_computer_science|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|arabic_mmlu:high_school_computer_science|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_arabic_mmlu_high_school_european_history_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|arabic_mmlu:high_school_european_history|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|arabic_mmlu:high_school_european_history|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_arabic_mmlu_high_school_geography_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|arabic_mmlu:high_school_geography|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|arabic_mmlu:high_school_geography|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_arabic_mmlu_high_school_government_and_politics_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|arabic_mmlu:high_school_government_and_politics|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|arabic_mmlu:high_school_government_and_politics|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_arabic_mmlu_high_school_macroeconomics_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|arabic_mmlu:high_school_macroeconomics|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|arabic_mmlu:high_school_macroeconomics|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_arabic_mmlu_high_school_mathematics_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|arabic_mmlu:high_school_mathematics|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|arabic_mmlu:high_school_mathematics|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_arabic_mmlu_high_school_microeconomics_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|arabic_mmlu:high_school_microeconomics|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|arabic_mmlu:high_school_microeconomics|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_arabic_mmlu_high_school_physics_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|arabic_mmlu:high_school_physics|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|arabic_mmlu:high_school_physics|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_arabic_mmlu_high_school_psychology_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|arabic_mmlu:high_school_psychology|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|arabic_mmlu:high_school_psychology|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_arabic_mmlu_high_school_statistics_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|arabic_mmlu:high_school_statistics|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|arabic_mmlu:high_school_statistics|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_arabic_mmlu_high_school_us_history_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|arabic_mmlu:high_school_us_history|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|arabic_mmlu:high_school_us_history|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_arabic_mmlu_high_school_world_history_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|arabic_mmlu:high_school_world_history|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|arabic_mmlu:high_school_world_history|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_arabic_mmlu_human_aging_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|arabic_mmlu:human_aging|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|arabic_mmlu:human_aging|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_arabic_mmlu_human_sexuality_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|arabic_mmlu:human_sexuality|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|arabic_mmlu:human_sexuality|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_arabic_mmlu_international_law_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|arabic_mmlu:international_law|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|arabic_mmlu:international_law|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_arabic_mmlu_jurisprudence_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|arabic_mmlu:jurisprudence|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|arabic_mmlu:jurisprudence|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_arabic_mmlu_logical_fallacies_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|arabic_mmlu:logical_fallacies|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|arabic_mmlu:logical_fallacies|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_arabic_mmlu_machine_learning_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|arabic_mmlu:machine_learning|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|arabic_mmlu:machine_learning|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_arabic_mmlu_management_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|arabic_mmlu:management|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|arabic_mmlu:management|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_arabic_mmlu_marketing_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|arabic_mmlu:marketing|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|arabic_mmlu:marketing|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_arabic_mmlu_medical_genetics_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|arabic_mmlu:medical_genetics|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|arabic_mmlu:medical_genetics|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_arabic_mmlu_miscellaneous_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|arabic_mmlu:miscellaneous|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|arabic_mmlu:miscellaneous|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_arabic_mmlu_moral_disputes_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|arabic_mmlu:moral_disputes|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|arabic_mmlu:moral_disputes|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_arabic_mmlu_moral_scenarios_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|arabic_mmlu:moral_scenarios|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|arabic_mmlu:moral_scenarios|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_arabic_mmlu_nutrition_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|arabic_mmlu:nutrition|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|arabic_mmlu:nutrition|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_arabic_mmlu_philosophy_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|arabic_mmlu:philosophy|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|arabic_mmlu:philosophy|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_arabic_mmlu_prehistory_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|arabic_mmlu:prehistory|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|arabic_mmlu:prehistory|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_arabic_mmlu_professional_accounting_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|arabic_mmlu:professional_accounting|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|arabic_mmlu:professional_accounting|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_arabic_mmlu_professional_law_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|arabic_mmlu:professional_law|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|arabic_mmlu:professional_law|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_arabic_mmlu_professional_medicine_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|arabic_mmlu:professional_medicine|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|arabic_mmlu:professional_medicine|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_arabic_mmlu_professional_psychology_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|arabic_mmlu:professional_psychology|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|arabic_mmlu:professional_psychology|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_arabic_mmlu_public_relations_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|arabic_mmlu:public_relations|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|arabic_mmlu:public_relations|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_arabic_mmlu_security_studies_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|arabic_mmlu:security_studies|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|arabic_mmlu:security_studies|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_arabic_mmlu_sociology_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|arabic_mmlu:sociology|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|arabic_mmlu:sociology|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_arabic_mmlu_us_foreign_policy_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|arabic_mmlu:us_foreign_policy|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|arabic_mmlu:us_foreign_policy|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_arabic_mmlu_virology_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|arabic_mmlu:virology|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|arabic_mmlu:virology|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_arabic_mmlu_world_religions_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|arabic_mmlu:world_religions|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|arabic_mmlu:world_religions|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_arc_challenge_okapi_ar_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|arc_challenge_okapi_ar|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|arc_challenge_okapi_ar|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_arc_easy_ar_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|arc_easy_ar|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|arc_easy_ar|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_boolq_ar_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|boolq_ar|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|boolq_ar|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_copa_ext_ar_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|copa_ext_ar|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|copa_ext_ar|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_hellaswag_okapi_ar_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|hellaswag_okapi_ar|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|hellaswag_okapi_ar|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_openbook_qa_ext_ar_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|openbook_qa_ext_ar|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|openbook_qa_ext_ar|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_piqa_ar_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|piqa_ar|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|piqa_ar|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_race_ar_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|race_ar|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|race_ar|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_sciq_ar_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|sciq_ar|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|sciq_ar|0_2024-05-17T21-28-15.479048.parquet' - config_name: community_toxigen_ar_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_community|toxigen_ar|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_community|toxigen_ar|0_2024-05-17T21-28-15.479048.parquet' - config_name: lighteval_xstory_cloze_ar_0 data_files: - split: 2024_05_17T21_28_15.479048 path: - '**/details_lighteval|xstory_cloze:ar|0_2024-05-17T21-28-15.479048.parquet' - split: latest path: - '**/details_lighteval|xstory_cloze:ar|0_2024-05-17T21-28-15.479048.parquet' - config_name: results data_files: - split: 2024_05_17T21_28_15.479048 path: - results_2024-05-17T21-28-15.479048.parquet - split: latest path: - results_2024-05-17T21-28-15.479048.parquet --- # Dataset Card for Evaluation run of 01-ai/Yi-1.5-9B-Chat <!-- Provide a quick summary of the dataset. --> Dataset automatically created during the evaluation run of model [01-ai/Yi-1.5-9B-Chat](https://huggingface.co/01-ai/Yi-1.5-9B-Chat). The dataset is composed of 136 configuration, each one coresponding to one of the evaluated task. The dataset has been created from 1 run(s). Each run can be found as a specific split in each configuration, the split being named using the timestamp of the run.The "train" split is always pointing to the latest results. An additional configuration "results" store all the aggregated results of the run. To load the details from a run, you can for instance do the following: ```python from datasets import load_dataset data = load_dataset("OALL/details_01-ai__Yi-1.5-9B-Chat", "lighteval_xstory_cloze_ar_0", split="train") ``` ## Latest results These are the [latest results from run 2024-05-17T21:28:15.479048](https://huggingface.co/datasets/OALL/details_01-ai__Yi-1.5-9B-Chat/blob/main/results_2024-05-17T21-28-15.479048.json)(note that their might be results for other tasks in the repos if successive evals didn't cover the same tasks. You find each in the results and the "latest" split for each eval): ```python { "all": { "acc_norm": 0.397794446745894, "acc_norm_stderr": 0.03764570531373537, "acc": 0.5413633355393779, "acc_stderr": 0.01282302034016982 }, "community|acva:Algeria|0": { "acc_norm": 0.5333333333333333, "acc_norm_stderr": 0.03581804596782232 }, "community|acva:Ancient_Egypt|0": { "acc_norm": 0.24761904761904763, "acc_norm_stderr": 0.0243582507291411 }, "community|acva:Arab_Empire|0": { "acc_norm": 0.5660377358490566, "acc_norm_stderr": 0.030503292013342596 }, "community|acva:Arabic_Architecture|0": { "acc_norm": 0.5435897435897435, "acc_norm_stderr": 0.03576123096991215 }, "community|acva:Arabic_Art|0": { "acc_norm": 0.47692307692307695, "acc_norm_stderr": 0.0358596530894741 }, "community|acva:Arabic_Astronomy|0": { "acc_norm": 0.4666666666666667, "acc_norm_stderr": 0.03581804596782233 }, "community|acva:Arabic_Calligraphy|0": { "acc_norm": 0.6235294117647059, "acc_norm_stderr": 0.030400248938906704 }, "community|acva:Arabic_Ceremony|0": { "acc_norm": 0.5945945945945946, "acc_norm_stderr": 0.03619481276442171 }, "community|acva:Arabic_Clothing|0": { "acc_norm": 0.5333333333333333, "acc_norm_stderr": 0.03581804596782232 }, "community|acva:Arabic_Culture|0": { "acc_norm": 0.49230769230769234, "acc_norm_stderr": 0.03589365940635213 }, "community|acva:Arabic_Food|0": { "acc_norm": 0.5692307692307692, "acc_norm_stderr": 0.03555213252058761 }, "community|acva:Arabic_Funeral|0": { "acc_norm": 0.4, "acc_norm_stderr": 0.050529115263991134 }, "community|acva:Arabic_Geography|0": { "acc_norm": 0.6206896551724138, "acc_norm_stderr": 0.04043461861916747 }, "community|acva:Arabic_History|0": { "acc_norm": 0.2923076923076923, "acc_norm_stderr": 0.032654383937495125 }, "community|acva:Arabic_Language_Origin|0": { "acc_norm": 0.6105263157894737, "acc_norm_stderr": 0.05029529117145395 }, "community|acva:Arabic_Literature|0": { "acc_norm": 0.5379310344827586, "acc_norm_stderr": 0.041546596717075474 }, "community|acva:Arabic_Math|0": { "acc_norm": 0.3230769230769231, "acc_norm_stderr": 0.03357544396403133 }, "community|acva:Arabic_Medicine|0": { "acc_norm": 0.6758620689655173, "acc_norm_stderr": 0.03900432069185555 }, "community|acva:Arabic_Music|0": { "acc_norm": 0.302158273381295, "acc_norm_stderr": 0.03908914479291562 }, "community|acva:Arabic_Ornament|0": { "acc_norm": 0.7384615384615385, "acc_norm_stderr": 0.0315522880274276 }, "community|acva:Arabic_Philosophy|0": { "acc_norm": 0.5793103448275863, "acc_norm_stderr": 0.0411391498118926 }, "community|acva:Arabic_Physics_and_Chemistry|0": { "acc_norm": 0.5384615384615384, "acc_norm_stderr": 0.03579154352544572 }, "community|acva:Arabic_Wedding|0": { "acc_norm": 0.48205128205128206, "acc_norm_stderr": 0.035874770987738294 }, "community|acva:Bahrain|0": { "acc_norm": 0.5777777777777777, "acc_norm_stderr": 0.07446027270295806 }, "community|acva:Comoros|0": { "acc_norm": 0.4, "acc_norm_stderr": 0.07385489458759965 }, "community|acva:Egypt_modern|0": { "acc_norm": 0.5263157894736842, "acc_norm_stderr": 0.05149958471474543 }, "community|acva:InfluenceFromAncientEgypt|0": { "acc_norm": 0.7128205128205128, "acc_norm_stderr": 0.03248373338539887 }, "community|acva:InfluenceFromByzantium|0": { "acc_norm": 0.7241379310344828, "acc_norm_stderr": 0.03724563619774632 }, "community|acva:InfluenceFromChina|0": { "acc_norm": 0.3333333333333333, "acc_norm_stderr": 0.033844872171120644 }, "community|acva:InfluenceFromGreece|0": { "acc_norm": 0.6717948717948717, "acc_norm_stderr": 0.033712437824137076 }, "community|acva:InfluenceFromIslam|0": { "acc_norm": 0.4482758620689655, "acc_norm_stderr": 0.04144311810878151 }, "community|acva:InfluenceFromPersia|0": { "acc_norm": 0.8114285714285714, "acc_norm_stderr": 0.029654354112075433 }, "community|acva:InfluenceFromRome|0": { "acc_norm": 0.5846153846153846, "acc_norm_stderr": 0.035380132805750295 }, "community|acva:Iraq|0": { "acc_norm": 0.5411764705882353, "acc_norm_stderr": 0.0543691634273002 }, "community|acva:Islam_Education|0": { "acc_norm": 0.4512820512820513, "acc_norm_stderr": 0.03572709860318392 }, "community|acva:Islam_branches_and_schools|0": { "acc_norm": 0.4514285714285714, "acc_norm_stderr": 0.03772562898529836 }, "community|acva:Islamic_law_system|0": { "acc_norm": 0.4461538461538462, "acc_norm_stderr": 0.03568913546569232 }, "community|acva:Jordan|0": { "acc_norm": 0.35555555555555557, "acc_norm_stderr": 0.07216392363431012 }, "community|acva:Kuwait|0": { "acc_norm": 0.28888888888888886, "acc_norm_stderr": 0.06832943242540508 }, "community|acva:Lebanon|0": { "acc_norm": 0.4444444444444444, "acc_norm_stderr": 0.07491109582924915 }, "community|acva:Libya|0": { "acc_norm": 0.5777777777777777, "acc_norm_stderr": 0.07446027270295806 }, "community|acva:Mauritania|0": { "acc_norm": 0.5111111111111111, "acc_norm_stderr": 0.07535922203472523 }, "community|acva:Mesopotamia_civilization|0": { "acc_norm": 0.5612903225806452, "acc_norm_stderr": 0.03998729476451436 }, "community|acva:Morocco|0": { "acc_norm": 0.28888888888888886, "acc_norm_stderr": 0.06832943242540507 }, "community|acva:Oman|0": { "acc_norm": 0.28888888888888886, "acc_norm_stderr": 0.06832943242540508 }, "community|acva:Palestine|0": { "acc_norm": 0.3411764705882353, "acc_norm_stderr": 0.051729042973619264 }, "community|acva:Qatar|0": { "acc_norm": 0.4444444444444444, "acc_norm_stderr": 0.07491109582924915 }, "community|acva:Saudi_Arabia|0": { "acc_norm": 0.3435897435897436, "acc_norm_stderr": 0.03409627301409855 }, "community|acva:Somalia|0": { "acc_norm": 0.26666666666666666, "acc_norm_stderr": 0.06666666666666665 }, "community|acva:Sudan|0": { "acc_norm": 0.4222222222222222, "acc_norm_stderr": 0.07446027270295806 }, "community|acva:Syria|0": { "acc_norm": 0.37777777777777777, "acc_norm_stderr": 0.07309112127323451 }, "community|acva:Tunisia|0": { "acc_norm": 0.35555555555555557, "acc_norm_stderr": 0.07216392363431012 }, "community|acva:United_Arab_Emirates|0": { "acc_norm": 0.4235294117647059, "acc_norm_stderr": 0.05391265523477461 }, "community|acva:Yemen|0": { "acc_norm": 0.3, "acc_norm_stderr": 0.15275252316519464 }, "community|acva:communication|0": { "acc_norm": 0.49175824175824173, "acc_norm_stderr": 0.026239628591083888 }, "community|acva:computer_and_phone|0": { "acc_norm": 0.5559322033898305, "acc_norm_stderr": 0.02897756513294154 }, "community|acva:daily_life|0": { "acc_norm": 0.2878338278931751, "acc_norm_stderr": 0.024699715357282315 }, "community|acva:entertainment|0": { "acc_norm": 0.3559322033898305, "acc_norm_stderr": 0.027923880374505525 }, "community|alghafa:mcq_exams_test_ar|0": { "acc_norm": 0.2800718132854578, "acc_norm_stderr": 0.019043286203795345 }, "community|alghafa:meta_ar_dialects|0": { "acc_norm": 0.2895273401297498, "acc_norm_stderr": 0.006175370293841651 }, "community|alghafa:meta_ar_msa|0": { "acc_norm": 0.32849162011173183, "acc_norm_stderr": 0.015707935398496457 }, "community|alghafa:multiple_choice_facts_truefalse_balanced_task|0": { "acc_norm": 0.5333333333333333, "acc_norm_stderr": 0.05799451149344531 }, "community|alghafa:multiple_choice_grounded_statement_soqal_task|0": { "acc_norm": 0.56, "acc_norm_stderr": 0.040665603096078445 }, "community|alghafa:multiple_choice_grounded_statement_xglue_mlqa_task|0": { "acc_norm": 0.38, "acc_norm_stderr": 0.039764406869602295 }, "community|alghafa:multiple_choice_rating_sentiment_no_neutral_task|0": { "acc_norm": 0.7972482801751094, "acc_norm_stderr": 0.004496731917745599 }, "community|alghafa:multiple_choice_rating_sentiment_task|0": { "acc_norm": 0.5279399499582986, "acc_norm_stderr": 0.006448111196626818 }, "community|alghafa:multiple_choice_sentiment_task|0": { "acc_norm": 0.3843023255813954, "acc_norm_stderr": 0.011732277442725819 }, "community|arabic_exams|0": { "acc_norm": 0.2811918063314711, "acc_norm_stderr": 0.019418936671758837 }, "community|arabic_mmlu:abstract_algebra|0": { "acc_norm": 0.34, "acc_norm_stderr": 0.047609522856952365 }, "community|arabic_mmlu:anatomy|0": { "acc_norm": 0.28888888888888886, "acc_norm_stderr": 0.03915450630414251 }, "community|arabic_mmlu:astronomy|0": { "acc_norm": 0.3223684210526316, "acc_norm_stderr": 0.038035102483515854 }, "community|arabic_mmlu:business_ethics|0": { "acc_norm": 0.36, "acc_norm_stderr": 0.04824181513244218 }, "community|arabic_mmlu:clinical_knowledge|0": { "acc_norm": 0.2830188679245283, "acc_norm_stderr": 0.0277242364927009 }, "community|arabic_mmlu:college_biology|0": { "acc_norm": 0.2916666666666667, "acc_norm_stderr": 0.03800968060554857 }, "community|arabic_mmlu:college_chemistry|0": { "acc_norm": 0.2, "acc_norm_stderr": 0.040201512610368445 }, "community|arabic_mmlu:college_computer_science|0": { "acc_norm": 0.2, "acc_norm_stderr": 0.04020151261036843 }, "community|arabic_mmlu:college_mathematics|0": { "acc_norm": 0.27, "acc_norm_stderr": 0.044619604333847415 }, "community|arabic_mmlu:college_medicine|0": { "acc_norm": 0.23699421965317918, "acc_norm_stderr": 0.03242414757483099 }, "community|arabic_mmlu:college_physics|0": { "acc_norm": 0.24509803921568626, "acc_norm_stderr": 0.04280105837364396 }, "community|arabic_mmlu:computer_security|0": { "acc_norm": 0.45, "acc_norm_stderr": 0.05 }, "community|arabic_mmlu:conceptual_physics|0": { "acc_norm": 0.2680851063829787, "acc_norm_stderr": 0.028957342788342347 }, "community|arabic_mmlu:econometrics|0": { "acc_norm": 0.30701754385964913, "acc_norm_stderr": 0.04339138322579861 }, "community|arabic_mmlu:electrical_engineering|0": { "acc_norm": 0.41379310344827586, "acc_norm_stderr": 0.04104269211806232 }, "community|arabic_mmlu:elementary_mathematics|0": { "acc_norm": 0.36772486772486773, "acc_norm_stderr": 0.02483383982556242 }, "community|arabic_mmlu:formal_logic|0": { "acc_norm": 0.2619047619047619, "acc_norm_stderr": 0.03932537680392871 }, "community|arabic_mmlu:global_facts|0": { "acc_norm": 0.31, "acc_norm_stderr": 0.04648231987117316 }, "community|arabic_mmlu:high_school_biology|0": { "acc_norm": 0.3258064516129032, "acc_norm_stderr": 0.026662010578567104 }, "community|arabic_mmlu:high_school_chemistry|0": { "acc_norm": 0.2955665024630542, "acc_norm_stderr": 0.032104944337514575 }, "community|arabic_mmlu:high_school_computer_science|0": { "acc_norm": 0.42, "acc_norm_stderr": 0.049604496374885836 }, "community|arabic_mmlu:high_school_european_history|0": { "acc_norm": 0.23636363636363636, "acc_norm_stderr": 0.033175059300091805 }, "community|arabic_mmlu:high_school_geography|0": { "acc_norm": 0.32323232323232326, "acc_norm_stderr": 0.03332299921070644 }, "community|arabic_mmlu:high_school_government_and_politics|0": { "acc_norm": 0.30569948186528495, "acc_norm_stderr": 0.033248379397581594 }, "community|arabic_mmlu:high_school_macroeconomics|0": { "acc_norm": 0.3230769230769231, "acc_norm_stderr": 0.023710888501970562 }, "community|arabic_mmlu:high_school_mathematics|0": { "acc_norm": 0.3074074074074074, "acc_norm_stderr": 0.028133252578815632 }, "community|arabic_mmlu:high_school_microeconomics|0": { "acc_norm": 0.2647058823529412, "acc_norm_stderr": 0.028657491285071963 }, "community|arabic_mmlu:high_school_physics|0": { "acc_norm": 0.2847682119205298, "acc_norm_stderr": 0.03684881521389023 }, "community|arabic_mmlu:high_school_psychology|0": { "acc_norm": 0.25688073394495414, "acc_norm_stderr": 0.018732492928342462 }, "community|arabic_mmlu:high_school_statistics|0": { "acc_norm": 0.24074074074074073, "acc_norm_stderr": 0.02915752218460562 }, "community|arabic_mmlu:high_school_us_history|0": { "acc_norm": 0.22058823529411764, "acc_norm_stderr": 0.029102254389674065 }, "community|arabic_mmlu:high_school_world_history|0": { "acc_norm": 0.26582278481012656, "acc_norm_stderr": 0.02875679962965834 }, "community|arabic_mmlu:human_aging|0": { "acc_norm": 0.29596412556053814, "acc_norm_stderr": 0.0306365913486998 }, "community|arabic_mmlu:human_sexuality|0": { "acc_norm": 0.2824427480916031, "acc_norm_stderr": 0.03948406125768362 }, "community|arabic_mmlu:international_law|0": { "acc_norm": 0.4214876033057851, "acc_norm_stderr": 0.04507732278775094 }, "community|arabic_mmlu:jurisprudence|0": { "acc_norm": 0.3148148148148148, "acc_norm_stderr": 0.04489931073591312 }, "community|arabic_mmlu:logical_fallacies|0": { "acc_norm": 0.3067484662576687, "acc_norm_stderr": 0.036230899157241474 }, "community|arabic_mmlu:machine_learning|0": { "acc_norm": 0.26785714285714285, "acc_norm_stderr": 0.04203277291467762 }, "community|arabic_mmlu:management|0": { "acc_norm": 0.23300970873786409, "acc_norm_stderr": 0.041858325989283136 }, "community|arabic_mmlu:marketing|0": { "acc_norm": 0.3888888888888889, "acc_norm_stderr": 0.031937057262002924 }, "community|arabic_mmlu:medical_genetics|0": { "acc_norm": 0.28, "acc_norm_stderr": 0.04512608598542128 }, "community|arabic_mmlu:miscellaneous|0": { "acc_norm": 0.32567049808429116, "acc_norm_stderr": 0.016757989458549682 }, "community|arabic_mmlu:moral_disputes|0": { "acc_norm": 0.3959537572254335, "acc_norm_stderr": 0.02632981334194624 }, "community|arabic_mmlu:moral_scenarios|0": { "acc_norm": 0.23798882681564246, "acc_norm_stderr": 0.014242630070574901 }, "community|arabic_mmlu:nutrition|0": { "acc_norm": 0.3300653594771242, "acc_norm_stderr": 0.026925654653615693 }, "community|arabic_mmlu:philosophy|0": { "acc_norm": 0.34726688102893893, "acc_norm_stderr": 0.027040745502307336 }, "community|arabic_mmlu:prehistory|0": { "acc_norm": 0.3425925925925926, "acc_norm_stderr": 0.026406145973625665 }, "community|arabic_mmlu:professional_accounting|0": { "acc_norm": 0.30851063829787234, "acc_norm_stderr": 0.027553366165101376 }, "community|arabic_mmlu:professional_law|0": { "acc_norm": 0.27249022164276404, "acc_norm_stderr": 0.011371658294311535 }, "community|arabic_mmlu:professional_medicine|0": { "acc_norm": 0.1875, "acc_norm_stderr": 0.023709788253811766 }, "community|arabic_mmlu:professional_psychology|0": { "acc_norm": 0.30718954248366015, "acc_norm_stderr": 0.018663359671463667 }, "community|arabic_mmlu:public_relations|0": { "acc_norm": 0.24545454545454545, "acc_norm_stderr": 0.041220665028782834 }, "community|arabic_mmlu:security_studies|0": { "acc_norm": 0.2979591836734694, "acc_norm_stderr": 0.029279567411065684 }, "community|arabic_mmlu:sociology|0": { "acc_norm": 0.3880597014925373, "acc_norm_stderr": 0.03445789964362749 }, "community|arabic_mmlu:us_foreign_policy|0": { "acc_norm": 0.42, "acc_norm_stderr": 0.049604496374885836 }, "community|arabic_mmlu:virology|0": { "acc_norm": 0.3132530120481928, "acc_norm_stderr": 0.03610805018031024 }, "community|arabic_mmlu:world_religions|0": { "acc_norm": 0.27485380116959063, "acc_norm_stderr": 0.034240429246915824 }, "community|arc_challenge_okapi_ar|0": { "acc_norm": 0.3293103448275862, "acc_norm_stderr": 0.013804534699579278 }, "community|arc_easy_ar|0": { "acc_norm": 0.3236040609137056, "acc_norm_stderr": 0.009624443258161308 }, "community|boolq_ar|0": { "acc_norm": 0.7085889570552147, "acc_norm_stderr": 0.007959907341375319 }, "community|copa_ext_ar|0": { "acc_norm": 0.5222222222222223, "acc_norm_stderr": 0.05294752255076824 }, "community|hellaswag_okapi_ar|0": { "acc_norm": 0.26529277069021917, "acc_norm_stderr": 0.004610363799431674 }, "community|openbook_qa_ext_ar|0": { "acc_norm": 0.36767676767676766, "acc_norm_stderr": 0.02169397769879489 }, "community|piqa_ar|0": { "acc_norm": 0.5368248772504092, "acc_norm_stderr": 0.01165000722527945 }, "community|race_ar|0": { "acc_norm": 0.3327246906066139, "acc_norm_stderr": 0.006712119702934689 }, "community|sciq_ar|0": { "acc_norm": 0.48542713567839196, "acc_norm_stderr": 0.01585229964546976 }, "community|toxigen_ar|0": { "acc_norm": 0.4834224598930481, "acc_norm_stderr": 0.016351505086413663 }, "lighteval|xstory_cloze:ar|0": { "acc": 0.5413633355393779, "acc_stderr": 0.01282302034016982 }, "community|acva:_average|0": { "acc_norm": 0.47778978781322307, "acc_norm_stderr": 0.047483596881547824 }, "community|alghafa:_average|0": { "acc_norm": 0.4534349625083418, "acc_norm_stderr": 0.022447581545817528 }, "community|arabic_mmlu:_average|0": { "acc_norm": 0.30303933090891266, "acc_norm_stderr": 0.03413101309881009 } } ``` ## Dataset Details ### Dataset Description <!-- Provide a longer summary of what this dataset is. --> - **Curated by:** [More Information Needed] - **Funded by [optional]:** [More Information Needed] - **Shared by [optional]:** [More Information Needed] - **Language(s) (NLP):** [More Information Needed] - **License:** [More Information Needed] ### Dataset Sources [optional] <!-- Provide the basic links for the dataset. --> - **Repository:** [More Information Needed] - **Paper [optional]:** [More Information Needed] - **Demo [optional]:** [More Information Needed] ## Uses <!-- Address questions around how the dataset is intended to be used. --> ### Direct Use <!-- This section describes suitable use cases for the dataset. --> [More Information Needed] ### Out-of-Scope Use <!-- This section addresses misuse, malicious use, and uses that the dataset will not work well for. --> [More Information Needed] ## Dataset Structure <!-- This section provides a description of the dataset fields, and additional information about the dataset structure such as criteria used to create the splits, relationships between data points, etc. --> [More Information Needed] ## Dataset Creation ### Curation Rationale <!-- Motivation for the creation of this dataset. --> [More Information Needed] ### Source Data <!-- This section describes the source data (e.g. news text and headlines, social media posts, translated sentences, ...). --> #### Data Collection and Processing <!-- This section describes the data collection and processing process such as data selection criteria, filtering and normalization methods, tools and libraries used, etc. --> [More Information Needed] #### Who are the source data producers? <!-- This section describes the people or systems who originally created the data. It should also include self-reported demographic or identity information for the source data creators if this information is available. --> [More Information Needed] ### Annotations [optional] <!-- If the dataset contains annotations which are not part of the initial data collection, use this section to describe them. --> #### Annotation process <!-- This section describes the annotation process such as annotation tools used in the process, the amount of data annotated, annotation guidelines provided to the annotators, interannotator statistics, annotation validation, etc. --> [More Information Needed] #### Who are the annotators? <!-- This section describes the people or systems who created the annotations. --> [More Information Needed] #### Personal and Sensitive Information <!-- State whether the dataset contains data that might be considered personal, sensitive, or private (e.g., data that reveals addresses, uniquely identifiable names or aliases, racial or ethnic origins, sexual orientations, religious beliefs, political opinions, financial or health data, etc.). If efforts were made to anonymize the data, describe the anonymization process. --> [More Information Needed] ## Bias, Risks, and Limitations <!-- This section is meant to convey both technical and sociotechnical limitations. --> [More Information Needed] ### Recommendations <!-- This section is meant to convey recommendations with respect to the bias, risk, and technical limitations. --> Users should be made aware of the risks, biases and limitations of the dataset. More information needed for further recommendations. ## Citation [optional] <!-- If there is a paper or blog post introducing the dataset, the APA and Bibtex information for that should go in this section. --> **BibTeX:** [More Information Needed] **APA:** [More Information Needed] ## Glossary [optional] <!-- If relevant, include terms and calculations in this section that can help readers understand the dataset or dataset card. --> [More Information Needed] ## More Information [optional] [More Information Needed] ## Dataset Card Authors [optional] [More Information Needed] ## Dataset Card Contact [More Information Needed]

该数据集是在评估模型01-ai/Yi-1.5-9B-Chat时自动创建的。数据集由136个配置组成,每个配置对应一个评估任务。数据集是从1次运行中创建的,每次运行可以在每个配置中找到特定的分割,分割名称使用运行的时间戳。"train"分割始终指向最新的结果。此外,还有一个名为"results"的配置存储了所有运行的聚合结果。
提供机构:
OALL
原始信息汇总

数据集概述

数据集名称

  • Evaluation run of 01-ai/Yi-1.5-9B-Chat

数据集描述

  • 创建目的: 自动生成于模型01-ai/Yi-1.5-9B-Chat的评估运行过程中。
  • 数据集构成: 包含136个配置,每个配置对应一个评估任务。
  • 数据集来源: 从1次运行中创建,每个运行在每个配置中作为一个特定的分割存在,分割名称使用运行的时间戳。
  • 额外配置: 包含一个名为"results"的配置,存储所有运行的聚合结果。

数据集使用示例

python from datasets import load_dataset data = load_dataset("OALL/details_01-ai__Yi-1.5-9B-Chat", "lighteval_xstory_cloze_ar_0", split="train")

最新结果

  • 结果来源: 来自2024-05-17T21:28:15.479048的运行。
  • 结果内容: 包含多个任务的评估结果,每个任务的结果存储在相应的配置中。

数据集详细配置

配置列表

  • 包含多个社区和任务相关的配置,每个配置记录了特定任务的评估结果,如准确率(acc_norm)和标准误差(acc_norm_stderr)。

配置示例

  • community|acva:Algeria|0:
    • acc_norm: 0.5333333333333333
    • acc_norm_stderr: 0.03581804596782232

结果分析

  • 每个配置提供了特定任务的性能指标,有助于分析模型在不同任务上的表现。
搜集汇总
数据集介绍
main_image_url
构建方式
该数据集是围绕模型01-ai/Yi-1.5-9B-Chat的评估运行自动生成的产物。数据集由136个配置构成,每个配置对应一项被评估的任务,全面覆盖了模型在不同维度的表现。通过一次完整的运行流程创建,每次运行的结果以时间戳命名的分割形式存储于各配置中,而'train'分割则始终指向最新一次运行的评估结果。此外,还设有独立的'results'配置,用于汇总所有评估任务的聚合指标,从而构建出一个结构清晰、便于追溯的评估数据体系。
特点
该数据集最显著的特征在于其精细化的任务划分与结果追踪机制。136个配置分别对应不同评估任务,涵盖从阿拉伯语文化知识到多学科问答的广泛领域,如阿拉伯历史、医学、数学及情感分析等。每个配置下的分割命名采用运行时间戳,确保每次评估结果的可追溯性,而'train'分割的动态更新特性则使用户能便捷地获取最新评估数据。这种设计不仅支持对模型在特定任务上的表现进行细粒度分析,还通过聚合的'results'配置提供了全局视角的绩效概览。
使用方法
使用该数据集时,用户可通过Hugging Face的datasets库便捷地加载特定任务的评估详情。例如,采用load_dataset函数,指定数据集名称'OALL/details_01-ai__Yi-1.5-9B-Chat'、对应任务配置(如'lighteval_xstory_cloze_ar_0')以及所需分割(如'train'),即可获取该任务的最新评估结果。若要分析历史运行数据,则可根据具体的时间戳分割进行加载。此外,通过访问'results'配置,用户能够一键获取所有任务的聚合指标,从而高效地评估模型整体性能或进行跨任务对比分析。
背景与挑战
背景概述
该数据集源自对零一万物研发的Yi-1.5-9B-Chat模型在2024年5月进行的系统性评估,由Open Arabic LLM Leaderboard(OALL)平台自动生成并发布。其核心研究问题聚焦于多语言大语言模型在阿拉伯语及相关文化语境下的综合能力表现,涵盖从常识推理、情感分析到专业学科知识(如医学、法学、天文学)等136项细分任务。作为阿拉伯语大模型评测生态的重要基础设施,该数据集通过标准化评估流程,为跨语言模型在低资源语言场景下的性能对比提供了基准,推动了非英语语言自然语言处理研究的边界拓展。
当前挑战
当前数据集面临的挑战主要源于两方面。在领域问题层面,阿拉伯语作为形态丰富、方言众多的语言,其评测任务需兼顾现代标准阿拉伯语与各地域方言(如埃及、摩洛哥、沙特阿拉伯等)的差异,现有结果中部分子任务准确率仅约20%至30%,凸显模型对文化特定知识与地域性表达的泛化能力不足。构建过程中,数据集依赖自动化评估流水线,需处理136个配置项与多轮次运行结果的版本管理,确保不同时间戳下评测数据的一致性与可复现性;同时,社区贡献的评测任务(如acva和alghafa子集)覆盖领域广泛,但数据标注质量与难度平衡的控制仍是待解决的工程挑战。
常用场景
经典使用场景
在自然语言处理与多语言模型评估的交叉领域中,OALL/details_01-ai__Yi-1.5-9B-Chat 数据集作为一项专为阿拉伯语场景设计的精细评估资源,其经典使用场景集中在对大规模语言模型进行细粒度、多任务的能力诊断。该数据集通过136个独立配置,覆盖从阿拉伯文化常识、方言识别到学科知识问答(如阿拉伯语MMLU)等多样化任务,为研究者提供了一个系统化剖析模型在阿拉伯语语境下推理、记忆与文化理解能力的标准化平台。借助其结构化的评估框架,学者能够精确量化模型在特定子领域的表现差异,从而推动针对低资源语言模型优化的实证研究。
实际应用
在实际应用中,该数据集为阿拉伯语智能系统的开发与迭代提供了可靠的性能标尺。技术团队可依据其在情感分析、方言识别、多选问答等任务上的评估结果,针对性地优化客服机器人、教育辅导工具及内容审核系统的阿拉伯语处理模块。例如,通过分析模型在‘阿拉伯婚礼’或‘通信’等生活场景任务上的准确率,开发者能精准调整对话系统的文化敏感性,避免因知识缺失导致的交互失误。此外,数据集中的错误分布统计还可用于指导训练数据的补充采集,从而提升商业产品在阿拉伯市场的用户体验与适配度。
衍生相关工作
此数据集的衍生工作主要围绕阿拉伯语评估基准的构建与模型改进展开。受其启发,研究者开发了更精细的阿拉伯语文化知识图谱,并基于其评估结果提出了针对低资源语言的多任务联合训练策略,显著提升了模型在阿拉伯MMLU子任务上的表现。此外,该数据集催生了若干对比分析工作,深入探讨了预训练语料中阿拉伯语占比对下游任务准确率的影响,以及不同解码策略在方言识别上的鲁棒性差异。这些后续研究不仅验证了该评估框架的有效性,还推动了面向阿拉伯语的文化感知模型架构创新,形成了一条从诊断到优化的闭环研究链条。
以上内容由遇见数据集搜集并总结生成
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作