Name: matiss/P3-Latvian-QuickMT
Creator: matiss
Published: 2026-02-16 06:19:25
License: No description available

Download link:

https://hf-mirror.com/datasets/matiss/P3-Latvian-QuickMT

Resource overview:

--- dataset_info: - config_name: adversarial_qa_dbert_answer_the_following_q features: - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 10073371 num_examples: 10000 - name: validation num_bytes: 992047 num_examples: 1000 download_size: 2583136 dataset_size: 11065418 - config_name: adversarial_qa_dbert_based_on features: - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 9543041 num_examples: 10000 - name: validation num_bytes: 938956 num_examples: 1000 download_size: 2548825 dataset_size: 10481997 - config_name: adversarial_qa_dbert_generate_question features: - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 9935906 num_examples: 10000 - name: validation num_bytes: 981170 num_examples: 1000 - name: test num_bytes: 1040114 num_examples: 1000 download_size: 2317046 dataset_size: 11957190 - config_name: adversarial_qa_dbert_question_context_answer features: - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 9141988 num_examples: 10000 - name: validation num_bytes: 899724 num_examples: 1000 download_size: 2513875 dataset_size: 10041712 - config_name: adversarial_qa_dbert_tell_what_it_is features: - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 9543304 num_examples: 10000 - name: validation num_bytes: 939766 num_examples: 1000 download_size: 2550715 dataset_size: 10483070 - config_name: adversarial_qa_dbidaf_answer_the_following_q features: - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 9995769 num_examples: 10000 - name: validation num_bytes: 995527 num_examples: 1000 download_size: 2610594 dataset_size: 10991296 - config_name: adversarial_qa_dbidaf_based_on 
features: - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 9467160 num_examples: 10000 - name: validation num_bytes: 941873 num_examples: 1000 download_size: 2583831 dataset_size: 10409033 - config_name: adversarial_qa_dbidaf_generate_question features: - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 9922079 num_examples: 10000 - name: validation num_bytes: 984612 num_examples: 1000 - name: test num_bytes: 1023021 num_examples: 1000 download_size: 2347181 dataset_size: 11929712 - config_name: adversarial_qa_dbidaf_question_context_answer features: - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 9065166 num_examples: 10000 - name: validation num_bytes: 902890 num_examples: 1000 download_size: 2547566 dataset_size: 9968056 - config_name: adversarial_qa_dbidaf_tell_what_it_is features: - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 9469228 num_examples: 10000 - name: validation num_bytes: 943176 num_examples: 1000 download_size: 2595039 dataset_size: 10412404 - config_name: adversarial_qa_droberta_answer_the_following_q features: - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 9931301 num_examples: 10000 - name: validation num_bytes: 980683 num_examples: 1000 download_size: 2665783 dataset_size: 10911984 - config_name: adversarial_qa_droberta_based_on features: - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 9402830 num_examples: 10000 - name: validation num_bytes: 927020 num_examples: 1000 download_size: 2619552 dataset_size: 10329850 - config_name: adversarial_qa_droberta_generate_question features: - name: inputs_pretokenized dtype: string - name: 
targets_pretokenized dtype: string splits: - name: train num_bytes: 9778471 num_examples: 10000 - name: validation num_bytes: 973024 num_examples: 1000 - name: test num_bytes: 1066952 num_examples: 1000 download_size: 2426128 dataset_size: 11818447 - config_name: adversarial_qa_droberta_question_context_answer features: - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 8999476 num_examples: 10000 - name: validation num_bytes: 888031 num_examples: 1000 download_size: 2595726 dataset_size: 9887507 - config_name: adversarial_qa_droberta_tell_what_it_is features: - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 9400737 num_examples: 10000 - name: validation num_bytes: 927874 num_examples: 1000 download_size: 2633324 dataset_size: 10328611 - config_name: ag_news_classify features: - name: answer_choices list: string - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 51201178 num_examples: 120000 - name: test num_bytes: 3233173 num_examples: 7600 download_size: 21150470 dataset_size: 54434351 - config_name: ag_news_classify_question_first features: - name: answer_choices list: string - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 51201178 num_examples: 120000 - name: test num_bytes: 3233173 num_examples: 7600 download_size: 21013028 dataset_size: 54434351 - config_name: ag_news_classify_with_choices features: - name: answer_choices list: string - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 55521424 num_examples: 120000 - name: test num_bytes: 3506795 num_examples: 7600 download_size: 21835832 dataset_size: 59028219 - config_name: ag_news_classify_with_choices_question_first features: - name: answer_choices list: string - name: 
inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 55521424 num_examples: 120000 - name: test num_bytes: 3506795 num_examples: 7600 download_size: 21693542 dataset_size: 59028219 - config_name: ag_news_recommend features: - name: answer_choices list: string - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 51891217 num_examples: 120000 - name: test num_bytes: 3276936 num_examples: 7600 download_size: 21812607 dataset_size: 55168153 - config_name: ag_news_which_section features: - name: answer_choices list: string - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 51861207 num_examples: 120000 - name: test num_bytes: 3274999 num_examples: 7600 download_size: 21236892 dataset_size: 55136206 - config_name: ag_news_which_section_choices features: - name: answer_choices list: string - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 60619719 num_examples: 120000 - name: test num_bytes: 3829486 num_examples: 7600 download_size: 22487894 dataset_size: 64449205 - config_name: ai2_arc_ARC_Challenge_heres_a_problem features: - name: answer_choices list: string - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 465398 num_examples: 1119 - name: validation num_bytes: 127761 num_examples: 299 - name: test num_bytes: 496119 num_examples: 1172 download_size: 457627 dataset_size: 1089278 - config_name: ai2_arc_ARC_Challenge_i_am_hesitating features: - name: answer_choices list: string - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 689304 num_examples: 1119 - name: validation num_bytes: 186520 num_examples: 299 - name: test num_bytes: 716071 num_examples: 1172 download_size: 749084 
dataset_size: 1591895 - config_name: ai2_arc_ARC_Challenge_multiple_choice features: - name: answer_choices list: string - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 748129 num_examples: 1119 - name: validation num_bytes: 202353 num_examples: 299 - name: test num_bytes: 777708 num_examples: 1172 download_size: 769013 dataset_size: 1728190 - config_name: ai2_arc_ARC_Challenge_pick_false_options features: - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 491201 num_examples: 1119 - name: validation num_bytes: 135537 num_examples: 299 - name: test num_bytes: 525124 num_examples: 1172 download_size: 601174 dataset_size: 1151862 - config_name: ai2_arc_ARC_Challenge_pick_the_most_correct_option features: - name: answer_choices list: string - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 462864 num_examples: 1119 - name: validation num_bytes: 127018 num_examples: 299 - name: test num_bytes: 493354 num_examples: 1172 download_size: 462204 dataset_size: 1083236 - config_name: ai2_arc_ARC_Challenge_qa_options features: - name: answer_choices list: string - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 561853 num_examples: 1119 - name: validation num_bytes: 152514 num_examples: 299 - name: test num_bytes: 582558 num_examples: 1172 download_size: 723478 dataset_size: 1296925 - config_name: ai2_arc_ARC_Easy_heres_a_problem features: - name: answer_choices list: string - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 850868 num_examples: 2251 - name: validation num_bytes: 215754 num_examples: 570 - name: test num_bytes: 902804 num_examples: 2376 download_size: 784671 dataset_size: 1969426 - config_name: ai2_arc_ARC_Easy_i_am_hesitating 
features: - name: answer_choices list: string - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 1263913 num_examples: 2251 - name: validation num_bytes: 337210 num_examples: 570 - name: test num_bytes: 1349197 num_examples: 2376 download_size: 1244736 dataset_size: 2950320 - config_name: ai2_arc_ARC_Easy_multiple_choice features: - name: answer_choices list: string - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 1382690 num_examples: 2251 - name: validation num_bytes: 367311 num_examples: 570 - name: test num_bytes: 1474809 num_examples: 2376 download_size: 1282047 dataset_size: 3224810 - config_name: ai2_arc_ARC_Easy_pick_false_options features: - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 866252 num_examples: 2251 - name: validation num_bytes: 219001 num_examples: 570 - name: test num_bytes: 919188 num_examples: 2376 download_size: 1016220 dataset_size: 2004441 - config_name: ai2_arc_ARC_Easy_pick_the_most_correct_option features: - name: answer_choices list: string - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 846126 num_examples: 2251 - name: validation num_bytes: 214684 num_examples: 570 - name: test num_bytes: 897783 num_examples: 2376 download_size: 795969 dataset_size: 1958593 - config_name: ai2_arc_ARC_Easy_qa_options features: - name: answer_choices list: string - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 1007817 num_examples: 2251 - name: validation num_bytes: 272367 num_examples: 570 - name: test num_bytes: 1079242 num_examples: 2376 download_size: 1192608 dataset_size: 2359426 - config_name: amazon_polarity_Is_this_product_review_positive features: - name: answer_choices list: string - name: 
inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 2116403330 num_examples: 3600000 - name: test num_bytes: 235047773 num_examples: 400000 download_size: 1234623408 dataset_size: 2351451103 - config_name: amazon_polarity_Is_this_review features: - name: answer_choices list: string - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 2332403191 num_examples: 3600000 - name: test num_bytes: 259047773 num_examples: 400000 download_size: 1234078263 dataset_size: 2591450964 - config_name: amazon_polarity_Is_this_review_negative features: - name: answer_choices list: string - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 2084003136 num_examples: 3600000 - name: test num_bytes: 231447772 num_examples: 400000 download_size: 1234000458 dataset_size: 2315450908 - config_name: amazon_polarity_User_recommend_this_product features: - name: answer_choices list: string - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 2087813177 num_examples: 3600000 - name: test num_bytes: 231855467 num_examples: 400000 download_size: 1173652629 dataset_size: 2319668644 - config_name: amazon_polarity_convey_negative_or_positive_sentiment features: - name: answer_choices list: string - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 2436803218 num_examples: 3600000 - name: test num_bytes: 270647775 num_examples: 400000 download_size: 1252792299 dataset_size: 2707450993 - config_name: amazon_polarity_flattering_or_not features: - name: answer_choices list: string - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 2309994572 num_examples: 3600000 - name: test num_bytes: 256544770 num_examples: 400000 
download_size: 1261191576 dataset_size: 2566539342 - config_name: amazon_polarity_negative_or_positive_tone features: - name: answer_choices list: string - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 2480002823 num_examples: 3600000 - name: test num_bytes: 275447773 num_examples: 400000 download_size: 1254172662 dataset_size: 2755450596 - config_name: anli_GPT_3_style_r1 features: - name: answer_choices list: string - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 9115390 num_examples: 16946 - name: validation num_bytes: 536694 num_examples: 1000 - name: test num_bytes: 540176 num_examples: 1000 download_size: 3459817 dataset_size: 10192260 - config_name: anli_GPT_3_style_r1_score_eval features: - name: idx list: int32 - name: inputs_pretokenized dtype: string - name: is_correct dtype: bool - name: targets_pretokenized dtype: string - name: weight dtype: float32 splits: - name: train num_bytes: 24867598 num_examples: 50838 - name: validation num_bytes: 1460439 num_examples: 3000 - name: test num_bytes: 1470885 num_examples: 3000 download_size: 5023993 dataset_size: 27798922 - config_name: anli_GPT_3_style_r2 features: - name: answer_choices list: string - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 24000342 num_examples: 45460 - name: validation num_bytes: 532044 num_examples: 1000 - name: test num_bytes: 533883 num_examples: 1000 download_size: 6649999 dataset_size: 25066269 - config_name: anli_GPT_3_style_r2_score_eval features: - name: idx list: int32 - name: inputs_pretokenized dtype: string - name: is_correct dtype: bool - name: targets_pretokenized dtype: string - name: weight dtype: float32 splits: - name: train num_bytes: 65432804 num_examples: 136380 - name: validation num_bytes: 1446489 num_examples: 3000 - name: test num_bytes: 1452006 
num_examples: 3000 download_size: 10738630 dataset_size: 68331299 - config_name: anli_GPT_3_style_r3 features: - name: answer_choices list: string - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 51902632 num_examples: 100459 - name: validation num_bytes: 629291 num_examples: 1200 - name: test num_bytes: 628740 num_examples: 1200 download_size: 14041053 dataset_size: 53160663 - config_name: anli_GPT_3_style_r3_score_eval features: - name: idx list: int32 - name: inputs_pretokenized dtype: string - name: is_correct dtype: bool - name: targets_pretokenized dtype: string - name: weight dtype: float32 splits: - name: train num_bytes: 140973371 num_examples: 301377 - name: validation num_bytes: 1708385 num_examples: 3600 - name: test num_bytes: 1706732 num_examples: 3600 download_size: 22892192 dataset_size: 144388488 - config_name: anli_MNLI_crowdsource_r1 features: - name: answer_choices list: string - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 10642813 num_examples: 16946 - name: validation num_bytes: 626399 num_examples: 1000 - name: test num_bytes: 627777 num_examples: 1000 download_size: 3660956 dataset_size: 11896989 - config_name: anli_MNLI_crowdsource_r1_score_eval features: - name: idx list: int32 - name: inputs_pretokenized dtype: string - name: is_correct dtype: bool - name: targets_pretokenized dtype: string - name: weight dtype: float32 splits: - name: train num_bytes: 29776793 num_examples: 50838 - name: validation num_bytes: 1753564 num_examples: 3000 - name: test num_bytes: 1757698 num_examples: 3000 download_size: 5491534 dataset_size: 33288055 - config_name: anli_MNLI_crowdsource_r2 features: - name: answer_choices list: string - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 28143499 num_examples: 45460 - name: validation num_bytes: 621285 
num_examples: 1000 - name: test num_bytes: 622546 num_examples: 1000 download_size: 7070331 dataset_size: 29387330 - config_name: anli_MNLI_crowdsource_r2_score_eval features: - name: idx list: int32 - name: inputs_pretokenized dtype: string - name: is_correct dtype: bool - name: targets_pretokenized dtype: string - name: weight dtype: float32 splits: - name: train num_bytes: 78614594 num_examples: 136380 - name: validation num_bytes: 1738222 num_examples: 3000 - name: test num_bytes: 1742005 num_examples: 3000 download_size: 11848479 dataset_size: 82094821 - config_name: anli_MNLI_crowdsource_r3 features: - name: answer_choices list: string - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 60941758 num_examples: 100459 - name: validation num_bytes: 735534 num_examples: 1200 - name: test num_bytes: 735326 num_examples: 1200 download_size: 14942935 dataset_size: 62412618 - config_name: anli_MNLI_crowdsource_r3_score_eval features: - name: idx list: int32 - name: inputs_pretokenized dtype: string - name: is_correct dtype: bool - name: targets_pretokenized dtype: string - name: weight dtype: float32 splits: - name: train num_bytes: 170080650 num_examples: 301377 - name: validation num_bytes: 2055764 num_examples: 3600 - name: test num_bytes: 2055140 num_examples: 3600 download_size: 25275994 dataset_size: 174191554 - config_name: anli_always_sometimes_never_r1 features: - name: answer_choices list: string - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 11191134 num_examples: 16946 - name: validation num_bytes: 656563 num_examples: 1000 - name: test num_bytes: 658871 num_examples: 1000 download_size: 3587589 dataset_size: 12506568 - config_name: anli_always_sometimes_never_r1_score_eval features: - name: idx list: int32 - name: inputs_pretokenized dtype: string - name: is_correct dtype: bool - name: targets_pretokenized dtype: string 
- name: weight dtype: float32 splits: - name: train num_bytes: 27700949 num_examples: 50838 - name: validation num_bytes: 1628101 num_examples: 3000 - name: test num_bytes: 1635025 num_examples: 3000 download_size: 5260211 dataset_size: 30964075 - config_name: anli_always_sometimes_never_r2 features: - name: answer_choices list: string - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 29812996 num_examples: 45460 - name: validation num_bytes: 654966 num_examples: 1000 - name: test num_bytes: 658147 num_examples: 1000 download_size: 6908336 dataset_size: 31126109 - config_name: anli_always_sometimes_never_r2_score_eval features: - name: idx list: int32 - name: inputs_pretokenized dtype: string - name: is_correct dtype: bool - name: targets_pretokenized dtype: string - name: weight dtype: float32 splits: - name: train num_bytes: 73576704 num_examples: 136380 - name: validation num_bytes: 1623431 num_examples: 3000 - name: test num_bytes: 1632853 num_examples: 3000 download_size: 11287979 dataset_size: 76832988 - config_name: anli_always_sometimes_never_r3 features: - name: answer_choices list: string - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 65435333 num_examples: 100459 - name: validation num_bytes: 787929 num_examples: 1200 - name: test num_bytes: 787630 num_examples: 1200 download_size: 14623047 dataset_size: 67010892 - config_name: anli_always_sometimes_never_r3_score_eval features: - name: idx list: int32 - name: inputs_pretokenized dtype: string - name: is_correct dtype: bool - name: targets_pretokenized dtype: string - name: weight dtype: float32 splits: - name: train num_bytes: 161561672 num_examples: 301377 - name: validation num_bytes: 1953785 num_examples: 3600 - name: test num_bytes: 1952888 num_examples: 3600 download_size: 24140958 dataset_size: 165468345 - config_name: anli_based_on_the_previous_passage_r1 
features: - name: answer_choices list: string - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 10187263 num_examples: 16946 - name: validation num_bytes: 596966 num_examples: 1000 - name: test num_bytes: 600395 num_examples: 1000 download_size: 3580346 dataset_size: 11384624 - config_name: anli_based_on_the_previous_passage_r1_score_eval features: - name: idx list: int32 - name: inputs_pretokenized dtype: string - name: is_correct dtype: bool - name: targets_pretokenized dtype: string - name: weight dtype: float32 splits: - name: train num_bytes: 27768843 num_examples: 50838 - name: validation num_bytes: 1632270 num_examples: 3000 - name: test num_bytes: 1642557 num_examples: 3000 download_size: 5297339 dataset_size: 31043670 - config_name: anli_based_on_the_previous_passage_r2 features: - name: answer_choices list: string - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 26981220 num_examples: 45460 - name: validation num_bytes: 593000 num_examples: 1000 - name: test num_bytes: 594249 num_examples: 1000 download_size: 6899624 dataset_size: 28168469 - config_name: anli_based_on_the_previous_passage_r2_score_eval features: - name: idx list: int32 - name: inputs_pretokenized dtype: string - name: is_correct dtype: bool - name: targets_pretokenized dtype: string - name: weight dtype: float32 splits: - name: train num_bytes: 73278732 num_examples: 136380 - name: validation num_bytes: 1620372 num_examples: 3000 - name: test num_bytes: 1624119 num_examples: 3000 download_size: 11365938 dataset_size: 76523223 - config_name: anli_based_on_the_previous_passage_r3 features: - name: answer_choices list: string - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 58306464 num_examples: 100459 - name: validation num_bytes: 702240 num_examples: 1200 - name: test num_bytes: 701887 
num_examples: 1200 download_size: 14536507 dataset_size: 59710591 - config_name: anli_based_on_the_previous_passage_r3_score_eval features: - name: idx list: int32 - name: inputs_pretokenized dtype: string - name: is_correct dtype: bool - name: targets_pretokenized dtype: string - name: weight dtype: float32 splits: - name: train num_bytes: 158428558 num_examples: 301377 - name: validation num_bytes: 1916108 num_examples: 3600 - name: test num_bytes: 1915049 num_examples: 3600 download_size: 24238355 dataset_size: 162259715 - config_name: anli_can_we_infer_r1 features: - name: answer_choices list: string - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 9660251 num_examples: 16946 - name: validation num_bytes: 567095 num_examples: 1000 - name: test num_bytes: 569234 num_examples: 1000 download_size: 3540836 dataset_size: 10796580 - config_name: anli_can_we_infer_r1_score_eval features: - name: idx list: int32 - name: inputs_pretokenized dtype: string - name: is_correct dtype: bool - name: targets_pretokenized dtype: string - name: weight dtype: float32 splits: - name: train num_bytes: 26188002 num_examples: 50838 - name: validation num_bytes: 1542657 num_examples: 3000 - name: test num_bytes: 1549074 num_examples: 3000 download_size: 5185988 dataset_size: 29279733 - config_name: anli_can_we_infer_r2 features: - name: answer_choices list: string - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 25702496 num_examples: 45460 - name: validation num_bytes: 564027 num_examples: 1000 - name: test num_bytes: 566605 num_examples: 1000 download_size: 6839728 dataset_size: 26833128 - config_name: anli_can_we_infer_r2_score_eval features: - name: idx list: int32 - name: inputs_pretokenized dtype: string - name: is_correct dtype: bool - name: targets_pretokenized dtype: string - name: weight dtype: float32 splits: - name: train num_bytes: 
69442411 num_examples: 136380 - name: validation num_bytes: 1533453 num_examples: 3000 - name: test num_bytes: 1541187 num_examples: 3000 download_size: 11133446 dataset_size: 72517051 - config_name: anli_can_we_infer_r3 features: - name: answer_choices list: string - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 55711582 num_examples: 100459 - name: validation num_bytes: 671622 num_examples: 1200 - name: test num_bytes: 671041 num_examples: 1200 download_size: 14400590 dataset_size: 57054245 - config_name: anli_can_we_infer_r3_score_eval features: - name: idx list: int32 - name: inputs_pretokenized dtype: string - name: is_correct dtype: bool - name: targets_pretokenized dtype: string - name: weight dtype: float32 splits: - name: train num_bytes: 150644154 num_examples: 301377 - name: validation num_bytes: 1824254 num_examples: 3600 - name: test num_bytes: 1822511 num_examples: 3600 download_size: 23718191 dataset_size: 154290919 - config_name: anli_claim_true_false_inconclusive_r1_score_eval features: - name: idx list: int32 - name: inputs_pretokenized dtype: string - name: is_correct dtype: bool - name: targets_pretokenized dtype: string - name: weight dtype: float32 splits: - name: train num_bytes: 28009371 num_examples: 50838 - name: validation num_bytes: 1645608 num_examples: 3000 - name: test num_bytes: 1656036 num_examples: 3000 download_size: 5309202 dataset_size: 31311015 - config_name: anli_claim_true_false_inconclusive_r2 features: - name: answer_choices list: string - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 27303933 num_examples: 45460 - name: validation num_bytes: 603705 num_examples: 1000 - name: test num_bytes: 605108 num_examples: 1000 download_size: 6909550 dataset_size: 28512746 - config_name: anli_claim_true_false_inconclusive_r2_score_eval features: - name: idx list: int32 - name: inputs_pretokenized 
dtype: string - name: is_correct dtype: bool - name: targets_pretokenized dtype: string - name: weight dtype: float32 splits: - name: train num_bytes: 73958163 num_examples: 136380 - name: validation num_bytes: 1634481 num_examples: 3000 - name: test num_bytes: 1638690 num_examples: 3000 download_size: 11378115 dataset_size: 77231334 - config_name: anli_claim_true_false_inconclusive_r3 features: - name: answer_choices list: string - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 59171538 num_examples: 100459 - name: validation num_bytes: 715035 num_examples: 1200 - name: test num_bytes: 714555 num_examples: 1200 download_size: 14576610 dataset_size: 60601128 - config_name: anli_claim_true_false_inconclusive_r3_score_eval features: - name: idx list: int32 - name: inputs_pretokenized dtype: string - name: is_correct dtype: bool - name: targets_pretokenized dtype: string - name: weight dtype: float32 splits: - name: train num_bytes: 159869656 num_examples: 301377 - name: validation num_bytes: 1933163 num_examples: 3600 - name: test num_bytes: 1931723 num_examples: 3600 download_size: 24278176 dataset_size: 163734542 - config_name: anli_consider_always_sometimes_never_r1 features: - name: answer_choices list: string - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 11760024 num_examples: 16946 - name: validation num_bytes: 689682 num_examples: 1000 - name: test num_bytes: 693346 num_examples: 1000 download_size: 3573913 dataset_size: 13143052 - config_name: anli_consider_always_sometimes_never_r1_score_eval features: - name: idx list: int32 - name: inputs_pretokenized dtype: string - name: is_correct dtype: bool - name: targets_pretokenized dtype: string - name: weight dtype: float32 splits: - name: train num_bytes: 29407373 num_examples: 50838 - name: validation num_bytes: 1727533 num_examples: 3000 - name: test num_bytes: 1738450 
num_examples: 3000 download_size: 5322654 dataset_size: 32873356 - config_name: anli_consider_always_sometimes_never_r2 features: - name: answer_choices list: string - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 31166765 num_examples: 45460 - name: validation num_bytes: 686050 num_examples: 1000 - name: test num_bytes: 687415 num_examples: 1000 download_size: 6920348 dataset_size: 32540230 - config_name: anli_consider_always_sometimes_never_r2_score_eval features: - name: idx list: int32 - name: inputs_pretokenized dtype: string - name: is_correct dtype: bool - name: targets_pretokenized dtype: string - name: weight dtype: float32 splits: - name: train num_bytes: 77636441 num_examples: 136380 - name: validation num_bytes: 1716562 num_examples: 3000 - name: test num_bytes: 1720657 num_examples: 3000 download_size: 11455538 dataset_size: 81073660 - config_name: anli_consider_always_sometimes_never_r3 features: - name: answer_choices list: string - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 67603449 num_examples: 100459 - name: validation num_bytes: 813779 num_examples: 1200 - name: test num_bytes: 813586 num_examples: 1200 download_size: 14579500 dataset_size: 69230814 - config_name: anli_does_it_follow_that_r1 features: - name: answer_choices list: string - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 9519548 num_examples: 16946 - name: validation num_bytes: 558881 num_examples: 1000 - name: test num_bytes: 559906 num_examples: 1000 download_size: 3515973 dataset_size: 10638335 - config_name: anli_does_it_follow_that_r1_score_eval features: - name: idx list: int32 - name: inputs_pretokenized dtype: string - name: is_correct dtype: bool - name: targets_pretokenized dtype: string - name: weight dtype: float32 splits: - name: train num_bytes: 25766034 
num_examples: 50838 - name: validation num_bytes: 1518015 num_examples: 3000 - name: test num_bytes: 1521090 num_examples: 3000 download_size: 5128975 dataset_size: 28805139 - config_name: anli_does_it_follow_that_r2 features: - name: answer_choices list: string - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 25316900 num_examples: 45460 - name: validation num_bytes: 555526 num_examples: 1000 - name: test num_bytes: 557906 num_examples: 1000 download_size: 6775880 dataset_size: 26430332 - config_name: anli_does_it_follow_that_r2_score_eval features: - name: idx list: int32 - name: inputs_pretokenized dtype: string - name: is_correct dtype: bool - name: targets_pretokenized dtype: string - name: weight dtype: float32 splits: - name: train num_bytes: 68285560 num_examples: 136380 - name: validation num_bytes: 1507950 num_examples: 3000 - name: test num_bytes: 1515090 num_examples: 3000 download_size: 10995030 dataset_size: 71308600 - config_name: anli_does_it_follow_that_r3 features: - name: answer_choices list: string - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 55098606 num_examples: 100459 - name: validation num_bytes: 663250 num_examples: 1200 - name: test num_bytes: 663142 num_examples: 1200 download_size: 14338970 dataset_size: 56424998 - config_name: anli_does_it_follow_that_r3_score_eval features: - name: idx list: int32 - name: inputs_pretokenized dtype: string - name: is_correct dtype: bool - name: targets_pretokenized dtype: string - name: weight dtype: float32 splits: - name: train num_bytes: 148805338 num_examples: 301377 - name: validation num_bytes: 1799138 num_examples: 3600 - name: test num_bytes: 1798814 num_examples: 3600 download_size: 23554543 dataset_size: 152403290 - config_name: anli_does_this_imply_r1 features: - name: answer_choices list: string - name: inputs_pretokenized dtype: string - name: 
targets_pretokenized dtype: string splits: - name: train num_bytes: 9655086 num_examples: 16946 - name: validation num_bytes: 565658 num_examples: 1000 - name: test num_bytes: 568828 num_examples: 1000 download_size: 3492464 dataset_size: 10789572 - config_name: anli_does_this_imply_r1_score_eval features: - name: idx list: int32 - name: inputs_pretokenized dtype: string - name: is_correct dtype: bool - name: targets_pretokenized dtype: string - name: weight dtype: float32 splits: - name: train num_bytes: 26172648 num_examples: 50838 - name: validation num_bytes: 1538346 num_examples: 3000 - name: test num_bytes: 1547856 num_examples: 3000 download_size: 5134759 dataset_size: 29258850 - config_name: anli_does_this_imply_r2 features: - name: answer_choices list: string - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 25536784 num_examples: 45460 - name: validation num_bytes: 560912 num_examples: 1000 - name: test num_bytes: 563115 num_examples: 1000 download_size: 6697367 dataset_size: 26660811 - config_name: anli_does_this_imply_r2_score_eval features: - name: idx list: int32 - name: inputs_pretokenized dtype: string - name: is_correct dtype: bool - name: targets_pretokenized dtype: string - name: weight dtype: float32 splits: - name: train num_bytes: 68945440 num_examples: 136380 - name: validation num_bytes: 1524108 num_examples: 3000 - name: test num_bytes: 1530717 num_examples: 3000 download_size: 10938678 dataset_size: 72000265 - config_name: anli_does_this_imply_r3 features: - name: answer_choices list: string - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 55037282 num_examples: 100459 - name: validation num_bytes: 663232 num_examples: 1200 - name: test num_bytes: 663019 num_examples: 1200 download_size: 14125310 dataset_size: 56363533 - config_name: anli_does_this_imply_r3_score_eval features: - name: idx list: int32 - 
name: inputs_pretokenized dtype: string - name: is_correct dtype: bool - name: targets_pretokenized dtype: string - name: weight dtype: float32 splits: - name: train num_bytes: 148621016 num_examples: 301377 - name: validation num_bytes: 1799084 num_examples: 3600 - name: test num_bytes: 1798445 num_examples: 3600 download_size: 23265299 dataset_size: 152218545 - config_name: anli_guaranteed_possible_impossible_r1 features: - name: answer_choices list: string - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 10167319 num_examples: 16946 - name: validation num_bytes: 597064 num_examples: 1000 - name: test num_bytes: 599508 num_examples: 1000 download_size: 3575441 dataset_size: 11363891 - config_name: anli_guaranteed_possible_impossible_r1_score_eval features: - name: idx list: int32 - name: inputs_pretokenized dtype: string - name: is_correct dtype: bool - name: targets_pretokenized dtype: string - name: weight dtype: float32 splits: - name: train num_bytes: 26962553 num_examples: 50838 - name: validation num_bytes: 1584541 num_examples: 3000 - name: test num_bytes: 1591873 num_examples: 3000 download_size: 5250394 dataset_size: 30138967 - config_name: anli_guaranteed_possible_impossible_r2_score_eval features: - name: idx list: int32 - name: inputs_pretokenized dtype: string - name: is_correct dtype: bool - name: targets_pretokenized dtype: string - name: weight dtype: float32 splits: - name: train num_bytes: 71603447 num_examples: 136380 - name: validation num_bytes: 1579576 num_examples: 3000 - name: test num_bytes: 1588447 num_examples: 3000 download_size: 11266213 dataset_size: 74771470 - config_name: anli_guaranteed_possible_impossible_r3 features: - name: answer_choices list: string - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 59431742 num_examples: 100459 - name: validation num_bytes: 718066 num_examples: 1200 - name: 
test num_bytes: 716235 num_examples: 1200 download_size: 14660957 dataset_size: 60866043 - config_name: anli_guaranteed_possible_impossible_r3_score_eval features: - name: idx list: int32 - name: inputs_pretokenized dtype: string - name: is_correct dtype: bool - name: targets_pretokenized dtype: string - name: weight dtype: float32 splits: - name: train num_bytes: 157322530 num_examples: 301377 - name: validation num_bytes: 1906052 num_examples: 3600 - name: test num_bytes: 1900559 num_examples: 3600 download_size: 24211019 dataset_size: 161129141 - config_name: anli_guaranteed_true_r1_score_eval features: - name: idx list: int32 - name: inputs_pretokenized dtype: string - name: is_correct dtype: bool - name: targets_pretokenized dtype: string - name: weight dtype: float32 splits: - name: train num_bytes: 26303334 num_examples: 50838 - name: validation num_bytes: 1549782 num_examples: 3000 - name: test num_bytes: 1554123 num_examples: 3000 download_size: 5189596 dataset_size: 29407239 - config_name: anli_guaranteed_true_r2 features: - name: answer_choices list: string - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 25800134 num_examples: 45460 - name: validation num_bytes: 566492 num_examples: 1000 - name: test num_bytes: 568940 num_examples: 1000 download_size: 6816917 dataset_size: 26935566 - config_name: anli_guaranteed_true_r2_score_eval features: - name: idx list: int32 - name: inputs_pretokenized dtype: string - name: is_correct dtype: bool - name: targets_pretokenized dtype: string - name: weight dtype: float32 splits: - name: train num_bytes: 69735655 num_examples: 136380 - name: validation num_bytes: 1540848 num_examples: 3000 - name: test num_bytes: 1548192 num_examples: 3000 download_size: 11124774 dataset_size: 72824695 - config_name: anli_guaranteed_true_r3 features: - name: answer_choices list: string - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: 
string splits: - name: train num_bytes: 56032445 num_examples: 100459 - name: validation num_bytes: 675708 num_examples: 1200 - name: test num_bytes: 675096 num_examples: 1200 download_size: 14434826 dataset_size: 57383249 - config_name: anli_guaranteed_true_r3_score_eval features: - name: idx list: int32 - name: inputs_pretokenized dtype: string - name: is_correct dtype: bool - name: targets_pretokenized dtype: string - name: weight dtype: float32 splits: - name: train num_bytes: 151606885 num_examples: 301377 - name: validation num_bytes: 1836512 num_examples: 3600 - name: test num_bytes: 1834676 num_examples: 3600 download_size: 23818570 dataset_size: 155278073 - config_name: anli_justified_in_saying_r1 features: - name: answer_choices list: string - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 9616558 num_examples: 16946 - name: validation num_bytes: 563304 num_examples: 1000 - name: test num_bytes: 566515 num_examples: 1000 download_size: 3521817 dataset_size: 10746377 - config_name: anli_justified_in_saying_r1_score_eval features: - name: idx list: int32 - name: inputs_pretokenized dtype: string - name: is_correct dtype: bool - name: targets_pretokenized dtype: string - name: weight dtype: float32 splits: - name: train num_bytes: 26057064 num_examples: 50838 - name: validation num_bytes: 1531284 num_examples: 3000 - name: test num_bytes: 1540917 num_examples: 3000 download_size: 5139740 dataset_size: 29129265 - config_name: anli_justified_in_saying_r2 features: - name: answer_choices list: string - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 25437357 num_examples: 45460 - name: validation num_bytes: 558810 num_examples: 1000 - name: test num_bytes: 560649 num_examples: 1000 download_size: 6730996 dataset_size: 26556816 - config_name: anli_justified_in_saying_r2_score_eval features: - name: idx list: int32 - name: 
inputs_pretokenized dtype: string - name: is_correct dtype: bool - name: targets_pretokenized dtype: string - name: weight dtype: float32 splits: - name: train num_bytes: 68647159 num_examples: 136380 - name: validation num_bytes: 1517802 num_examples: 3000 - name: test num_bytes: 1523319 num_examples: 3000 download_size: 10951560 dataset_size: 71688280 - config_name: anli_justified_in_saying_r3 features: - name: answer_choices list: string - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 54839356 num_examples: 100459 - name: validation num_bytes: 661135 num_examples: 1200 - name: test num_bytes: 660452 num_examples: 1200 download_size: 14173959 dataset_size: 56160943 - config_name: anli_justified_in_saying_r3_score_eval features: - name: idx list: int32 - name: inputs_pretokenized dtype: string - name: is_correct dtype: bool - name: targets_pretokenized dtype: string - name: weight dtype: float32 splits: - name: train num_bytes: 148027576 num_examples: 301377 - name: validation num_bytes: 1792793 num_examples: 3600 - name: test num_bytes: 1790744 num_examples: 3600 download_size: 23294761 dataset_size: 151611113 - config_name: anli_must_be_true_r1 features: - name: answer_choices list: string - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 9670808 num_examples: 16946 - name: validation num_bytes: 567825 num_examples: 1000 - name: test num_bytes: 569003 num_examples: 1000 download_size: 3529205 dataset_size: 10807636 - config_name: anli_must_be_true_r1_score_eval features: - name: idx list: int32 - name: inputs_pretokenized dtype: string - name: is_correct dtype: bool - name: targets_pretokenized dtype: string - name: weight dtype: float32 splits: - name: train num_bytes: 26219814 num_examples: 50838 - name: validation num_bytes: 1544847 num_examples: 3000 - name: test num_bytes: 1548381 num_examples: 3000 download_size: 5178314 
dataset_size: 29313042 - config_name: anli_must_be_true_r2 features: - name: answer_choices list: string - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 25724048 num_examples: 45460 - name: validation num_bytes: 564549 num_examples: 1000 - name: test num_bytes: 567086 num_examples: 1000 download_size: 6789593 dataset_size: 26855683 - config_name: anli_must_be_true_r2_score_eval features: - name: idx list: int32 - name: inputs_pretokenized dtype: string - name: is_correct dtype: bool - name: targets_pretokenized dtype: string - name: weight dtype: float32 splits: - name: train num_bytes: 69507232 num_examples: 136380 - name: validation num_bytes: 1535019 num_examples: 3000 - name: test num_bytes: 1542630 num_examples: 3000 download_size: 11090449 dataset_size: 72584881 - config_name: anli_must_be_true_r3_score_eval features: - name: idx list: int32 - name: inputs_pretokenized dtype: string - name: is_correct dtype: bool - name: targets_pretokenized dtype: string - name: weight dtype: float32 splits: - name: train num_bytes: 151384159 num_examples: 301377 - name: validation num_bytes: 1831217 num_examples: 3600 - name: test num_bytes: 1830050 num_examples: 3600 download_size: 23694968 dataset_size: 155045426 - config_name: anli_should_assume_r1 features: - name: answer_choices list: string - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 9808224 num_examples: 16946 - name: validation num_bytes: 576245 num_examples: 1000 - name: test num_bytes: 577674 num_examples: 1000 download_size: 3557170 dataset_size: 10962143 - config_name: anli_should_assume_r1_score_eval features: - name: idx list: int32 - name: inputs_pretokenized dtype: string - name: is_correct dtype: bool - name: targets_pretokenized dtype: string - name: weight dtype: float32 splits: - name: train num_bytes: 26632062 num_examples: 50838 - name: validation num_bytes: 
1570107 num_examples: 3000 - name: test num_bytes: 1574394 num_examples: 3000 download_size: 5227051 dataset_size: 29776563 - config_name: anli_should_assume_r2 features: - name: answer_choices list: string - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 26138650 num_examples: 45460 - name: validation num_bytes: 573744 num_examples: 1000 - name: test num_bytes: 576288 num_examples: 1000 download_size: 6843519 dataset_size: 27288682 - config_name: anli_should_assume_r2_score_eval features: - name: idx list: int32 - name: inputs_pretokenized dtype: string - name: is_correct dtype: bool - name: targets_pretokenized dtype: string - name: weight dtype: float32 splits: - name: train num_bytes: 70751038 num_examples: 136380 - name: validation num_bytes: 1562604 num_examples: 3000 - name: test num_bytes: 1570236 num_examples: 3000 download_size: 11203140 dataset_size: 73883878 - config_name: anli_should_assume_r3 features: - name: answer_choices list: string - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 56811303 num_examples: 100459 - name: validation num_bytes: 684578 num_examples: 1200 - name: test num_bytes: 684389 num_examples: 1200 download_size: 14457250 dataset_size: 58180270 - config_name: anli_should_assume_r3_score_eval features: - name: idx list: int32 - name: inputs_pretokenized dtype: string - name: is_correct dtype: bool - name: targets_pretokenized dtype: string - name: weight dtype: float32 splits: - name: train num_bytes: 153943435 num_examples: 301377 - name: validation num_bytes: 1863122 num_examples: 3600 - name: test num_bytes: 1862555 num_examples: 3600 download_size: 23996809 dataset_size: 157669112 - config_name: anli_take_the_following_as_truth_r1 features: - name: answer_choices list: string - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 
10155922 num_examples: 16946 - name: validation num_bytes: 596175 num_examples: 1000 - name: test num_bytes: 602483 num_examples: 1000 download_size: 3647827 dataset_size: 11354580 - config_name: anli_take_the_following_as_truth_r1_score_eval features: - name: idx list: int32 - name: inputs_pretokenized dtype: string - name: is_correct dtype: bool - name: targets_pretokenized dtype: string - name: weight dtype: float32 splits: - name: train num_bytes: 27494019 num_examples: 50838 - name: validation num_bytes: 1611891 num_examples: 3000 - name: test num_bytes: 1630815 num_examples: 3000 download_size: 5358731 dataset_size: 30736725 - config_name: anli_take_the_following_as_truth_r2 features: - name: answer_choices list: string - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 26847342 num_examples: 45460 - name: validation num_bytes: 593274 num_examples: 1000 - name: test num_bytes: 595952 num_examples: 1000 download_size: 7011166 dataset_size: 28036568 - config_name: anli_take_the_following_as_truth_r2_score_eval features: - name: idx list: int32 - name: inputs_pretokenized dtype: string - name: is_correct dtype: bool - name: targets_pretokenized dtype: string - name: weight dtype: float32 splits: - name: train num_bytes: 72588670 num_examples: 136380 - name: validation num_bytes: 1603188 num_examples: 3000 - name: test num_bytes: 1611222 num_examples: 3000 download_size: 11470246 dataset_size: 75803080 - config_name: anli_take_the_following_as_truth_r3 features: - name: answer_choices list: string - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 58926387 num_examples: 100459 - name: validation num_bytes: 712653 num_examples: 1200 - name: test num_bytes: 710980 num_examples: 1200 download_size: 14817660 dataset_size: 60350020 - config_name: anli_take_the_following_as_truth_r3_score_eval features: - name: idx list: int32 - name: 
inputs_pretokenized dtype: string - name: is_correct dtype: bool - name: targets_pretokenized dtype: string - name: weight dtype: float32 splits: - name: train num_bytes: 159133840 num_examples: 301377 - name: validation num_bytes: 1926017 num_examples: 3600 - name: test num_bytes: 1920998 num_examples: 3600 download_size: 24605953 dataset_size: 162980855 - config_name: app_reviews_categorize_rating_using_review features: - name: answer_choices list: string - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 84161782 num_examples: 288065 download_size: 16063169 dataset_size: 84161782 - config_name: app_reviews_convert_to_rating features: - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 56636258 num_examples: 288065 download_size: 15450009 dataset_size: 56636258 - config_name: app_reviews_convert_to_star_rating features: - name: answer_choices list: string - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 82142267 num_examples: 288065 download_size: 15479328 dataset_size: 82142267 - config_name: app_reviews_generate_review features: - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 56378272 num_examples: 288065 download_size: 13190483 dataset_size: 56378272 - config_name: cnn_dailymail_3.0.0_generate_story features: - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 720471112 num_examples: 287113 - name: validation num_bytes: 33618761 num_examples: 13368 - name: test num_bytes: 28745061 num_examples: 11490 download_size: 494183488 dataset_size: 782834934 - config_name: cnn_dailymail_3.0.0_news_card_view features: - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 
732243635 num_examples: 287113 - name: validation num_bytes: 34166818 num_examples: 13368 - name: test num_bytes: 29216132 num_examples: 11490 download_size: 497253563 dataset_size: 795626585 - config_name: cnn_dailymail_3.0.0_news_stock features: - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 730808072 num_examples: 287113 - name: validation num_bytes: 34099975 num_examples: 13368 - name: test num_bytes: 29158682 num_examples: 11490 download_size: 496939280 dataset_size: 794066729 - config_name: cnn_dailymail_3.0.0_spice_up_story features: - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 731668204 num_examples: 287113 - name: validation num_bytes: 34140304 num_examples: 13368 - name: test num_bytes: 29193153 num_examples: 11490 download_size: 495827285 dataset_size: 795001661 - config_name: cnn_dailymail_3.0.0_sum_in_brief features: - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 713359413 num_examples: 287113 - name: validation num_bytes: 33281342 num_examples: 13368 - name: test num_bytes: 28452485 num_examples: 11490 download_size: 495236620 dataset_size: 775093240 - config_name: wiki_hop_original_generate_subject_and_object features: - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 324405773 num_examples: 43738 - name: validation num_bytes: 40667716 num_examples: 5129 download_size: 214339064 dataset_size: 365073489 - config_name: wiki_qa_Decide_good_answer features: - name: answer_choices sequence: string - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 6709566 num_examples: 20360 - name: validation num_bytes: 892236 num_examples: 2733 - name: test num_bytes: 2011550 num_examples: 6165 download_size: 3332585 
dataset_size: 9613352 - config_name: wiki_qa_Direct_Answer_to_Question features: - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 247130 num_examples: 1040 - name: validation num_bytes: 33007 num_examples: 140 - name: test num_bytes: 69123 num_examples: 293 download_size: 223105 dataset_size: 349260 - config_name: wiki_qa_Generate_Question_from_Topic features: - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 288904 num_examples: 1040 - name: validation num_bytes: 39404 num_examples: 140 - name: test num_bytes: 78870 num_examples: 293 download_size: 239887 dataset_size: 407178 - config_name: wiki_qa_Is_This_True_ features: - name: answer_choices sequence: string - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 5530835 num_examples: 20360 - name: validation num_bytes: 732086 num_examples: 2733 - name: test num_bytes: 1659667 num_examples: 6165 download_size: 3174774 dataset_size: 7922588 - config_name: wiki_qa_Jeopardy_style features: - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 273177 num_examples: 1040 - name: validation num_bytes: 37569 num_examples: 140 - name: test num_bytes: 74673 num_examples: 293 download_size: 237245 dataset_size: 385419 - config_name: wiki_qa_Topic_Prediction_Answer_Only features: - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 241330 num_examples: 1040 - name: validation num_bytes: 31552 num_examples: 140 - name: test num_bytes: 64548 num_examples: 293 download_size: 210145 dataset_size: 337430 - config_name: wiki_qa_Topic_Prediction_Question_Only features: - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 115109 num_examples: 
1040 - name: validation num_bytes: 14809 num_examples: 140 - name: test num_bytes: 30624 num_examples: 293 download_size: 65785 dataset_size: 160542 - config_name: wiki_qa_Topic_Prediction_Question_and_Answer_Pair features: - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 315922 num_examples: 1040 - name: validation num_bytes: 41981 num_examples: 140 - name: test num_bytes: 85690 num_examples: 293 download_size: 243365 dataset_size: 443593 - config_name: wiki_qa_automatic_system features: - name: answer_choices sequence: string - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 7509189 num_examples: 20360 - name: validation num_bytes: 999229 num_examples: 2733 - name: test num_bytes: 2259517 num_examples: 6165 download_size: 3413064 dataset_size: 10767935 - config_name: wiki_qa_exercise features: - name: answer_choices sequence: string - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 8969100 num_examples: 20360 - name: validation num_bytes: 1194909 num_examples: 2733 - name: test num_bytes: 2706993 num_examples: 6165 download_size: 3473289 dataset_size: 12871002 - config_name: wiki_qa_found_on_google features: - name: answer_choices sequence: string - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 6406361 num_examples: 20360 - name: validation num_bytes: 851514 num_examples: 2733 - name: test num_bytes: 1927237 num_examples: 6165 download_size: 3286517 dataset_size: 9185112 - config_name: winogrande_winogrande_debiased_Replace features: - name: answer_choices sequence: string - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 2494802 num_examples: 9248 - name: validation num_bytes: 318674 num_examples: 1267 - name: test num_bytes: 
474866 num_examples: 1767 download_size: 1133981 dataset_size: 3288342 - config_name: winogrande_winogrande_debiased_Replace_score_eval features: - name: idx sequence: int32 - name: inputs_pretokenized dtype: string - name: is_correct dtype: bool - name: targets_pretokenized dtype: string - name: weight dtype: float32 splits: - name: train num_bytes: 4218647 num_examples: 18496 - name: validation num_bytes: 562090 num_examples: 2534 - name: test num_bytes: 800721 num_examples: 3534 download_size: 1223476 dataset_size: 5581458 - config_name: winogrande_winogrande_debiased_does_underscore_refer_to features: - name: answer_choices sequence: string - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 2378512 num_examples: 9248 - name: validation num_bytes: 302729 num_examples: 1267 - name: test num_bytes: 452544 num_examples: 1767 download_size: 1125118 dataset_size: 3133785 - config_name: winogrande_winogrande_debiased_does_underscore_refer_to_score_eval features: - name: idx sequence: int32 - name: inputs_pretokenized dtype: string - name: is_correct dtype: bool - name: targets_pretokenized dtype: string - name: weight dtype: float32 splits: - name: train num_bytes: 3986067 num_examples: 18496 - name: validation num_bytes: 530200 num_examples: 2534 - name: test num_bytes: 756077 num_examples: 3534 download_size: 1212731 dataset_size: 5272344 - config_name: winogrande_winogrande_debiased_fill_in_the_blank features: - name: answer_choices sequence: string - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 2513297 num_examples: 9248 - name: validation num_bytes: 321208 num_examples: 1267 - name: test num_bytes: 478398 num_examples: 1767 download_size: 1149729 dataset_size: 3312903 - config_name: winogrande_winogrande_debiased_fill_in_the_blank_score_eval features: - name: idx sequence: int32 - name: inputs_pretokenized dtype: string - name: 
is_correct dtype: bool - name: targets_pretokenized dtype: string - name: weight dtype: float32 splits: - name: train num_bytes: 4255637 num_examples: 18496 - name: validation num_bytes: 567158 num_examples: 2534 - name: test num_bytes: 807785 num_examples: 3534 download_size: 1240495 dataset_size: 5630580 - config_name: winogrande_winogrande_debiased_stand_for features: - name: answer_choices sequence: string - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 2331302 num_examples: 9248 - name: validation num_bytes: 296295 num_examples: 1267 - name: test num_bytes: 443594 num_examples: 1767 download_size: 1132226 dataset_size: 3071191 - config_name: winogrande_winogrande_debiased_stand_for_score_eval features: - name: idx sequence: int32 - name: inputs_pretokenized dtype: string - name: is_correct dtype: bool - name: targets_pretokenized dtype: string - name: weight dtype: float32 splits: - name: train num_bytes: 3891647 num_examples: 18496 - name: validation num_bytes: 517332 num_examples: 2534 - name: test num_bytes: 738177 num_examples: 3534 download_size: 1218916 dataset_size: 5147156 - config_name: winogrande_winogrande_debiased_underscore_refer_to features: - name: answer_choices sequence: string - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 2362950 num_examples: 9248 - name: validation num_bytes: 300567 num_examples: 1267 - name: test num_bytes: 449241 num_examples: 1767 download_size: 1141040 dataset_size: 3112758 - config_name: winogrande_winogrande_debiased_underscore_refer_to_score_eval features: - name: idx sequence: int32 - name: inputs_pretokenized dtype: string - name: is_correct dtype: bool - name: targets_pretokenized dtype: string - name: weight dtype: float32 splits: - name: train num_bytes: 3954943 num_examples: 18496 - name: validation num_bytes: 525876 num_examples: 2534 - name: test num_bytes: 749471 
num_examples: 3534 download_size: 1228522 dataset_size: 5230290 - config_name: winogrande_winogrande_xl_Replace features: - name: answer_choices sequence: string - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 10741385 num_examples: 40398 - name: validation num_bytes: 318674 num_examples: 1267 - name: test num_bytes: 474866 num_examples: 1767 download_size: 3228045 dataset_size: 11534925 - config_name: winogrande_winogrande_xl_Replace_score_eval features: - name: idx sequence: int32 - name: inputs_pretokenized dtype: string - name: is_correct dtype: bool - name: targets_pretokenized dtype: string - name: weight dtype: float32 splits: - name: train num_bytes: 18186622 num_examples: 80796 - name: validation num_bytes: 562090 num_examples: 2534 - name: test num_bytes: 800721 num_examples: 3534 download_size: 3525012 dataset_size: 19549433 - config_name: winogrande_winogrande_xl_does_underscore_refer_to features: - name: answer_choices sequence: string - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 10233503 num_examples: 40398 - name: validation num_bytes: 302729 num_examples: 1267 - name: test num_bytes: 452544 num_examples: 1767 download_size: 3202869 dataset_size: 10988776 - config_name: winogrande_winogrande_xl_does_underscore_refer_to_score_eval features: - name: idx sequence: int32 - name: inputs_pretokenized dtype: string - name: is_correct dtype: bool - name: targets_pretokenized dtype: string - name: weight dtype: float32 splits: - name: train num_bytes: 17170858 num_examples: 80796 - name: validation num_bytes: 530200 num_examples: 2534 - name: test num_bytes: 756077 num_examples: 3534 download_size: 3495469 dataset_size: 18457135 - config_name: winogrande_winogrande_xl_fill_in_the_blank features: - name: answer_choices sequence: string - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string 
splits: - name: train num_bytes: 10822162 num_examples: 40398 - name: validation num_bytes: 321208 num_examples: 1267 - name: test num_bytes: 478398 num_examples: 1767 download_size: 3251155 dataset_size: 11621768 - config_name: winogrande_winogrande_xl_fill_in_the_blank_score_eval features: - name: idx sequence: int32 - name: inputs_pretokenized dtype: string - name: is_correct dtype: bool - name: targets_pretokenized dtype: string - name: weight dtype: float32 splits: - name: train num_bytes: 18348176 num_examples: 80796 - name: validation num_bytes: 567158 num_examples: 2534 - name: test num_bytes: 807785 num_examples: 3534 download_size: 3559359 dataset_size: 19723119 - config_name: winogrande_winogrande_xl_stand_for features: - name: answer_choices sequence: string - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 10027577 num_examples: 40398 - name: validation num_bytes: 296295 num_examples: 1267 - name: test num_bytes: 443594 num_examples: 1767 download_size: 3199335 dataset_size: 10767466 - config_name: winogrande_winogrande_xl_stand_for_score_eval features: - name: idx sequence: int32 - name: inputs_pretokenized dtype: string - name: is_correct dtype: bool - name: targets_pretokenized dtype: string - name: weight dtype: float32 splits: - name: train num_bytes: 16759006 num_examples: 80796 - name: validation num_bytes: 517332 num_examples: 2534 - name: test num_bytes: 738177 num_examples: 3534 download_size: 3490708 dataset_size: 18014515 - config_name: winogrande_winogrande_xl_underscore_refer_to features: - name: answer_choices sequence: string - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 10164596 num_examples: 40398 - name: validation num_bytes: 300567 num_examples: 1267 - name: test num_bytes: 449241 num_examples: 1767 download_size: 3238319 dataset_size: 10914404 - config_name: 
winogrande_winogrande_xl_underscore_refer_to_score_eval features: - name: idx sequence: int32 - name: inputs_pretokenized dtype: string - name: is_correct dtype: bool - name: targets_pretokenized dtype: string - name: weight dtype: float32 splits: - name: train num_bytes: 17033044 num_examples: 80796 - name: validation num_bytes: 525876 num_examples: 2534 - name: test num_bytes: 749471 num_examples: 3534 download_size: 3535036 dataset_size: 18308391 - config_name: wiqa_does_the_supposed_perturbation_have_an_effect features: - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 16606837 num_examples: 29808 - name: validation num_bytes: 3646373 num_examples: 6894 - name: test num_bytes: 1453319 num_examples: 3003 download_size: 7860625 dataset_size: 21706529 - config_name: wiqa_effect_with_label_answer features: - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 15276174 num_examples: 29808 - name: validation num_bytes: 3338021 num_examples: 6894 - name: test num_bytes: 1321769 num_examples: 3003 download_size: 7596498 dataset_size: 19935964 - config_name: wiqa_effect_with_string_answer features: - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 17442183 num_examples: 29808 - name: validation num_bytes: 3838951 num_examples: 6894 - name: test num_bytes: 1538114 num_examples: 3003 download_size: 7965517 dataset_size: 22819248 - config_name: wiqa_what_is_the_final_step_of_the_following_process features: - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 11055108 num_examples: 29808 - name: validation num_bytes: 2393488 num_examples: 6894 - name: test num_bytes: 919963 num_examples: 3003 download_size: 1798047 dataset_size: 14368559 - config_name: wiqa_what_is_the_missing_first_step features: - name: 
inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 11524119 num_examples: 29808 - name: validation num_bytes: 2497447 num_examples: 6894 - name: test num_bytes: 965820 num_examples: 3003 download_size: 1803559 dataset_size: 14987386 - config_name: wiqa_what_might_be_the_first_step_of_the_process features: - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 11315905 num_examples: 29808 - name: validation num_bytes: 2449390 num_examples: 6894 - name: test num_bytes: 944799 num_examples: 3003 download_size: 1804439 dataset_size: 14710094 - config_name: wiqa_what_might_be_the_last_step_of_the_process features: - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 11144532 num_examples: 29808 - name: validation num_bytes: 2414170 num_examples: 6894 - name: test num_bytes: 928972 num_examples: 3003 download_size: 1814891 dataset_size: 14487674 - config_name: wiqa_which_of_the_following_is_the_supposed_perturbation features: - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 18957678 num_examples: 29808 - name: validation num_bytes: 4189565 num_examples: 6894 - name: test num_bytes: 1693819 num_examples: 3003 download_size: 8179468 dataset_size: 24841062 - config_name: xsum_DOC_boils_down_to_simple_idea_that features: - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 371586389 num_examples: 204045 - name: validation num_bytes: 20594956 num_examples: 11332 - name: test num_bytes: 20687186 num_examples: 11334 download_size: 269300769 dataset_size: 412868531 - config_name: xsum_DOC_given_above_write_one_sentence features: - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 379138897 
num_examples: 204045 - name: validation num_bytes: 21014340 num_examples: 11332 - name: test num_bytes: 21106399 num_examples: 11334 download_size: 270338713 dataset_size: 421259636 - config_name: xsum_DOC_how_would_you_rephrase_few_words features: - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 369954546 num_examples: 204045 - name: validation num_bytes: 20504409 num_examples: 11332 - name: test num_bytes: 20596440 num_examples: 11334 download_size: 269019804 dataset_size: 411055395 - config_name: xsum_DOC_tldr features: - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 362915209 num_examples: 204045 - name: validation num_bytes: 20113639 num_examples: 11332 - name: test num_bytes: 20205635 num_examples: 11334 download_size: 268508779 dataset_size: 403234483 - config_name: xsum_DOC_write_summary_of_above features: - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 371792476 num_examples: 204045 - name: validation num_bytes: 20606407 num_examples: 11332 - name: test num_bytes: 20698539 num_examples: 11334 download_size: 269049793 dataset_size: 413097422 - config_name: xsum_article_DOC_summary features: - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 365810922 num_examples: 204045 - name: validation num_bytes: 20275221 num_examples: 11332 - name: test num_bytes: 20365758 num_examples: 11334 download_size: 268441502 dataset_size: 406451901 - config_name: xsum_college_roommate_asked_DOC_so_I_recap features: - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 384238846 num_examples: 204045 - name: validation num_bytes: 21297509 num_examples: 11332 - name: test num_bytes: 21389915 num_examples: 11334 download_size: 271215076 
dataset_size: 426926270 - config_name: xsum_read_below_DOC_write_abstract features: - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 380158330 num_examples: 204045 - name: validation num_bytes: 21070992 num_examples: 11332 - name: test num_bytes: 21163190 num_examples: 11334 download_size: 270293998 dataset_size: 422392512 - config_name: xsum_summarize_DOC features: - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 363565939 num_examples: 204045 - name: validation num_bytes: 20150985 num_examples: 11332 - name: test num_bytes: 20242518 num_examples: 11334 download_size: 268400446 dataset_size: 403959442 - config_name: xsum_summarize_this_DOC_summary features: - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 368965786 num_examples: 204045 - name: validation num_bytes: 20450887 num_examples: 11332 - name: test num_bytes: 20542475 num_examples: 11334 download_size: 269385999 dataset_size: 409959148 - config_name: yelp_review_full_based_on_that features: - name: answer_choices sequence: string - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 580583043 num_examples: 650000 - name: test num_bytes: 44715436 num_examples: 50000 download_size: 340857277 dataset_size: 625298479 - config_name: yelp_review_full_format_rating features: - name: answer_choices sequence: string - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 583826688 num_examples: 650000 - name: test num_bytes: 44964700 num_examples: 50000 download_size: 341987921 dataset_size: 628791388 - config_name: yelp_review_full_format_score features: - name: answer_choices sequence: string - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - 
name: train num_bytes: 576090819 num_examples: 650000 - name: test num_bytes: 44367990 num_examples: 50000 download_size: 342372356 dataset_size: 620458809 - config_name: yelp_review_full_format_star features: - name: answer_choices sequence: string - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 572776745 num_examples: 650000 - name: test num_bytes: 44115095 num_examples: 50000 download_size: 340848272 dataset_size: 616891840 - config_name: yelp_review_full_on_a_scale features: - name: answer_choices sequence: string - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 573518291 num_examples: 650000 - name: test num_bytes: 44166441 num_examples: 50000 download_size: 342749800 dataset_size: 617684732 - config_name: yelp_review_full_so_i_would features: - name: answer_choices sequence: string - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 573422606 num_examples: 650000 - name: test num_bytes: 44166265 num_examples: 50000 download_size: 340243303 dataset_size: 617588871 - config_name: yelp_review_full_this_place features: - name: answer_choices sequence: string - name: inputs_pretokenized dtype: string - name: targets_pretokenized dtype: string splits: - name: train num_bytes: 572825339 num_examples: 650000 - name: test num_bytes: 44118931 num_examples: 50000 download_size: 341483353 dataset_size: 616944270 configs: - config_name: adversarial_qa_dbert_answer_the_following_q data_files: - split: train path: adversarial_qa_dbert_answer_the_following_q/train-* - split: validation path: adversarial_qa_dbert_answer_the_following_q/validation-* - config_name: adversarial_qa_dbert_based_on data_files: - split: train path: adversarial_qa_dbert_based_on/train-* - split: validation path: adversarial_qa_dbert_based_on/validation-* - config_name: 
adversarial_qa_dbert_generate_question data_files: - split: train path: adversarial_qa_dbert_generate_question/train-* - split: validation path: adversarial_qa_dbert_generate_question/validation-* - split: test path: adversarial_qa_dbert_generate_question/test-* - config_name: adversarial_qa_dbert_question_context_answer data_files: - split: train path: adversarial_qa_dbert_question_context_answer/train-* - split: validation path: adversarial_qa_dbert_question_context_answer/validation-* - config_name: adversarial_qa_dbert_tell_what_it_is data_files: - split: train path: adversarial_qa_dbert_tell_what_it_is/train-* - split: validation path: adversarial_qa_dbert_tell_what_it_is/validation-* - config_name: adversarial_qa_dbidaf_answer_the_following_q data_files: - split: train path: adversarial_qa_dbidaf_answer_the_following_q/train-* - split: validation path: adversarial_qa_dbidaf_answer_the_following_q/validation-* - config_name: adversarial_qa_dbidaf_based_on data_files: - split: train path: adversarial_qa_dbidaf_based_on/train-* - split: validation path: adversarial_qa_dbidaf_based_on/validation-* - config_name: adversarial_qa_dbidaf_generate_question data_files: - split: train path: adversarial_qa_dbidaf_generate_question/train-* - split: validation path: adversarial_qa_dbidaf_generate_question/validation-* - split: test path: adversarial_qa_dbidaf_generate_question/test-* - config_name: adversarial_qa_dbidaf_question_context_answer data_files: - split: train path: adversarial_qa_dbidaf_question_context_answer/train-* - split: validation path: adversarial_qa_dbidaf_question_context_answer/validation-* - config_name: adversarial_qa_dbidaf_tell_what_it_is data_files: - split: train path: adversarial_qa_dbidaf_tell_what_it_is/train-* - split: validation path: adversarial_qa_dbidaf_tell_what_it_is/validation-* - config_name: adversarial_qa_droberta_answer_the_following_q data_files: - split: train path: adversarial_qa_droberta_answer_the_following_q/train-* - split: 
validation path: adversarial_qa_droberta_answer_the_following_q/validation-* - config_name: adversarial_qa_droberta_based_on data_files: - split: train path: adversarial_qa_droberta_based_on/train-* - split: validation path: adversarial_qa_droberta_based_on/validation-* - config_name: adversarial_qa_droberta_generate_question data_files: - split: train path: adversarial_qa_droberta_generate_question/train-* - split: validation path: adversarial_qa_droberta_generate_question/validation-* - split: test path: adversarial_qa_droberta_generate_question/test-* - config_name: adversarial_qa_droberta_question_context_answer data_files: - split: train path: adversarial_qa_droberta_question_context_answer/train-* - split: validation path: adversarial_qa_droberta_question_context_answer/validation-* - config_name: adversarial_qa_droberta_tell_what_it_is data_files: - split: train path: adversarial_qa_droberta_tell_what_it_is/train-* - split: validation path: adversarial_qa_droberta_tell_what_it_is/validation-* - config_name: ag_news_classify data_files: - split: train path: ag_news_classify/train-* - split: test path: ag_news_classify/test-* - config_name: ag_news_classify_question_first data_files: - split: train path: ag_news_classify_question_first/train-* - split: test path: ag_news_classify_question_first/test-* - config_name: ag_news_classify_with_choices data_files: - split: train path: ag_news_classify_with_choices/train-* - split: test path: ag_news_classify_with_choices/test-* - config_name: ag_news_classify_with_choices_question_first data_files: - split: train path: ag_news_classify_with_choices_question_first/train-* - split: test path: ag_news_classify_with_choices_question_first/test-* - config_name: ag_news_recommend data_files: - split: train path: ag_news_recommend/train-* - split: test path: ag_news_recommend/test-* - config_name: ag_news_which_section data_files: - split: train path: ag_news_which_section/train-* - split: test path: 
ag_news_which_section/test-* - config_name: ag_news_which_section_choices data_files: - split: train path: ag_news_which_section_choices/train-* - split: test path: ag_news_which_section_choices/test-* - config_name: ai2_arc_ARC_Challenge_heres_a_problem data_files: - split: train path: ai2_arc_ARC_Challenge_heres_a_problem/train-* - split: validation path: ai2_arc_ARC_Challenge_heres_a_problem/validation-* - split: test path: ai2_arc_ARC_Challenge_heres_a_problem/test-* - config_name: ai2_arc_ARC_Challenge_i_am_hesitating data_files: - split: train path: ai2_arc_ARC_Challenge_i_am_hesitating/train-* - split: validation path: ai2_arc_ARC_Challenge_i_am_hesitating/validation-* - split: test path: ai2_arc_ARC_Challenge_i_am_hesitating/test-* - config_name: ai2_arc_ARC_Challenge_multiple_choice data_files: - split: train path: ai2_arc_ARC_Challenge_multiple_choice/train-* - split: validation path: ai2_arc_ARC_Challenge_multiple_choice/validation-* - split: test path: ai2_arc_ARC_Challenge_multiple_choice/test-* - config_name: ai2_arc_ARC_Challenge_pick_false_options data_files: - split: train path: ai2_arc_ARC_Challenge_pick_false_options/train-* - split: validation path: ai2_arc_ARC_Challenge_pick_false_options/validation-* - split: test path: ai2_arc_ARC_Challenge_pick_false_options/test-* - config_name: ai2_arc_ARC_Challenge_pick_the_most_correct_option data_files: - split: train path: ai2_arc_ARC_Challenge_pick_the_most_correct_option/train-* - split: validation path: ai2_arc_ARC_Challenge_pick_the_most_correct_option/validation-* - split: test path: ai2_arc_ARC_Challenge_pick_the_most_correct_option/test-* - config_name: ai2_arc_ARC_Challenge_qa_options data_files: - split: train path: ai2_arc_ARC_Challenge_qa_options/train-* - split: validation path: ai2_arc_ARC_Challenge_qa_options/validation-* - split: test path: ai2_arc_ARC_Challenge_qa_options/test-* - config_name: ai2_arc_ARC_Easy_heres_a_problem data_files: - split: train path: 
ai2_arc_ARC_Easy_heres_a_problem/train-* - split: validation path: ai2_arc_ARC_Easy_heres_a_problem/validation-* - split: test path: ai2_arc_ARC_Easy_heres_a_problem/test-* - config_name: ai2_arc_ARC_Easy_i_am_hesitating data_files: - split: train path: ai2_arc_ARC_Easy_i_am_hesitating/train-* - split: validation path: ai2_arc_ARC_Easy_i_am_hesitating/validation-* - split: test path: ai2_arc_ARC_Easy_i_am_hesitating/test-* - config_name: ai2_arc_ARC_Easy_multiple_choice data_files: - split: train path: ai2_arc_ARC_Easy_multiple_choice/train-* - split: validation path: ai2_arc_ARC_Easy_multiple_choice/validation-* - split: test path: ai2_arc_ARC_Easy_multiple_choice/test-* - config_name: ai2_arc_ARC_Easy_pick_false_options data_files: - split: train path: ai2_arc_ARC_Easy_pick_false_options/train-* - split: validation path: ai2_arc_ARC_Easy_pick_false_options/validation-* - split: test path: ai2_arc_ARC_Easy_pick_false_options/test-* - config_name: ai2_arc_ARC_Easy_pick_the_most_correct_option data_files: - split: train path: ai2_arc_ARC_Easy_pick_the_most_correct_option/train-* - split: validation path: ai2_arc_ARC_Easy_pick_the_most_correct_option/validation-* - split: test path: ai2_arc_ARC_Easy_pick_the_most_correct_option/test-* - config_name: ai2_arc_ARC_Easy_qa_options data_files: - split: train path: ai2_arc_ARC_Easy_qa_options/train-* - split: validation path: ai2_arc_ARC_Easy_qa_options/validation-* - split: test path: ai2_arc_ARC_Easy_qa_options/test-* - config_name: amazon_polarity_Is_this_product_review_positive data_files: - split: train path: amazon_polarity_Is_this_product_review_positive/train-* - split: test path: amazon_polarity_Is_this_product_review_positive/test-* - config_name: amazon_polarity_Is_this_review data_files: - split: train path: amazon_polarity_Is_this_review/train-* - split: test path: amazon_polarity_Is_this_review/test-* - config_name: amazon_polarity_Is_this_review_negative data_files: - split: train path: 
amazon_polarity_Is_this_review_negative/train-* - split: test path: amazon_polarity_Is_this_review_negative/test-* - config_name: amazon_polarity_User_recommend_this_product data_files: - split: train path: amazon_polarity_User_recommend_this_product/train-* - split: test path: amazon_polarity_User_recommend_this_product/test-* - config_name: amazon_polarity_convey_negative_or_positive_sentiment data_files: - split: train path: amazon_polarity_convey_negative_or_positive_sentiment/train-* - split: test path: amazon_polarity_convey_negative_or_positive_sentiment/test-* - config_name: amazon_polarity_flattering_or_not data_files: - split: train path: amazon_polarity_flattering_or_not/train-* - split: test path: amazon_polarity_flattering_or_not/test-* - config_name: amazon_polarity_negative_or_positive_tone data_files: - split: train path: amazon_polarity_negative_or_positive_tone/train-* - split: test path: amazon_polarity_negative_or_positive_tone/test-* - config_name: anli_GPT_3_style_r1 data_files: - split: train path: anli_GPT_3_style_r1/train-* - split: validation path: anli_GPT_3_style_r1/validation-* - split: test path: anli_GPT_3_style_r1/test-* - config_name: anli_GPT_3_style_r1_score_eval data_files: - split: train path: anli_GPT_3_style_r1_score_eval/train-* - split: validation path: anli_GPT_3_style_r1_score_eval/validation-* - split: test path: anli_GPT_3_style_r1_score_eval/test-* - config_name: anli_GPT_3_style_r2 data_files: - split: train path: anli_GPT_3_style_r2/train-* - split: validation path: anli_GPT_3_style_r2/validation-* - split: test path: anli_GPT_3_style_r2/test-* - config_name: anli_GPT_3_style_r2_score_eval data_files: - split: train path: anli_GPT_3_style_r2_score_eval/train-* - split: validation path: anli_GPT_3_style_r2_score_eval/validation-* - split: test path: anli_GPT_3_style_r2_score_eval/test-* - config_name: anli_GPT_3_style_r3 data_files: - split: train path: anli_GPT_3_style_r3/train-* - split: validation path: 
anli_GPT_3_style_r3/validation-* - split: test path: anli_GPT_3_style_r3/test-* - config_name: anli_GPT_3_style_r3_score_eval data_files: - split: train path: anli_GPT_3_style_r3_score_eval/train-* - split: validation path: anli_GPT_3_style_r3_score_eval/validation-* - split: test path: anli_GPT_3_style_r3_score_eval/test-* - config_name: anli_MNLI_crowdsource_r1 data_files: - split: train path: anli_MNLI_crowdsource_r1/train-* - split: validation path: anli_MNLI_crowdsource_r1/validation-* - split: test path: anli_MNLI_crowdsource_r1/test-* - config_name: anli_MNLI_crowdsource_r1_score_eval data_files: - split: train path: anli_MNLI_crowdsource_r1_score_eval/train-* - split: validation path: anli_MNLI_crowdsource_r1_score_eval/validation-* - split: test path: anli_MNLI_crowdsource_r1_score_eval/test-* - config_name: anli_MNLI_crowdsource_r2 data_files: - split: train path: anli_MNLI_crowdsource_r2/train-* - split: validation path: anli_MNLI_crowdsource_r2/validation-* - split: test path: anli_MNLI_crowdsource_r2/test-* - config_name: anli_MNLI_crowdsource_r2_score_eval data_files: - split: train path: anli_MNLI_crowdsource_r2_score_eval/train-* - split: validation path: anli_MNLI_crowdsource_r2_score_eval/validation-* - split: test path: anli_MNLI_crowdsource_r2_score_eval/test-* - config_name: anli_MNLI_crowdsource_r3 data_files: - split: train path: anli_MNLI_crowdsource_r3/train-* - split: validation path: anli_MNLI_crowdsource_r3/validation-* - split: test path: anli_MNLI_crowdsource_r3/test-* - config_name: anli_MNLI_crowdsource_r3_score_eval data_files: - split: train path: anli_MNLI_crowdsource_r3_score_eval/train-* - split: validation path: anli_MNLI_crowdsource_r3_score_eval/validation-* - split: test path: anli_MNLI_crowdsource_r3_score_eval/test-* - config_name: anli_always_sometimes_never_r1 data_files: - split: train path: anli_always_sometimes_never_r1/train-* - split: validation path: anli_always_sometimes_never_r1/validation-* - split: test path: 
anli_always_sometimes_never_r1/test-* - config_name: anli_always_sometimes_never_r1_score_eval data_files: - split: train path: anli_always_sometimes_never_r1_score_eval/train-* - split: validation path: anli_always_sometimes_never_r1_score_eval/validation-* - split: test path: anli_always_sometimes_never_r1_score_eval/test-* - config_name: anli_always_sometimes_never_r2 data_files: - split: train path: anli_always_sometimes_never_r2/train-* - split: validation path: anli_always_sometimes_never_r2/validation-* - split: test path: anli_always_sometimes_never_r2/test-* - config_name: anli_always_sometimes_never_r2_score_eval data_files: - split: train path: anli_always_sometimes_never_r2_score_eval/train-* - split: validation path: anli_always_sometimes_never_r2_score_eval/validation-* - split: test path: anli_always_sometimes_never_r2_score_eval/test-* - config_name: anli_always_sometimes_never_r3 data_files: - split: train path: anli_always_sometimes_never_r3/train-* - split: validation path: anli_always_sometimes_never_r3/validation-* - split: test path: anli_always_sometimes_never_r3/test-* - config_name: anli_always_sometimes_never_r3_score_eval data_files: - split: train path: anli_always_sometimes_never_r3_score_eval/train-* - split: validation path: anli_always_sometimes_never_r3_score_eval/validation-* - split: test path: anli_always_sometimes_never_r3_score_eval/test-* - config_name: anli_based_on_the_previous_passage_r1 data_files: - split: train path: anli_based_on_the_previous_passage_r1/train-* - split: validation path: anli_based_on_the_previous_passage_r1/validation-* - split: test path: anli_based_on_the_previous_passage_r1/test-* - config_name: anli_based_on_the_previous_passage_r1_score_eval data_files: - split: train path: anli_based_on_the_previous_passage_r1_score_eval/train-* - split: validation path: anli_based_on_the_previous_passage_r1_score_eval/validation-* - split: test path: anli_based_on_the_previous_passage_r1_score_eval/test-* - 
config_name: anli_based_on_the_previous_passage_r2 data_files: - split: train path: anli_based_on_the_previous_passage_r2/train-* - split: validation path: anli_based_on_the_previous_passage_r2/validation-* - split: test path: anli_based_on_the_previous_passage_r2/test-* - config_name: anli_based_on_the_previous_passage_r2_score_eval data_files: - split: train path: anli_based_on_the_previous_passage_r2_score_eval/train-* - split: validation path: anli_based_on_the_previous_passage_r2_score_eval/validation-* - split: test path: anli_based_on_the_previous_passage_r2_score_eval/test-* - config_name: anli_based_on_the_previous_passage_r3 data_files: - split: train path: anli_based_on_the_previous_passage_r3/train-* - split: validation path: anli_based_on_the_previous_passage_r3/validation-* - split: test path: anli_based_on_the_previous_passage_r3/test-* - config_name: anli_based_on_the_previous_passage_r3_score_eval data_files: - split: train path: anli_based_on_the_previous_passage_r3_score_eval/train-* - split: validation path: anli_based_on_the_previous_passage_r3_score_eval/validation-* - split: test path: anli_based_on_the_previous_passage_r3_score_eval/test-* - config_name: anli_can_we_infer_r1 data_files: - split: train path: anli_can_we_infer_r1/train-* - split: validation path: anli_can_we_infer_r1/validation-* - split: test path: anli_can_we_infer_r1/test-* - config_name: anli_can_we_infer_r1_score_eval data_files: - split: train path: anli_can_we_infer_r1_score_eval/train-* - split: validation path: anli_can_we_infer_r1_score_eval/validation-* - split: test path: anli_can_we_infer_r1_score_eval/test-* - config_name: anli_can_we_infer_r2 data_files: - split: train path: anli_can_we_infer_r2/train-* - split: validation path: anli_can_we_infer_r2/validation-* - split: test path: anli_can_we_infer_r2/test-* - config_name: anli_can_we_infer_r2_score_eval data_files: - split: train path: anli_can_we_infer_r2_score_eval/train-* - split: validation path: 
anli_can_we_infer_r2_score_eval/validation-* - split: test path: anli_can_we_infer_r2_score_eval/test-* - config_name: anli_can_we_infer_r3 data_files: - split: train path: anli_can_we_infer_r3/train-* - split: validation path: anli_can_we_infer_r3/validation-* - split: test path: anli_can_we_infer_r3/test-* - config_name: anli_can_we_infer_r3_score_eval data_files: - split: train path: anli_can_we_infer_r3_score_eval/train-* - split: validation path: anli_can_we_infer_r3_score_eval/validation-* - split: test path: anli_can_we_infer_r3_score_eval/test-* - config_name: anli_claim_true_false_inconclusive_r1_score_eval data_files: - split: train path: anli_claim_true_false_inconclusive_r1_score_eval/train-* - split: validation path: anli_claim_true_false_inconclusive_r1_score_eval/validation-* - split: test path: anli_claim_true_false_inconclusive_r1_score_eval/test-* - config_name: anli_claim_true_false_inconclusive_r2 data_files: - split: train path: anli_claim_true_false_inconclusive_r2/train-* - split: validation path: anli_claim_true_false_inconclusive_r2/validation-* - split: test path: anli_claim_true_false_inconclusive_r2/test-* - config_name: anli_claim_true_false_inconclusive_r2_score_eval data_files: - split: train path: anli_claim_true_false_inconclusive_r2_score_eval/train-* - split: validation path: anli_claim_true_false_inconclusive_r2_score_eval/validation-* - split: test path: anli_claim_true_false_inconclusive_r2_score_eval/test-* - config_name: anli_claim_true_false_inconclusive_r3 data_files: - split: train path: anli_claim_true_false_inconclusive_r3/train-* - split: validation path: anli_claim_true_false_inconclusive_r3/validation-* - split: test path: anli_claim_true_false_inconclusive_r3/test-* - config_name: anli_claim_true_false_inconclusive_r3_score_eval data_files: - split: train path: anli_claim_true_false_inconclusive_r3_score_eval/train-* - split: validation path: anli_claim_true_false_inconclusive_r3_score_eval/validation-* - split: test 
path: anli_claim_true_false_inconclusive_r3_score_eval/test-* - config_name: anli_consider_always_sometimes_never_r1 data_files: - split: train path: anli_consider_always_sometimes_never_r1/train-* - split: validation path: anli_consider_always_sometimes_never_r1/validation-* - split: test path: anli_consider_always_sometimes_never_r1/test-* - config_name: anli_consider_always_sometimes_never_r1_score_eval data_files: - split: train path: anli_consider_always_sometimes_never_r1_score_eval/train-* - split: validation path: anli_consider_always_sometimes_never_r1_score_eval/validation-* - split: test path: anli_consider_always_sometimes_never_r1_score_eval/test-* - config_name: anli_consider_always_sometimes_never_r2 data_files: - split: train path: anli_consider_always_sometimes_never_r2/train-* - split: validation path: anli_consider_always_sometimes_never_r2/validation-* - split: test path: anli_consider_always_sometimes_never_r2/test-* - config_name: anli_consider_always_sometimes_never_r2_score_eval data_files: - split: train path: anli_consider_always_sometimes_never_r2_score_eval/train-* - split: validation path: anli_consider_always_sometimes_never_r2_score_eval/validation-* - split: test path: anli_consider_always_sometimes_never_r2_score_eval/test-* - config_name: anli_consider_always_sometimes_never_r3 data_files: - split: train path: anli_consider_always_sometimes_never_r3/train-* - split: validation path: anli_consider_always_sometimes_never_r3/validation-* - split: test path: anli_consider_always_sometimes_never_r3/test-* - config_name: anli_does_it_follow_that_r1 data_files: - split: train path: anli_does_it_follow_that_r1/train-* - split: validation path: anli_does_it_follow_that_r1/validation-* - split: test path: anli_does_it_follow_that_r1/test-* - config_name: anli_does_it_follow_that_r1_score_eval data_files: - split: train path: anli_does_it_follow_that_r1_score_eval/train-* - split: validation path: 
anli_does_it_follow_that_r1_score_eval/validation-* - split: test path: anli_does_it_follow_that_r1_score_eval/test-* - config_name: anli_does_it_follow_that_r2 data_files: - split: train path: anli_does_it_follow_that_r2/train-* - split: validation path: anli_does_it_follow_that_r2/validation-* - split: test path: anli_does_it_follow_that_r2/test-* - config_name: anli_does_it_follow_that_r2_score_eval data_files: - split: train path: anli_does_it_follow_that_r2_score_eval/train-* - split: validation path: anli_does_it_follow_that_r2_score_eval/validation-* - split: test path: anli_does_it_follow_that_r2_score_eval/test-* - config_name: anli_does_it_follow_that_r3 data_files: - split: train path: anli_does_it_follow_that_r3/train-* - split: validation path: anli_does_it_follow_that_r3/validation-* - split: test path: anli_does_it_follow_that_r3/test-* - config_name: anli_does_it_follow_that_r3_score_eval data_files: - split: train path: anli_does_it_follow_that_r3_score_eval/train-* - split: validation path: anli_does_it_follow_that_r3_score_eval/validation-* - split: test path: anli_does_it_follow_that_r3_score_eval/test-* - config_name: anli_does_this_imply_r1 data_files: - split: train path: anli_does_this_imply_r1/train-* - split: validation path: anli_does_this_imply_r1/validation-* - split: test path: anli_does_this_imply_r1/test-* - config_name: anli_does_this_imply_r1_score_eval data_files: - split: train path: anli_does_this_imply_r1_score_eval/train-* - split: validation path: anli_does_this_imply_r1_score_eval/validation-* - split: test path: anli_does_this_imply_r1_score_eval/test-* - config_name: anli_does_this_imply_r2 data_files: - split: train path: anli_does_this_imply_r2/train-* - split: validation path: anli_does_this_imply_r2/validation-* - split: test path: anli_does_this_imply_r2/test-* - config_name: anli_does_this_imply_r2_score_eval data_files: - split: train path: anli_does_this_imply_r2_score_eval/train-* - split: validation path: 
anli_does_this_imply_r2_score_eval/validation-* - split: test path: anli_does_this_imply_r2_score_eval/test-* - config_name: anli_does_this_imply_r3 data_files: - split: train path: anli_does_this_imply_r3/train-* - split: validation path: anli_does_this_imply_r3/validation-* - split: test path: anli_does_this_imply_r3/test-* - config_name: anli_does_this_imply_r3_score_eval data_files: - split: train path: anli_does_this_imply_r3_score_eval/train-* - split: validation path: anli_does_this_imply_r3_score_eval/validation-* - split: test path: anli_does_this_imply_r3_score_eval/test-* - config_name: anli_guaranteed_possible_impossible_r1 data_files: - split: train path: anli_guaranteed_possible_impossible_r1/train-* - split: validation path: anli_guaranteed_possible_impossible_r1/validation-* - split: test path: anli_guaranteed_possible_impossible_r1/test-* - config_name: anli_guaranteed_possible_impossible_r1_score_eval data_files: - split: train path: anli_guaranteed_possible_impossible_r1_score_eval/train-* - split: validation path: anli_guaranteed_possible_impossible_r1_score_eval/validation-* - split: test path: anli_guaranteed_possible_impossible_r1_score_eval/test-* - config_name: anli_guaranteed_possible_impossible_r2_score_eval data_files: - split: train path: anli_guaranteed_possible_impossible_r2_score_eval/train-* - split: validation path: anli_guaranteed_possible_impossible_r2_score_eval/validation-* - split: test path: anli_guaranteed_possible_impossible_r2_score_eval/test-* - config_name: anli_guaranteed_possible_impossible_r3 data_files: - split: train path: anli_guaranteed_possible_impossible_r3/train-* - split: validation path: anli_guaranteed_possible_impossible_r3/validation-* - split: test path: anli_guaranteed_possible_impossible_r3/test-* - config_name: anli_guaranteed_possible_impossible_r3_score_eval data_files: - split: train path: anli_guaranteed_possible_impossible_r3_score_eval/train-* - split: validation path: 
anli_guaranteed_possible_impossible_r3_score_eval/validation-* - split: test path: anli_guaranteed_possible_impossible_r3_score_eval/test-* - config_name: anli_guaranteed_true_r1_score_eval data_files: - split: train path: anli_guaranteed_true_r1_score_eval/train-* - split: validation path: anli_guaranteed_true_r1_score_eval/validation-* - split: test path: anli_guaranteed_true_r1_score_eval/test-* - config_name: anli_guaranteed_true_r2 data_files: - split: train path: anli_guaranteed_true_r2/train-* - split: validation path: anli_guaranteed_true_r2/validation-* - split: test path: anli_guaranteed_true_r2/test-* - config_name: anli_guaranteed_true_r2_score_eval data_files: - split: train path: anli_guaranteed_true_r2_score_eval/train-* - split: validation path: anli_guaranteed_true_r2_score_eval/validation-* - split: test path: anli_guaranteed_true_r2_score_eval/test-* - config_name: anli_guaranteed_true_r3 data_files: - split: train path: anli_guaranteed_true_r3/train-* - split: validation path: anli_guaranteed_true_r3/validation-* - split: test path: anli_guaranteed_true_r3/test-* - config_name: anli_guaranteed_true_r3_score_eval data_files: - split: train path: anli_guaranteed_true_r3_score_eval/train-* - split: validation path: anli_guaranteed_true_r3_score_eval/validation-* - split: test path: anli_guaranteed_true_r3_score_eval/test-* - config_name: anli_justified_in_saying_r1 data_files: - split: train path: anli_justified_in_saying_r1/train-* - split: validation path: anli_justified_in_saying_r1/validation-* - split: test path: anli_justified_in_saying_r1/test-* - config_name: anli_justified_in_saying_r1_score_eval data_files: - split: train path: anli_justified_in_saying_r1_score_eval/train-* - split: validation path: anli_justified_in_saying_r1_score_eval/validation-* - split: test path: anli_justified_in_saying_r1_score_eval/test-* - config_name: anli_justified_in_saying_r2 data_files: - split: train path: anli_justified_in_saying_r2/train-* - split: 
validation path: anli_justified_in_saying_r2/validation-* - split: test path: anli_justified_in_saying_r2/test-* - config_name: anli_justified_in_saying_r2_score_eval data_files: - split: train path: anli_justified_in_saying_r2_score_eval/train-* - split: validation path: anli_justified_in_saying_r2_score_eval/validation-* - split: test path: anli_justified_in_saying_r2_score_eval/test-* - config_name: anli_justified_in_saying_r3 data_files: - split: train path: anli_justified_in_saying_r3/train-* - split: validation path: anli_justified_in_saying_r3/validation-* - split: test path: anli_justified_in_saying_r3/test-* - config_name: anli_justified_in_saying_r3_score_eval data_files: - split: train path: anli_justified_in_saying_r3_score_eval/train-* - split: validation path: anli_justified_in_saying_r3_score_eval/validation-* - split: test path: anli_justified_in_saying_r3_score_eval/test-* - config_name: anli_must_be_true_r1 data_files: - split: train path: anli_must_be_true_r1/train-* - split: validation path: anli_must_be_true_r1/validation-* - split: test path: anli_must_be_true_r1/test-* - config_name: anli_must_be_true_r1_score_eval data_files: - split: train path: anli_must_be_true_r1_score_eval/train-* - split: validation path: anli_must_be_true_r1_score_eval/validation-* - split: test path: anli_must_be_true_r1_score_eval/test-* - config_name: anli_must_be_true_r2 data_files: - split: train path: anli_must_be_true_r2/train-* - split: validation path: anli_must_be_true_r2/validation-* - split: test path: anli_must_be_true_r2/test-* - config_name: anli_must_be_true_r2_score_eval data_files: - split: train path: anli_must_be_true_r2_score_eval/train-* - split: validation path: anli_must_be_true_r2_score_eval/validation-* - split: test path: anli_must_be_true_r2_score_eval/test-* - config_name: anli_must_be_true_r3_score_eval data_files: - split: train path: anli_must_be_true_r3_score_eval/train-* - split: validation path: 
anli_must_be_true_r3_score_eval/validation-* - split: test path: anli_must_be_true_r3_score_eval/test-* - config_name: anli_should_assume_r1 data_files: - split: train path: anli_should_assume_r1/train-* - split: validation path: anli_should_assume_r1/validation-* - split: test path: anli_should_assume_r1/test-* - config_name: anli_should_assume_r1_score_eval data_files: - split: train path: anli_should_assume_r1_score_eval/train-* - split: validation path: anli_should_assume_r1_score_eval/validation-* - split: test path: anli_should_assume_r1_score_eval/test-* - config_name: anli_should_assume_r2 data_files: - split: train path: anli_should_assume_r2/train-* - split: validation path: anli_should_assume_r2/validation-* - split: test path: anli_should_assume_r2/test-* - config_name: anli_should_assume_r2_score_eval data_files: - split: train path: anli_should_assume_r2_score_eval/train-* - split: validation path: anli_should_assume_r2_score_eval/validation-* - split: test path: anli_should_assume_r2_score_eval/test-* - config_name: anli_should_assume_r3 data_files: - split: train path: anli_should_assume_r3/train-* - split: validation path: anli_should_assume_r3/validation-* - split: test path: anli_should_assume_r3/test-* - config_name: anli_should_assume_r3_score_eval data_files: - split: train path: anli_should_assume_r3_score_eval/train-* - split: validation path: anli_should_assume_r3_score_eval/validation-* - split: test path: anli_should_assume_r3_score_eval/test-* - config_name: anli_take_the_following_as_truth_r1 data_files: - split: train path: anli_take_the_following_as_truth_r1/train-* - split: validation path: anli_take_the_following_as_truth_r1/validation-* - split: test path: anli_take_the_following_as_truth_r1/test-* - config_name: anli_take_the_following_as_truth_r1_score_eval data_files: - split: train path: anli_take_the_following_as_truth_r1_score_eval/train-* - split: validation path: anli_take_the_following_as_truth_r1_score_eval/validation-* - 
split: test path: anli_take_the_following_as_truth_r1_score_eval/test-* - config_name: anli_take_the_following_as_truth_r2 data_files: - split: train path: anli_take_the_following_as_truth_r2/train-* - split: validation path: anli_take_the_following_as_truth_r2/validation-* - split: test path: anli_take_the_following_as_truth_r2/test-* - config_name: anli_take_the_following_as_truth_r2_score_eval data_files: - split: train path: anli_take_the_following_as_truth_r2_score_eval/train-* - split: validation path: anli_take_the_following_as_truth_r2_score_eval/validation-* - split: test path: anli_take_the_following_as_truth_r2_score_eval/test-* - config_name: anli_take_the_following_as_truth_r3 data_files: - split: train path: anli_take_the_following_as_truth_r3/train-* - split: validation path: anli_take_the_following_as_truth_r3/validation-* - split: test path: anli_take_the_following_as_truth_r3/test-* - config_name: anli_take_the_following_as_truth_r3_score_eval data_files: - split: train path: anli_take_the_following_as_truth_r3_score_eval/train-* - split: validation path: anli_take_the_following_as_truth_r3_score_eval/validation-* - split: test path: anli_take_the_following_as_truth_r3_score_eval/test-* - config_name: app_reviews_categorize_rating_using_review data_files: - split: train path: app_reviews_categorize_rating_using_review/train-* - config_name: app_reviews_convert_to_rating data_files: - split: train path: app_reviews_convert_to_rating/train-* - config_name: app_reviews_convert_to_star_rating data_files: - split: train path: app_reviews_convert_to_star_rating/train-* - config_name: app_reviews_generate_review data_files: - split: train path: app_reviews_generate_review/train-* - config_name: cnn_dailymail_3.0.0_generate_story data_files: - split: train path: cnn_dailymail_3.0.0_generate_story/train-* - split: validation path: cnn_dailymail_3.0.0_generate_story/validation-* - split: test path: cnn_dailymail_3.0.0_generate_story/test-* - config_name: 
cnn_dailymail_3.0.0_news_card_view data_files: - split: train path: cnn_dailymail_3.0.0_news_card_view/train-* - split: validation path: cnn_dailymail_3.0.0_news_card_view/validation-* - split: test path: cnn_dailymail_3.0.0_news_card_view/test-* - config_name: cnn_dailymail_3.0.0_news_stock data_files: - split: train path: cnn_dailymail_3.0.0_news_stock/train-* - split: validation path: cnn_dailymail_3.0.0_news_stock/validation-* - split: test path: cnn_dailymail_3.0.0_news_stock/test-* - config_name: cnn_dailymail_3.0.0_spice_up_story data_files: - split: train path: cnn_dailymail_3.0.0_spice_up_story/train-* - split: validation path: cnn_dailymail_3.0.0_spice_up_story/validation-* - split: test path: cnn_dailymail_3.0.0_spice_up_story/test-* - config_name: cnn_dailymail_3.0.0_sum_in_brief data_files: - split: train path: cnn_dailymail_3.0.0_sum_in_brief/train-* - split: validation path: cnn_dailymail_3.0.0_sum_in_brief/validation-* - split: test path: cnn_dailymail_3.0.0_sum_in_brief/test-* - config_name: wiki_hop_original_generate_subject_and_object data_files: - split: train path: wiki_hop_original_generate_subject_and_object/train-* - split: validation path: wiki_hop_original_generate_subject_and_object/validation-* - config_name: wiki_qa_Decide_good_answer data_files: - split: train path: wiki_qa_Decide_good_answer/train-* - split: validation path: wiki_qa_Decide_good_answer/validation-* - split: test path: wiki_qa_Decide_good_answer/test-* - config_name: wiki_qa_Direct_Answer_to_Question data_files: - split: train path: wiki_qa_Direct_Answer_to_Question/train-* - split: validation path: wiki_qa_Direct_Answer_to_Question/validation-* - split: test path: wiki_qa_Direct_Answer_to_Question/test-* - config_name: wiki_qa_Generate_Question_from_Topic data_files: - split: train path: wiki_qa_Generate_Question_from_Topic/train-* - split: validation path: wiki_qa_Generate_Question_from_Topic/validation-* - split: test path: wiki_qa_Generate_Question_from_Topic/test-* 
- config_name: wiki_qa_Is_This_True_ data_files: - split: train path: wiki_qa_Is_This_True_/train-* - split: validation path: wiki_qa_Is_This_True_/validation-* - split: test path: wiki_qa_Is_This_True_/test-* - config_name: wiki_qa_Jeopardy_style data_files: - split: train path: wiki_qa_Jeopardy_style/train-* - split: validation path: wiki_qa_Jeopardy_style/validation-* - split: test path: wiki_qa_Jeopardy_style/test-* - config_name: wiki_qa_Topic_Prediction_Answer_Only data_files: - split: train path: wiki_qa_Topic_Prediction_Answer_Only/train-* - split: validation path: wiki_qa_Topic_Prediction_Answer_Only/validation-* - split: test path: wiki_qa_Topic_Prediction_Answer_Only/test-* - config_name: wiki_qa_Topic_Prediction_Question_Only data_files: - split: train path: wiki_qa_Topic_Prediction_Question_Only/train-* - split: validation path: wiki_qa_Topic_Prediction_Question_Only/validation-* - split: test path: wiki_qa_Topic_Prediction_Question_Only/test-* - config_name: wiki_qa_Topic_Prediction_Question_and_Answer_Pair data_files: - split: train path: wiki_qa_Topic_Prediction_Question_and_Answer_Pair/train-* - split: validation path: wiki_qa_Topic_Prediction_Question_and_Answer_Pair/validation-* - split: test path: wiki_qa_Topic_Prediction_Question_and_Answer_Pair/test-* - config_name: wiki_qa_automatic_system data_files: - split: train path: wiki_qa_automatic_system/train-* - split: validation path: wiki_qa_automatic_system/validation-* - split: test path: wiki_qa_automatic_system/test-* - config_name: wiki_qa_exercise data_files: - split: train path: wiki_qa_exercise/train-* - split: validation path: wiki_qa_exercise/validation-* - split: test path: wiki_qa_exercise/test-* - config_name: wiki_qa_found_on_google data_files: - split: train path: wiki_qa_found_on_google/train-* - split: validation path: wiki_qa_found_on_google/validation-* - split: test path: wiki_qa_found_on_google/test-* - config_name: winogrande_winogrande_debiased_Replace data_files: - split: 
train path: winogrande_winogrande_debiased_Replace/train-* - split: validation path: winogrande_winogrande_debiased_Replace/validation-* - split: test path: winogrande_winogrande_debiased_Replace/test-* - config_name: winogrande_winogrande_debiased_Replace_score_eval data_files: - split: train path: winogrande_winogrande_debiased_Replace_score_eval/train-* - split: validation path: winogrande_winogrande_debiased_Replace_score_eval/validation-* - split: test path: winogrande_winogrande_debiased_Replace_score_eval/test-* - config_name: winogrande_winogrande_debiased_does_underscore_refer_to data_files: - split: train path: winogrande_winogrande_debiased_does_underscore_refer_to/train-* - split: validation path: winogrande_winogrande_debiased_does_underscore_refer_to/validation-* - split: test path: winogrande_winogrande_debiased_does_underscore_refer_to/test-* - config_name: winogrande_winogrande_debiased_does_underscore_refer_to_score_eval data_files: - split: train path: winogrande_winogrande_debiased_does_underscore_refer_to_score_eval/train-* - split: validation path: winogrande_winogrande_debiased_does_underscore_refer_to_score_eval/validation-* - split: test path: winogrande_winogrande_debiased_does_underscore_refer_to_score_eval/test-* - config_name: winogrande_winogrande_debiased_fill_in_the_blank data_files: - split: train path: winogrande_winogrande_debiased_fill_in_the_blank/train-* - split: validation path: winogrande_winogrande_debiased_fill_in_the_blank/validation-* - split: test path: winogrande_winogrande_debiased_fill_in_the_blank/test-* - config_name: winogrande_winogrande_debiased_fill_in_the_blank_score_eval data_files: - split: train path: winogrande_winogrande_debiased_fill_in_the_blank_score_eval/train-* - split: validation path: winogrande_winogrande_debiased_fill_in_the_blank_score_eval/validation-* - split: test path: winogrande_winogrande_debiased_fill_in_the_blank_score_eval/test-* - config_name: winogrande_winogrande_debiased_stand_for 
data_files: - split: train path: winogrande_winogrande_debiased_stand_for/train-* - split: validation path: winogrande_winogrande_debiased_stand_for/validation-* - split: test path: winogrande_winogrande_debiased_stand_for/test-* - config_name: winogrande_winogrande_debiased_stand_for_score_eval data_files: - split: train path: winogrande_winogrande_debiased_stand_for_score_eval/train-* - split: validation path: winogrande_winogrande_debiased_stand_for_score_eval/validation-* - split: test path: winogrande_winogrande_debiased_stand_for_score_eval/test-* - config_name: winogrande_winogrande_debiased_underscore_refer_to data_files: - split: train path: winogrande_winogrande_debiased_underscore_refer_to/train-* - split: validation path: winogrande_winogrande_debiased_underscore_refer_to/validation-* - split: test path: winogrande_winogrande_debiased_underscore_refer_to/test-* - config_name: winogrande_winogrande_debiased_underscore_refer_to_score_eval data_files: - split: train path: winogrande_winogrande_debiased_underscore_refer_to_score_eval/train-* - split: validation path: winogrande_winogrande_debiased_underscore_refer_to_score_eval/validation-* - split: test path: winogrande_winogrande_debiased_underscore_refer_to_score_eval/test-* - config_name: winogrande_winogrande_xl_Replace data_files: - split: train path: winogrande_winogrande_xl_Replace/train-* - split: validation path: winogrande_winogrande_xl_Replace/validation-* - split: test path: winogrande_winogrande_xl_Replace/test-* - config_name: winogrande_winogrande_xl_Replace_score_eval data_files: - split: train path: winogrande_winogrande_xl_Replace_score_eval/train-* - split: validation path: winogrande_winogrande_xl_Replace_score_eval/validation-* - split: test path: winogrande_winogrande_xl_Replace_score_eval/test-* - config_name: winogrande_winogrande_xl_does_underscore_refer_to data_files: - split: train path: winogrande_winogrande_xl_does_underscore_refer_to/train-* - split: validation path: 
winogrande_winogrande_xl_does_underscore_refer_to/validation-* - split: test path: winogrande_winogrande_xl_does_underscore_refer_to/test-* - config_name: winogrande_winogrande_xl_does_underscore_refer_to_score_eval data_files: - split: train path: winogrande_winogrande_xl_does_underscore_refer_to_score_eval/train-* - split: validation path: winogrande_winogrande_xl_does_underscore_refer_to_score_eval/validation-* - split: test path: winogrande_winogrande_xl_does_underscore_refer_to_score_eval/test-* - config_name: winogrande_winogrande_xl_fill_in_the_blank data_files: - split: train path: winogrande_winogrande_xl_fill_in_the_blank/train-* - split: validation path: winogrande_winogrande_xl_fill_in_the_blank/validation-* - split: test path: winogrande_winogrande_xl_fill_in_the_blank/test-* - config_name: winogrande_winogrande_xl_fill_in_the_blank_score_eval data_files: - split: train path: winogrande_winogrande_xl_fill_in_the_blank_score_eval/train-* - split: validation path: winogrande_winogrande_xl_fill_in_the_blank_score_eval/validation-* - split: test path: winogrande_winogrande_xl_fill_in_the_blank_score_eval/test-* - config_name: winogrande_winogrande_xl_stand_for data_files: - split: train path: winogrande_winogrande_xl_stand_for/train-* - split: validation path: winogrande_winogrande_xl_stand_for/validation-* - split: test path: winogrande_winogrande_xl_stand_for/test-* - config_name: winogrande_winogrande_xl_stand_for_score_eval data_files: - split: train path: winogrande_winogrande_xl_stand_for_score_eval/train-* - split: validation path: winogrande_winogrande_xl_stand_for_score_eval/validation-* - split: test path: winogrande_winogrande_xl_stand_for_score_eval/test-* - config_name: winogrande_winogrande_xl_underscore_refer_to data_files: - split: train path: winogrande_winogrande_xl_underscore_refer_to/train-* - split: validation path: winogrande_winogrande_xl_underscore_refer_to/validation-* - split: test path: 
winogrande_winogrande_xl_underscore_refer_to/test-* - config_name: winogrande_winogrande_xl_underscore_refer_to_score_eval data_files: - split: train path: winogrande_winogrande_xl_underscore_refer_to_score_eval/train-* - split: validation path: winogrande_winogrande_xl_underscore_refer_to_score_eval/validation-* - split: test path: winogrande_winogrande_xl_underscore_refer_to_score_eval/test-* - config_name: wiqa_does_the_supposed_perturbation_have_an_effect data_files: - split: train path: wiqa_does_the_supposed_perturbation_have_an_effect/train-* - split: validation path: wiqa_does_the_supposed_perturbation_have_an_effect/validation-* - split: test path: wiqa_does_the_supposed_perturbation_have_an_effect/test-* - config_name: wiqa_effect_with_label_answer data_files: - split: train path: wiqa_effect_with_label_answer/train-* - split: validation path: wiqa_effect_with_label_answer/validation-* - split: test path: wiqa_effect_with_label_answer/test-* - config_name: wiqa_effect_with_string_answer data_files: - split: train path: wiqa_effect_with_string_answer/train-* - split: validation path: wiqa_effect_with_string_answer/validation-* - split: test path: wiqa_effect_with_string_answer/test-* - config_name: wiqa_what_is_the_final_step_of_the_following_process data_files: - split: train path: wiqa_what_is_the_final_step_of_the_following_process/train-* - split: validation path: wiqa_what_is_the_final_step_of_the_following_process/validation-* - split: test path: wiqa_what_is_the_final_step_of_the_following_process/test-* - config_name: wiqa_what_is_the_missing_first_step data_files: - split: train path: wiqa_what_is_the_missing_first_step/train-* - split: validation path: wiqa_what_is_the_missing_first_step/validation-* - split: test path: wiqa_what_is_the_missing_first_step/test-* - config_name: wiqa_what_might_be_the_first_step_of_the_process data_files: - split: train path: wiqa_what_might_be_the_first_step_of_the_process/train-* - split: validation path: 
wiqa_what_might_be_the_first_step_of_the_process/validation-* - split: test path: wiqa_what_might_be_the_first_step_of_the_process/test-* - config_name: wiqa_what_might_be_the_last_step_of_the_process data_files: - split: train path: wiqa_what_might_be_the_last_step_of_the_process/train-* - split: validation path: wiqa_what_might_be_the_last_step_of_the_process/validation-* - split: test path: wiqa_what_might_be_the_last_step_of_the_process/test-* - config_name: wiqa_which_of_the_following_is_the_supposed_perturbation data_files: - split: train path: wiqa_which_of_the_following_is_the_supposed_perturbation/train-* - split: validation path: wiqa_which_of_the_following_is_the_supposed_perturbation/validation-* - split: test path: wiqa_which_of_the_following_is_the_supposed_perturbation/test-* - config_name: xsum_DOC_boils_down_to_simple_idea_that data_files: - split: train path: xsum_DOC_boils_down_to_simple_idea_that/train-* - split: validation path: xsum_DOC_boils_down_to_simple_idea_that/validation-* - split: test path: xsum_DOC_boils_down_to_simple_idea_that/test-* - config_name: xsum_DOC_given_above_write_one_sentence data_files: - split: train path: xsum_DOC_given_above_write_one_sentence/train-* - split: validation path: xsum_DOC_given_above_write_one_sentence/validation-* - split: test path: xsum_DOC_given_above_write_one_sentence/test-* - config_name: xsum_DOC_how_would_you_rephrase_few_words data_files: - split: train path: xsum_DOC_how_would_you_rephrase_few_words/train-* - split: validation path: xsum_DOC_how_would_you_rephrase_few_words/validation-* - split: test path: xsum_DOC_how_would_you_rephrase_few_words/test-* - config_name: xsum_DOC_tldr data_files: - split: train path: xsum_DOC_tldr/train-* - split: validation path: xsum_DOC_tldr/validation-* - split: test path: xsum_DOC_tldr/test-* - config_name: xsum_DOC_write_summary_of_above data_files: - split: train path: xsum_DOC_write_summary_of_above/train-* - split: validation path: 
xsum_DOC_write_summary_of_above/validation-* - split: test path: xsum_DOC_write_summary_of_above/test-* - config_name: xsum_article_DOC_summary data_files: - split: train path: xsum_article_DOC_summary/train-* - split: validation path: xsum_article_DOC_summary/validation-* - split: test path: xsum_article_DOC_summary/test-* - config_name: xsum_college_roommate_asked_DOC_so_I_recap data_files: - split: train path: xsum_college_roommate_asked_DOC_so_I_recap/train-* - split: validation path: xsum_college_roommate_asked_DOC_so_I_recap/validation-* - split: test path: xsum_college_roommate_asked_DOC_so_I_recap/test-* - config_name: xsum_read_below_DOC_write_abstract data_files: - split: train path: xsum_read_below_DOC_write_abstract/train-* - split: validation path: xsum_read_below_DOC_write_abstract/validation-* - split: test path: xsum_read_below_DOC_write_abstract/test-* - config_name: xsum_summarize_DOC data_files: - split: train path: xsum_summarize_DOC/train-* - split: validation path: xsum_summarize_DOC/validation-* - split: test path: xsum_summarize_DOC/test-* - config_name: xsum_summarize_this_DOC_summary data_files: - split: train path: xsum_summarize_this_DOC_summary/train-* - split: validation path: xsum_summarize_this_DOC_summary/validation-* - split: test path: xsum_summarize_this_DOC_summary/test-* - config_name: yelp_review_full_based_on_that data_files: - split: train path: yelp_review_full_based_on_that/train-* - split: test path: yelp_review_full_based_on_that/test-* - config_name: yelp_review_full_format_rating data_files: - split: train path: yelp_review_full_format_rating/train-* - split: test path: yelp_review_full_format_rating/test-* - config_name: yelp_review_full_format_score data_files: - split: train path: yelp_review_full_format_score/train-* - split: test path: yelp_review_full_format_score/test-* - config_name: yelp_review_full_format_star data_files: - split: train path: yelp_review_full_format_star/train-* - split: test path: 
yelp_review_full_format_star/test-* - config_name: yelp_review_full_on_a_scale data_files: - split: train path: yelp_review_full_on_a_scale/train-* - split: test path: yelp_review_full_on_a_scale/test-* - config_name: yelp_review_full_so_i_would data_files: - split: train path: yelp_review_full_so_i_would/train-* - split: test path: yelp_review_full_so_i_would/test-* - config_name: yelp_review_full_this_place data_files: - split: train path: yelp_review_full_this_place/train-* - split: test path: yelp_review_full_this_place/test-* language: - lv ---

This is an automatically translated version of [P3 (Public Pool of Prompts)](https://huggingface.co/datasets/bigscience/P3), produced with [quickmt-en-lv](https://huggingface.co/quickmt/quickmt-en-lv).

### Languages

The data in P3-Latvian-Full are in Latvian (BCP-47 `lv`).

## Dataset Structure

### Data Instances

An example of "train" looks as follows:

```bash
{
  'answer_choices': ['mobilais tālrunis', 'televīzija', 'ledusskapis', 'lidmašīna'],
  'inputs_pretokenized': 'Kura tehnoloģija tika izstrādāta pavisam nesen? Iespējas: - mobilais tālrunis - televizors - ledusskapis - lidmašīna',
  'targets_pretokenized': 'mobilais tālrunis'
}
```

In the case of rank classification (letting the model select as its prediction the option with the highest log-likelihood), an example looks as follows:

```bash
{
  'idx': [5, 0],
  'inputs_pretokenized': 'Es zinu, ka atbilde uz jautājumu "Ko CBS darīja otro reizi?" ir "1989. gadā CBS Records atkārtoti iekļāva mūzikas izdevējdarbības biznesu, iegādājoties Nashville mūzikas izdevēju Tree International Publishing par vairāk nekā 30 miljoniem ASV dolāru. ". Vai jūs varat man pateikt, kas tas ir?',
  'is_correct': True,
  'targets_pretokenized': 'atgriezās mūzikas izdevējdarbības biznesā',
  'weight': 1.0
}
```

### Data Fields

The data fields are the same across all splits:

- `answer_choices`: the choices (in natural language) available to the model
- `inputs_pretokenized`: the natural language input fed to the model
- `targets_pretokenized`: the natural language target that the model has to generate
- `idx`: identifier of the (example, answer_option_id) pair in the case of rank classification
- `weight`: a weight for the example produced by seqio (always set to 1.0 in practice)
- `is_correct`: whether the (example, answer_option_id) pair is the correct one
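The rank-classification fields described above can be combined as in the following minimal sketch. It assumes that `idx` is ordered as `[example_id, answer_option_id]`, as the field description suggests, and that a log-likelihood score for each row's `targets_pretokenized` has already been computed by some model. The helper names `pick_predictions` and `accuracy`, the sample rows, and the scores are hypothetical and are not part of the dataset or of any library.

```python
def pick_predictions(rows, scores):
    """Keep, for each example, the answer option with the highest score.

    rows   -- dicts with 'idx' ([example_id, answer_option_id]) and 'is_correct'
    scores -- one log-likelihood per row (hypothetical, supplied by the caller)
    """
    best = {}
    for row, score in zip(rows, scores):
        example_id, option_id = row["idx"]
        # Replace the stored option only when this one scores strictly higher.
        if example_id not in best or score > best[example_id][1]:
            best[example_id] = (option_id, score, row["is_correct"])
    return best


def accuracy(best):
    """Fraction of examples whose top-scoring option is marked correct."""
    return sum(1 for _, _, ok in best.values() if ok) / len(best)


# Illustrative rows only; real rows would come from a *_score_eval config.
rows = [
    {"idx": [5, 0], "is_correct": True},
    {"idx": [5, 1], "is_correct": False},
    {"idx": [6, 0], "is_correct": False},
    {"idx": [6, 1], "is_correct": True},
]
scores = [-1.2, -3.4, -0.5, -2.0]  # made-up log-likelihoods

best = pick_predictions(rows, scores)
print(accuracy(best))
```

Real rows could presumably be obtained with `datasets.load_dataset("matiss/P3-Latvian-QuickMT", "<config_name>")`, where `<config_name>` is one of the configurations listed in the metadata above; the `weight` field is ignored here because it is described as always being 1.0.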

Use cases: