matiss/P3-Latvian-QuickMT
Source: Hugging Face. Updated 2026-02-16; indexed 2026-03-29.
Download link:
https://hf-mirror.com/datasets/matiss/P3-Latvian-QuickMT
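The `dataset_info` metadata in the resource description records, for each config, a `num_bytes` value per split plus a `download_size` and a `dataset_size`. In the entries listed here, `dataset_size` appears to equal the sum of the per-split `num_bytes` values. The sketch below checks that relationship for one entry. The numbers are copied from the `adversarial_qa_dbert_answer_the_following_q` config; the dictionary layout and the helper names `total_split_bytes` and `is_consistent` are assumptions made for illustration, not part of the dataset or of any library.

```python
def total_split_bytes(splits):
    """Sum the num_bytes field over a list of split records."""
    return sum(split["num_bytes"] for split in splits)


def is_consistent(config):
    """Return True when dataset_size equals the sum of the split sizes."""
    return total_split_bytes(config["splits"]) == config["dataset_size"]


# Values copied from the adversarial_qa_dbert_answer_the_following_q entry.
# The nested-dict layout is an assumption chosen for this example.
config = {
    "config_name": "adversarial_qa_dbert_answer_the_following_q",
    "splits": [
        {"name": "train", "num_bytes": 10073371, "num_examples": 10000},
        {"name": "validation", "num_bytes": 992047, "num_examples": 1000},
    ],
    "download_size": 2583136,
    "dataset_size": 11065418,
}

print(is_consistent(config))  # prints True
```

The same check can be applied to any other entry by copying its split sizes and `dataset_size` into a dictionary of the same shape. `download_size` is left out of the comparison because it is reported separately and is smaller than `dataset_size` in the entries shown.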
Resource description:
---
dataset_info:
- config_name: adversarial_qa_dbert_answer_the_following_q
features:
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 10073371
num_examples: 10000
- name: validation
num_bytes: 992047
num_examples: 1000
download_size: 2583136
dataset_size: 11065418
- config_name: adversarial_qa_dbert_based_on
features:
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 9543041
num_examples: 10000
- name: validation
num_bytes: 938956
num_examples: 1000
download_size: 2548825
dataset_size: 10481997
- config_name: adversarial_qa_dbert_generate_question
features:
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 9935906
num_examples: 10000
- name: validation
num_bytes: 981170
num_examples: 1000
- name: test
num_bytes: 1040114
num_examples: 1000
download_size: 2317046
dataset_size: 11957190
- config_name: adversarial_qa_dbert_question_context_answer
features:
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 9141988
num_examples: 10000
- name: validation
num_bytes: 899724
num_examples: 1000
download_size: 2513875
dataset_size: 10041712
- config_name: adversarial_qa_dbert_tell_what_it_is
features:
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 9543304
num_examples: 10000
- name: validation
num_bytes: 939766
num_examples: 1000
download_size: 2550715
dataset_size: 10483070
- config_name: adversarial_qa_dbidaf_answer_the_following_q
features:
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 9995769
num_examples: 10000
- name: validation
num_bytes: 995527
num_examples: 1000
download_size: 2610594
dataset_size: 10991296
- config_name: adversarial_qa_dbidaf_based_on
features:
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 9467160
num_examples: 10000
- name: validation
num_bytes: 941873
num_examples: 1000
download_size: 2583831
dataset_size: 10409033
- config_name: adversarial_qa_dbidaf_generate_question
features:
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 9922079
num_examples: 10000
- name: validation
num_bytes: 984612
num_examples: 1000
- name: test
num_bytes: 1023021
num_examples: 1000
download_size: 2347181
dataset_size: 11929712
- config_name: adversarial_qa_dbidaf_question_context_answer
features:
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 9065166
num_examples: 10000
- name: validation
num_bytes: 902890
num_examples: 1000
download_size: 2547566
dataset_size: 9968056
- config_name: adversarial_qa_dbidaf_tell_what_it_is
features:
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 9469228
num_examples: 10000
- name: validation
num_bytes: 943176
num_examples: 1000
download_size: 2595039
dataset_size: 10412404
- config_name: adversarial_qa_droberta_answer_the_following_q
features:
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 9931301
num_examples: 10000
- name: validation
num_bytes: 980683
num_examples: 1000
download_size: 2665783
dataset_size: 10911984
- config_name: adversarial_qa_droberta_based_on
features:
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 9402830
num_examples: 10000
- name: validation
num_bytes: 927020
num_examples: 1000
download_size: 2619552
dataset_size: 10329850
- config_name: adversarial_qa_droberta_generate_question
features:
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 9778471
num_examples: 10000
- name: validation
num_bytes: 973024
num_examples: 1000
- name: test
num_bytes: 1066952
num_examples: 1000
download_size: 2426128
dataset_size: 11818447
- config_name: adversarial_qa_droberta_question_context_answer
features:
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 8999476
num_examples: 10000
- name: validation
num_bytes: 888031
num_examples: 1000
download_size: 2595726
dataset_size: 9887507
- config_name: adversarial_qa_droberta_tell_what_it_is
features:
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 9400737
num_examples: 10000
- name: validation
num_bytes: 927874
num_examples: 1000
download_size: 2633324
dataset_size: 10328611
- config_name: ag_news_classify
features:
- name: answer_choices
list: string
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 51201178
num_examples: 120000
- name: test
num_bytes: 3233173
num_examples: 7600
download_size: 21150470
dataset_size: 54434351
- config_name: ag_news_classify_question_first
features:
- name: answer_choices
list: string
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 51201178
num_examples: 120000
- name: test
num_bytes: 3233173
num_examples: 7600
download_size: 21013028
dataset_size: 54434351
- config_name: ag_news_classify_with_choices
features:
- name: answer_choices
list: string
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 55521424
num_examples: 120000
- name: test
num_bytes: 3506795
num_examples: 7600
download_size: 21835832
dataset_size: 59028219
- config_name: ag_news_classify_with_choices_question_first
features:
- name: answer_choices
list: string
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 55521424
num_examples: 120000
- name: test
num_bytes: 3506795
num_examples: 7600
download_size: 21693542
dataset_size: 59028219
- config_name: ag_news_recommend
features:
- name: answer_choices
list: string
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 51891217
num_examples: 120000
- name: test
num_bytes: 3276936
num_examples: 7600
download_size: 21812607
dataset_size: 55168153
- config_name: ag_news_which_section
features:
- name: answer_choices
list: string
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 51861207
num_examples: 120000
- name: test
num_bytes: 3274999
num_examples: 7600
download_size: 21236892
dataset_size: 55136206
- config_name: ag_news_which_section_choices
features:
- name: answer_choices
list: string
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 60619719
num_examples: 120000
- name: test
num_bytes: 3829486
num_examples: 7600
download_size: 22487894
dataset_size: 64449205
- config_name: ai2_arc_ARC_Challenge_heres_a_problem
features:
- name: answer_choices
list: string
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 465398
num_examples: 1119
- name: validation
num_bytes: 127761
num_examples: 299
- name: test
num_bytes: 496119
num_examples: 1172
download_size: 457627
dataset_size: 1089278
- config_name: ai2_arc_ARC_Challenge_i_am_hesitating
features:
- name: answer_choices
list: string
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 689304
num_examples: 1119
- name: validation
num_bytes: 186520
num_examples: 299
- name: test
num_bytes: 716071
num_examples: 1172
download_size: 749084
dataset_size: 1591895
- config_name: ai2_arc_ARC_Challenge_multiple_choice
features:
- name: answer_choices
list: string
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 748129
num_examples: 1119
- name: validation
num_bytes: 202353
num_examples: 299
- name: test
num_bytes: 777708
num_examples: 1172
download_size: 769013
dataset_size: 1728190
- config_name: ai2_arc_ARC_Challenge_pick_false_options
features:
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 491201
num_examples: 1119
- name: validation
num_bytes: 135537
num_examples: 299
- name: test
num_bytes: 525124
num_examples: 1172
download_size: 601174
dataset_size: 1151862
- config_name: ai2_arc_ARC_Challenge_pick_the_most_correct_option
features:
- name: answer_choices
list: string
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 462864
num_examples: 1119
- name: validation
num_bytes: 127018
num_examples: 299
- name: test
num_bytes: 493354
num_examples: 1172
download_size: 462204
dataset_size: 1083236
- config_name: ai2_arc_ARC_Challenge_qa_options
features:
- name: answer_choices
list: string
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 561853
num_examples: 1119
- name: validation
num_bytes: 152514
num_examples: 299
- name: test
num_bytes: 582558
num_examples: 1172
download_size: 723478
dataset_size: 1296925
- config_name: ai2_arc_ARC_Easy_heres_a_problem
features:
- name: answer_choices
list: string
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 850868
num_examples: 2251
- name: validation
num_bytes: 215754
num_examples: 570
- name: test
num_bytes: 902804
num_examples: 2376
download_size: 784671
dataset_size: 1969426
- config_name: ai2_arc_ARC_Easy_i_am_hesitating
features:
- name: answer_choices
list: string
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 1263913
num_examples: 2251
- name: validation
num_bytes: 337210
num_examples: 570
- name: test
num_bytes: 1349197
num_examples: 2376
download_size: 1244736
dataset_size: 2950320
- config_name: ai2_arc_ARC_Easy_multiple_choice
features:
- name: answer_choices
list: string
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 1382690
num_examples: 2251
- name: validation
num_bytes: 367311
num_examples: 570
- name: test
num_bytes: 1474809
num_examples: 2376
download_size: 1282047
dataset_size: 3224810
- config_name: ai2_arc_ARC_Easy_pick_false_options
features:
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 866252
num_examples: 2251
- name: validation
num_bytes: 219001
num_examples: 570
- name: test
num_bytes: 919188
num_examples: 2376
download_size: 1016220
dataset_size: 2004441
- config_name: ai2_arc_ARC_Easy_pick_the_most_correct_option
features:
- name: answer_choices
list: string
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 846126
num_examples: 2251
- name: validation
num_bytes: 214684
num_examples: 570
- name: test
num_bytes: 897783
num_examples: 2376
download_size: 795969
dataset_size: 1958593
- config_name: ai2_arc_ARC_Easy_qa_options
features:
- name: answer_choices
list: string
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 1007817
num_examples: 2251
- name: validation
num_bytes: 272367
num_examples: 570
- name: test
num_bytes: 1079242
num_examples: 2376
download_size: 1192608
dataset_size: 2359426
- config_name: amazon_polarity_Is_this_product_review_positive
features:
- name: answer_choices
list: string
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 2116403330
num_examples: 3600000
- name: test
num_bytes: 235047773
num_examples: 400000
download_size: 1234623408
dataset_size: 2351451103
- config_name: amazon_polarity_Is_this_review
features:
- name: answer_choices
list: string
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 2332403191
num_examples: 3600000
- name: test
num_bytes: 259047773
num_examples: 400000
download_size: 1234078263
dataset_size: 2591450964
- config_name: amazon_polarity_Is_this_review_negative
features:
- name: answer_choices
list: string
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 2084003136
num_examples: 3600000
- name: test
num_bytes: 231447772
num_examples: 400000
download_size: 1234000458
dataset_size: 2315450908
- config_name: amazon_polarity_User_recommend_this_product
features:
- name: answer_choices
list: string
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 2087813177
num_examples: 3600000
- name: test
num_bytes: 231855467
num_examples: 400000
download_size: 1173652629
dataset_size: 2319668644
- config_name: amazon_polarity_convey_negative_or_positive_sentiment
features:
- name: answer_choices
list: string
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 2436803218
num_examples: 3600000
- name: test
num_bytes: 270647775
num_examples: 400000
download_size: 1252792299
dataset_size: 2707450993
- config_name: amazon_polarity_flattering_or_not
features:
- name: answer_choices
list: string
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 2309994572
num_examples: 3600000
- name: test
num_bytes: 256544770
num_examples: 400000
download_size: 1261191576
dataset_size: 2566539342
- config_name: amazon_polarity_negative_or_positive_tone
features:
- name: answer_choices
list: string
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 2480002823
num_examples: 3600000
- name: test
num_bytes: 275447773
num_examples: 400000
download_size: 1254172662
dataset_size: 2755450596
- config_name: anli_GPT_3_style_r1
features:
- name: answer_choices
list: string
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 9115390
num_examples: 16946
- name: validation
num_bytes: 536694
num_examples: 1000
- name: test
num_bytes: 540176
num_examples: 1000
download_size: 3459817
dataset_size: 10192260
- config_name: anli_GPT_3_style_r1_score_eval
features:
- name: idx
list: int32
- name: inputs_pretokenized
dtype: string
- name: is_correct
dtype: bool
- name: targets_pretokenized
dtype: string
- name: weight
dtype: float32
splits:
- name: train
num_bytes: 24867598
num_examples: 50838
- name: validation
num_bytes: 1460439
num_examples: 3000
- name: test
num_bytes: 1470885
num_examples: 3000
download_size: 5023993
dataset_size: 27798922
- config_name: anli_GPT_3_style_r2
features:
- name: answer_choices
list: string
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 24000342
num_examples: 45460
- name: validation
num_bytes: 532044
num_examples: 1000
- name: test
num_bytes: 533883
num_examples: 1000
download_size: 6649999
dataset_size: 25066269
- config_name: anli_GPT_3_style_r2_score_eval
features:
- name: idx
list: int32
- name: inputs_pretokenized
dtype: string
- name: is_correct
dtype: bool
- name: targets_pretokenized
dtype: string
- name: weight
dtype: float32
splits:
- name: train
num_bytes: 65432804
num_examples: 136380
- name: validation
num_bytes: 1446489
num_examples: 3000
- name: test
num_bytes: 1452006
num_examples: 3000
download_size: 10738630
dataset_size: 68331299
- config_name: anli_GPT_3_style_r3
features:
- name: answer_choices
list: string
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 51902632
num_examples: 100459
- name: validation
num_bytes: 629291
num_examples: 1200
- name: test
num_bytes: 628740
num_examples: 1200
download_size: 14041053
dataset_size: 53160663
- config_name: anli_GPT_3_style_r3_score_eval
features:
- name: idx
list: int32
- name: inputs_pretokenized
dtype: string
- name: is_correct
dtype: bool
- name: targets_pretokenized
dtype: string
- name: weight
dtype: float32
splits:
- name: train
num_bytes: 140973371
num_examples: 301377
- name: validation
num_bytes: 1708385
num_examples: 3600
- name: test
num_bytes: 1706732
num_examples: 3600
download_size: 22892192
dataset_size: 144388488
- config_name: anli_MNLI_crowdsource_r1
features:
- name: answer_choices
list: string
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 10642813
num_examples: 16946
- name: validation
num_bytes: 626399
num_examples: 1000
- name: test
num_bytes: 627777
num_examples: 1000
download_size: 3660956
dataset_size: 11896989
- config_name: anli_MNLI_crowdsource_r1_score_eval
features:
- name: idx
list: int32
- name: inputs_pretokenized
dtype: string
- name: is_correct
dtype: bool
- name: targets_pretokenized
dtype: string
- name: weight
dtype: float32
splits:
- name: train
num_bytes: 29776793
num_examples: 50838
- name: validation
num_bytes: 1753564
num_examples: 3000
- name: test
num_bytes: 1757698
num_examples: 3000
download_size: 5491534
dataset_size: 33288055
- config_name: anli_MNLI_crowdsource_r2
features:
- name: answer_choices
list: string
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 28143499
num_examples: 45460
- name: validation
num_bytes: 621285
num_examples: 1000
- name: test
num_bytes: 622546
num_examples: 1000
download_size: 7070331
dataset_size: 29387330
- config_name: anli_MNLI_crowdsource_r2_score_eval
features:
- name: idx
list: int32
- name: inputs_pretokenized
dtype: string
- name: is_correct
dtype: bool
- name: targets_pretokenized
dtype: string
- name: weight
dtype: float32
splits:
- name: train
num_bytes: 78614594
num_examples: 136380
- name: validation
num_bytes: 1738222
num_examples: 3000
- name: test
num_bytes: 1742005
num_examples: 3000
download_size: 11848479
dataset_size: 82094821
- config_name: anli_MNLI_crowdsource_r3
features:
- name: answer_choices
list: string
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 60941758
num_examples: 100459
- name: validation
num_bytes: 735534
num_examples: 1200
- name: test
num_bytes: 735326
num_examples: 1200
download_size: 14942935
dataset_size: 62412618
- config_name: anli_MNLI_crowdsource_r3_score_eval
features:
- name: idx
list: int32
- name: inputs_pretokenized
dtype: string
- name: is_correct
dtype: bool
- name: targets_pretokenized
dtype: string
- name: weight
dtype: float32
splits:
- name: train
num_bytes: 170080650
num_examples: 301377
- name: validation
num_bytes: 2055764
num_examples: 3600
- name: test
num_bytes: 2055140
num_examples: 3600
download_size: 25275994
dataset_size: 174191554
- config_name: anli_always_sometimes_never_r1
features:
- name: answer_choices
list: string
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 11191134
num_examples: 16946
- name: validation
num_bytes: 656563
num_examples: 1000
- name: test
num_bytes: 658871
num_examples: 1000
download_size: 3587589
dataset_size: 12506568
- config_name: anli_always_sometimes_never_r1_score_eval
features:
- name: idx
list: int32
- name: inputs_pretokenized
dtype: string
- name: is_correct
dtype: bool
- name: targets_pretokenized
dtype: string
- name: weight
dtype: float32
splits:
- name: train
num_bytes: 27700949
num_examples: 50838
- name: validation
num_bytes: 1628101
num_examples: 3000
- name: test
num_bytes: 1635025
num_examples: 3000
download_size: 5260211
dataset_size: 30964075
- config_name: anli_always_sometimes_never_r2
features:
- name: answer_choices
list: string
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 29812996
num_examples: 45460
- name: validation
num_bytes: 654966
num_examples: 1000
- name: test
num_bytes: 658147
num_examples: 1000
download_size: 6908336
dataset_size: 31126109
- config_name: anli_always_sometimes_never_r2_score_eval
features:
- name: idx
list: int32
- name: inputs_pretokenized
dtype: string
- name: is_correct
dtype: bool
- name: targets_pretokenized
dtype: string
- name: weight
dtype: float32
splits:
- name: train
num_bytes: 73576704
num_examples: 136380
- name: validation
num_bytes: 1623431
num_examples: 3000
- name: test
num_bytes: 1632853
num_examples: 3000
download_size: 11287979
dataset_size: 76832988
- config_name: anli_always_sometimes_never_r3
features:
- name: answer_choices
list: string
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 65435333
num_examples: 100459
- name: validation
num_bytes: 787929
num_examples: 1200
- name: test
num_bytes: 787630
num_examples: 1200
download_size: 14623047
dataset_size: 67010892
- config_name: anli_always_sometimes_never_r3_score_eval
features:
- name: idx
list: int32
- name: inputs_pretokenized
dtype: string
- name: is_correct
dtype: bool
- name: targets_pretokenized
dtype: string
- name: weight
dtype: float32
splits:
- name: train
num_bytes: 161561672
num_examples: 301377
- name: validation
num_bytes: 1953785
num_examples: 3600
- name: test
num_bytes: 1952888
num_examples: 3600
download_size: 24140958
dataset_size: 165468345
- config_name: anli_based_on_the_previous_passage_r1
features:
- name: answer_choices
list: string
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 10187263
num_examples: 16946
- name: validation
num_bytes: 596966
num_examples: 1000
- name: test
num_bytes: 600395
num_examples: 1000
download_size: 3580346
dataset_size: 11384624
- config_name: anli_based_on_the_previous_passage_r1_score_eval
features:
- name: idx
list: int32
- name: inputs_pretokenized
dtype: string
- name: is_correct
dtype: bool
- name: targets_pretokenized
dtype: string
- name: weight
dtype: float32
splits:
- name: train
num_bytes: 27768843
num_examples: 50838
- name: validation
num_bytes: 1632270
num_examples: 3000
- name: test
num_bytes: 1642557
num_examples: 3000
download_size: 5297339
dataset_size: 31043670
- config_name: anli_based_on_the_previous_passage_r2
features:
- name: answer_choices
list: string
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 26981220
num_examples: 45460
- name: validation
num_bytes: 593000
num_examples: 1000
- name: test
num_bytes: 594249
num_examples: 1000
download_size: 6899624
dataset_size: 28168469
- config_name: anli_based_on_the_previous_passage_r2_score_eval
features:
- name: idx
list: int32
- name: inputs_pretokenized
dtype: string
- name: is_correct
dtype: bool
- name: targets_pretokenized
dtype: string
- name: weight
dtype: float32
splits:
- name: train
num_bytes: 73278732
num_examples: 136380
- name: validation
num_bytes: 1620372
num_examples: 3000
- name: test
num_bytes: 1624119
num_examples: 3000
download_size: 11365938
dataset_size: 76523223
- config_name: anli_based_on_the_previous_passage_r3
features:
- name: answer_choices
list: string
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 58306464
num_examples: 100459
- name: validation
num_bytes: 702240
num_examples: 1200
- name: test
num_bytes: 701887
num_examples: 1200
download_size: 14536507
dataset_size: 59710591
- config_name: anli_based_on_the_previous_passage_r3_score_eval
features:
- name: idx
list: int32
- name: inputs_pretokenized
dtype: string
- name: is_correct
dtype: bool
- name: targets_pretokenized
dtype: string
- name: weight
dtype: float32
splits:
- name: train
num_bytes: 158428558
num_examples: 301377
- name: validation
num_bytes: 1916108
num_examples: 3600
- name: test
num_bytes: 1915049
num_examples: 3600
download_size: 24238355
dataset_size: 162259715
- config_name: anli_can_we_infer_r1
features:
- name: answer_choices
list: string
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 9660251
num_examples: 16946
- name: validation
num_bytes: 567095
num_examples: 1000
- name: test
num_bytes: 569234
num_examples: 1000
download_size: 3540836
dataset_size: 10796580
- config_name: anli_can_we_infer_r1_score_eval
features:
- name: idx
list: int32
- name: inputs_pretokenized
dtype: string
- name: is_correct
dtype: bool
- name: targets_pretokenized
dtype: string
- name: weight
dtype: float32
splits:
- name: train
num_bytes: 26188002
num_examples: 50838
- name: validation
num_bytes: 1542657
num_examples: 3000
- name: test
num_bytes: 1549074
num_examples: 3000
download_size: 5185988
dataset_size: 29279733
- config_name: anli_can_we_infer_r2
features:
- name: answer_choices
list: string
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 25702496
num_examples: 45460
- name: validation
num_bytes: 564027
num_examples: 1000
- name: test
num_bytes: 566605
num_examples: 1000
download_size: 6839728
dataset_size: 26833128
- config_name: anli_can_we_infer_r2_score_eval
features:
- name: idx
list: int32
- name: inputs_pretokenized
dtype: string
- name: is_correct
dtype: bool
- name: targets_pretokenized
dtype: string
- name: weight
dtype: float32
splits:
- name: train
num_bytes: 69442411
num_examples: 136380
- name: validation
num_bytes: 1533453
num_examples: 3000
- name: test
num_bytes: 1541187
num_examples: 3000
download_size: 11133446
dataset_size: 72517051
- config_name: anli_can_we_infer_r3
features:
- name: answer_choices
list: string
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 55711582
num_examples: 100459
- name: validation
num_bytes: 671622
num_examples: 1200
- name: test
num_bytes: 671041
num_examples: 1200
download_size: 14400590
dataset_size: 57054245
- config_name: anli_can_we_infer_r3_score_eval
features:
- name: idx
list: int32
- name: inputs_pretokenized
dtype: string
- name: is_correct
dtype: bool
- name: targets_pretokenized
dtype: string
- name: weight
dtype: float32
splits:
- name: train
num_bytes: 150644154
num_examples: 301377
- name: validation
num_bytes: 1824254
num_examples: 3600
- name: test
num_bytes: 1822511
num_examples: 3600
download_size: 23718191
dataset_size: 154290919
- config_name: anli_claim_true_false_inconclusive_r1_score_eval
features:
- name: idx
list: int32
- name: inputs_pretokenized
dtype: string
- name: is_correct
dtype: bool
- name: targets_pretokenized
dtype: string
- name: weight
dtype: float32
splits:
- name: train
num_bytes: 28009371
num_examples: 50838
- name: validation
num_bytes: 1645608
num_examples: 3000
- name: test
num_bytes: 1656036
num_examples: 3000
download_size: 5309202
dataset_size: 31311015
- config_name: anli_claim_true_false_inconclusive_r2
features:
- name: answer_choices
list: string
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 27303933
num_examples: 45460
- name: validation
num_bytes: 603705
num_examples: 1000
- name: test
num_bytes: 605108
num_examples: 1000
download_size: 6909550
dataset_size: 28512746
- config_name: anli_claim_true_false_inconclusive_r2_score_eval
features:
- name: idx
list: int32
- name: inputs_pretokenized
dtype: string
- name: is_correct
dtype: bool
- name: targets_pretokenized
dtype: string
- name: weight
dtype: float32
splits:
- name: train
num_bytes: 73958163
num_examples: 136380
- name: validation
num_bytes: 1634481
num_examples: 3000
- name: test
num_bytes: 1638690
num_examples: 3000
download_size: 11378115
dataset_size: 77231334
- config_name: anli_claim_true_false_inconclusive_r3
features:
- name: answer_choices
list: string
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 59171538
num_examples: 100459
- name: validation
num_bytes: 715035
num_examples: 1200
- name: test
num_bytes: 714555
num_examples: 1200
download_size: 14576610
dataset_size: 60601128
- config_name: anli_claim_true_false_inconclusive_r3_score_eval
features:
- name: idx
list: int32
- name: inputs_pretokenized
dtype: string
- name: is_correct
dtype: bool
- name: targets_pretokenized
dtype: string
- name: weight
dtype: float32
splits:
- name: train
num_bytes: 159869656
num_examples: 301377
- name: validation
num_bytes: 1933163
num_examples: 3600
- name: test
num_bytes: 1931723
num_examples: 3600
download_size: 24278176
dataset_size: 163734542
- config_name: anli_consider_always_sometimes_never_r1
features:
- name: answer_choices
list: string
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 11760024
num_examples: 16946
- name: validation
num_bytes: 689682
num_examples: 1000
- name: test
num_bytes: 693346
num_examples: 1000
download_size: 3573913
dataset_size: 13143052
- config_name: anli_consider_always_sometimes_never_r1_score_eval
features:
- name: idx
list: int32
- name: inputs_pretokenized
dtype: string
- name: is_correct
dtype: bool
- name: targets_pretokenized
dtype: string
- name: weight
dtype: float32
splits:
- name: train
num_bytes: 29407373
num_examples: 50838
- name: validation
num_bytes: 1727533
num_examples: 3000
- name: test
num_bytes: 1738450
num_examples: 3000
download_size: 5322654
dataset_size: 32873356
- config_name: anli_consider_always_sometimes_never_r2
features:
- name: answer_choices
list: string
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 31166765
num_examples: 45460
- name: validation
num_bytes: 686050
num_examples: 1000
- name: test
num_bytes: 687415
num_examples: 1000
download_size: 6920348
dataset_size: 32540230
- config_name: anli_consider_always_sometimes_never_r2_score_eval
features:
- name: idx
list: int32
- name: inputs_pretokenized
dtype: string
- name: is_correct
dtype: bool
- name: targets_pretokenized
dtype: string
- name: weight
dtype: float32
splits:
- name: train
num_bytes: 77636441
num_examples: 136380
- name: validation
num_bytes: 1716562
num_examples: 3000
- name: test
num_bytes: 1720657
num_examples: 3000
download_size: 11455538
dataset_size: 81073660
- config_name: anli_consider_always_sometimes_never_r3
features:
- name: answer_choices
list: string
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 67603449
num_examples: 100459
- name: validation
num_bytes: 813779
num_examples: 1200
- name: test
num_bytes: 813586
num_examples: 1200
download_size: 14579500
dataset_size: 69230814
- config_name: anli_does_it_follow_that_r1
features:
- name: answer_choices
list: string
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 9519548
num_examples: 16946
- name: validation
num_bytes: 558881
num_examples: 1000
- name: test
num_bytes: 559906
num_examples: 1000
download_size: 3515973
dataset_size: 10638335
- config_name: anli_does_it_follow_that_r1_score_eval
features:
- name: idx
list: int32
- name: inputs_pretokenized
dtype: string
- name: is_correct
dtype: bool
- name: targets_pretokenized
dtype: string
- name: weight
dtype: float32
splits:
- name: train
num_bytes: 25766034
num_examples: 50838
- name: validation
num_bytes: 1518015
num_examples: 3000
- name: test
num_bytes: 1521090
num_examples: 3000
download_size: 5128975
dataset_size: 28805139
- config_name: anli_does_it_follow_that_r2
features:
- name: answer_choices
list: string
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 25316900
num_examples: 45460
- name: validation
num_bytes: 555526
num_examples: 1000
- name: test
num_bytes: 557906
num_examples: 1000
download_size: 6775880
dataset_size: 26430332
- config_name: anli_does_it_follow_that_r2_score_eval
features:
- name: idx
list: int32
- name: inputs_pretokenized
dtype: string
- name: is_correct
dtype: bool
- name: targets_pretokenized
dtype: string
- name: weight
dtype: float32
splits:
- name: train
num_bytes: 68285560
num_examples: 136380
- name: validation
num_bytes: 1507950
num_examples: 3000
- name: test
num_bytes: 1515090
num_examples: 3000
download_size: 10995030
dataset_size: 71308600
- config_name: anli_does_it_follow_that_r3
features:
- name: answer_choices
list: string
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 55098606
num_examples: 100459
- name: validation
num_bytes: 663250
num_examples: 1200
- name: test
num_bytes: 663142
num_examples: 1200
download_size: 14338970
dataset_size: 56424998
- config_name: anli_does_it_follow_that_r3_score_eval
features:
- name: idx
list: int32
- name: inputs_pretokenized
dtype: string
- name: is_correct
dtype: bool
- name: targets_pretokenized
dtype: string
- name: weight
dtype: float32
splits:
- name: train
num_bytes: 148805338
num_examples: 301377
- name: validation
num_bytes: 1799138
num_examples: 3600
- name: test
num_bytes: 1798814
num_examples: 3600
download_size: 23554543
dataset_size: 152403290
- config_name: anli_does_this_imply_r1
features:
- name: answer_choices
list: string
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 9655086
num_examples: 16946
- name: validation
num_bytes: 565658
num_examples: 1000
- name: test
num_bytes: 568828
num_examples: 1000
download_size: 3492464
dataset_size: 10789572
- config_name: anli_does_this_imply_r1_score_eval
features:
- name: idx
list: int32
- name: inputs_pretokenized
dtype: string
- name: is_correct
dtype: bool
- name: targets_pretokenized
dtype: string
- name: weight
dtype: float32
splits:
- name: train
num_bytes: 26172648
num_examples: 50838
- name: validation
num_bytes: 1538346
num_examples: 3000
- name: test
num_bytes: 1547856
num_examples: 3000
download_size: 5134759
dataset_size: 29258850
- config_name: anli_does_this_imply_r2
features:
- name: answer_choices
list: string
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 25536784
num_examples: 45460
- name: validation
num_bytes: 560912
num_examples: 1000
- name: test
num_bytes: 563115
num_examples: 1000
download_size: 6697367
dataset_size: 26660811
- config_name: anli_does_this_imply_r2_score_eval
features:
- name: idx
list: int32
- name: inputs_pretokenized
dtype: string
- name: is_correct
dtype: bool
- name: targets_pretokenized
dtype: string
- name: weight
dtype: float32
splits:
- name: train
num_bytes: 68945440
num_examples: 136380
- name: validation
num_bytes: 1524108
num_examples: 3000
- name: test
num_bytes: 1530717
num_examples: 3000
download_size: 10938678
dataset_size: 72000265
- config_name: anli_does_this_imply_r3
features:
- name: answer_choices
list: string
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 55037282
num_examples: 100459
- name: validation
num_bytes: 663232
num_examples: 1200
- name: test
num_bytes: 663019
num_examples: 1200
download_size: 14125310
dataset_size: 56363533
- config_name: anli_does_this_imply_r3_score_eval
features:
- name: idx
list: int32
- name: inputs_pretokenized
dtype: string
- name: is_correct
dtype: bool
- name: targets_pretokenized
dtype: string
- name: weight
dtype: float32
splits:
- name: train
num_bytes: 148621016
num_examples: 301377
- name: validation
num_bytes: 1799084
num_examples: 3600
- name: test
num_bytes: 1798445
num_examples: 3600
download_size: 23265299
dataset_size: 152218545
- config_name: anli_guaranteed_possible_impossible_r1
features:
- name: answer_choices
list: string
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 10167319
num_examples: 16946
- name: validation
num_bytes: 597064
num_examples: 1000
- name: test
num_bytes: 599508
num_examples: 1000
download_size: 3575441
dataset_size: 11363891
- config_name: anli_guaranteed_possible_impossible_r1_score_eval
features:
- name: idx
list: int32
- name: inputs_pretokenized
dtype: string
- name: is_correct
dtype: bool
- name: targets_pretokenized
dtype: string
- name: weight
dtype: float32
splits:
- name: train
num_bytes: 26962553
num_examples: 50838
- name: validation
num_bytes: 1584541
num_examples: 3000
- name: test
num_bytes: 1591873
num_examples: 3000
download_size: 5250394
dataset_size: 30138967
- config_name: anli_guaranteed_possible_impossible_r2_score_eval
features:
- name: idx
list: int32
- name: inputs_pretokenized
dtype: string
- name: is_correct
dtype: bool
- name: targets_pretokenized
dtype: string
- name: weight
dtype: float32
splits:
- name: train
num_bytes: 71603447
num_examples: 136380
- name: validation
num_bytes: 1579576
num_examples: 3000
- name: test
num_bytes: 1588447
num_examples: 3000
download_size: 11266213
dataset_size: 74771470
- config_name: anli_guaranteed_possible_impossible_r3
features:
- name: answer_choices
list: string
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 59431742
num_examples: 100459
- name: validation
num_bytes: 718066
num_examples: 1200
- name: test
num_bytes: 716235
num_examples: 1200
download_size: 14660957
dataset_size: 60866043
- config_name: anli_guaranteed_possible_impossible_r3_score_eval
features:
- name: idx
list: int32
- name: inputs_pretokenized
dtype: string
- name: is_correct
dtype: bool
- name: targets_pretokenized
dtype: string
- name: weight
dtype: float32
splits:
- name: train
num_bytes: 157322530
num_examples: 301377
- name: validation
num_bytes: 1906052
num_examples: 3600
- name: test
num_bytes: 1900559
num_examples: 3600
download_size: 24211019
dataset_size: 161129141
- config_name: anli_guaranteed_true_r1_score_eval
features:
- name: idx
list: int32
- name: inputs_pretokenized
dtype: string
- name: is_correct
dtype: bool
- name: targets_pretokenized
dtype: string
- name: weight
dtype: float32
splits:
- name: train
num_bytes: 26303334
num_examples: 50838
- name: validation
num_bytes: 1549782
num_examples: 3000
- name: test
num_bytes: 1554123
num_examples: 3000
download_size: 5189596
dataset_size: 29407239
- config_name: anli_guaranteed_true_r2
features:
- name: answer_choices
list: string
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 25800134
num_examples: 45460
- name: validation
num_bytes: 566492
num_examples: 1000
- name: test
num_bytes: 568940
num_examples: 1000
download_size: 6816917
dataset_size: 26935566
- config_name: anli_guaranteed_true_r2_score_eval
features:
- name: idx
list: int32
- name: inputs_pretokenized
dtype: string
- name: is_correct
dtype: bool
- name: targets_pretokenized
dtype: string
- name: weight
dtype: float32
splits:
- name: train
num_bytes: 69735655
num_examples: 136380
- name: validation
num_bytes: 1540848
num_examples: 3000
- name: test
num_bytes: 1548192
num_examples: 3000
download_size: 11124774
dataset_size: 72824695
- config_name: anli_guaranteed_true_r3
features:
- name: answer_choices
list: string
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 56032445
num_examples: 100459
- name: validation
num_bytes: 675708
num_examples: 1200
- name: test
num_bytes: 675096
num_examples: 1200
download_size: 14434826
dataset_size: 57383249
- config_name: anli_guaranteed_true_r3_score_eval
features:
- name: idx
list: int32
- name: inputs_pretokenized
dtype: string
- name: is_correct
dtype: bool
- name: targets_pretokenized
dtype: string
- name: weight
dtype: float32
splits:
- name: train
num_bytes: 151606885
num_examples: 301377
- name: validation
num_bytes: 1836512
num_examples: 3600
- name: test
num_bytes: 1834676
num_examples: 3600
download_size: 23818570
dataset_size: 155278073
- config_name: anli_justified_in_saying_r1
features:
- name: answer_choices
list: string
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 9616558
num_examples: 16946
- name: validation
num_bytes: 563304
num_examples: 1000
- name: test
num_bytes: 566515
num_examples: 1000
download_size: 3521817
dataset_size: 10746377
- config_name: anli_justified_in_saying_r1_score_eval
features:
- name: idx
list: int32
- name: inputs_pretokenized
dtype: string
- name: is_correct
dtype: bool
- name: targets_pretokenized
dtype: string
- name: weight
dtype: float32
splits:
- name: train
num_bytes: 26057064
num_examples: 50838
- name: validation
num_bytes: 1531284
num_examples: 3000
- name: test
num_bytes: 1540917
num_examples: 3000
download_size: 5139740
dataset_size: 29129265
- config_name: anli_justified_in_saying_r2
features:
- name: answer_choices
list: string
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 25437357
num_examples: 45460
- name: validation
num_bytes: 558810
num_examples: 1000
- name: test
num_bytes: 560649
num_examples: 1000
download_size: 6730996
dataset_size: 26556816
- config_name: anli_justified_in_saying_r2_score_eval
features:
- name: idx
list: int32
- name: inputs_pretokenized
dtype: string
- name: is_correct
dtype: bool
- name: targets_pretokenized
dtype: string
- name: weight
dtype: float32
splits:
- name: train
num_bytes: 68647159
num_examples: 136380
- name: validation
num_bytes: 1517802
num_examples: 3000
- name: test
num_bytes: 1523319
num_examples: 3000
download_size: 10951560
dataset_size: 71688280
- config_name: anli_justified_in_saying_r3
features:
- name: answer_choices
list: string
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 54839356
num_examples: 100459
- name: validation
num_bytes: 661135
num_examples: 1200
- name: test
num_bytes: 660452
num_examples: 1200
download_size: 14173959
dataset_size: 56160943
- config_name: anli_justified_in_saying_r3_score_eval
features:
- name: idx
list: int32
- name: inputs_pretokenized
dtype: string
- name: is_correct
dtype: bool
- name: targets_pretokenized
dtype: string
- name: weight
dtype: float32
splits:
- name: train
num_bytes: 148027576
num_examples: 301377
- name: validation
num_bytes: 1792793
num_examples: 3600
- name: test
num_bytes: 1790744
num_examples: 3600
download_size: 23294761
dataset_size: 151611113
- config_name: anli_must_be_true_r1
features:
- name: answer_choices
list: string
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 9670808
num_examples: 16946
- name: validation
num_bytes: 567825
num_examples: 1000
- name: test
num_bytes: 569003
num_examples: 1000
download_size: 3529205
dataset_size: 10807636
- config_name: anli_must_be_true_r1_score_eval
features:
- name: idx
list: int32
- name: inputs_pretokenized
dtype: string
- name: is_correct
dtype: bool
- name: targets_pretokenized
dtype: string
- name: weight
dtype: float32
splits:
- name: train
num_bytes: 26219814
num_examples: 50838
- name: validation
num_bytes: 1544847
num_examples: 3000
- name: test
num_bytes: 1548381
num_examples: 3000
download_size: 5178314
dataset_size: 29313042
- config_name: anli_must_be_true_r2
features:
- name: answer_choices
list: string
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 25724048
num_examples: 45460
- name: validation
num_bytes: 564549
num_examples: 1000
- name: test
num_bytes: 567086
num_examples: 1000
download_size: 6789593
dataset_size: 26855683
- config_name: anli_must_be_true_r2_score_eval
features:
- name: idx
list: int32
- name: inputs_pretokenized
dtype: string
- name: is_correct
dtype: bool
- name: targets_pretokenized
dtype: string
- name: weight
dtype: float32
splits:
- name: train
num_bytes: 69507232
num_examples: 136380
- name: validation
num_bytes: 1535019
num_examples: 3000
- name: test
num_bytes: 1542630
num_examples: 3000
download_size: 11090449
dataset_size: 72584881
- config_name: anli_must_be_true_r3_score_eval
features:
- name: idx
list: int32
- name: inputs_pretokenized
dtype: string
- name: is_correct
dtype: bool
- name: targets_pretokenized
dtype: string
- name: weight
dtype: float32
splits:
- name: train
num_bytes: 151384159
num_examples: 301377
- name: validation
num_bytes: 1831217
num_examples: 3600
- name: test
num_bytes: 1830050
num_examples: 3600
download_size: 23694968
dataset_size: 155045426
- config_name: anli_should_assume_r1
features:
- name: answer_choices
list: string
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 9808224
num_examples: 16946
- name: validation
num_bytes: 576245
num_examples: 1000
- name: test
num_bytes: 577674
num_examples: 1000
download_size: 3557170
dataset_size: 10962143
- config_name: anli_should_assume_r1_score_eval
features:
- name: idx
list: int32
- name: inputs_pretokenized
dtype: string
- name: is_correct
dtype: bool
- name: targets_pretokenized
dtype: string
- name: weight
dtype: float32
splits:
- name: train
num_bytes: 26632062
num_examples: 50838
- name: validation
num_bytes: 1570107
num_examples: 3000
- name: test
num_bytes: 1574394
num_examples: 3000
download_size: 5227051
dataset_size: 29776563
- config_name: anli_should_assume_r2
features:
- name: answer_choices
list: string
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 26138650
num_examples: 45460
- name: validation
num_bytes: 573744
num_examples: 1000
- name: test
num_bytes: 576288
num_examples: 1000
download_size: 6843519
dataset_size: 27288682
- config_name: anli_should_assume_r2_score_eval
features:
- name: idx
list: int32
- name: inputs_pretokenized
dtype: string
- name: is_correct
dtype: bool
- name: targets_pretokenized
dtype: string
- name: weight
dtype: float32
splits:
- name: train
num_bytes: 70751038
num_examples: 136380
- name: validation
num_bytes: 1562604
num_examples: 3000
- name: test
num_bytes: 1570236
num_examples: 3000
download_size: 11203140
dataset_size: 73883878
- config_name: anli_should_assume_r3
features:
- name: answer_choices
list: string
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 56811303
num_examples: 100459
- name: validation
num_bytes: 684578
num_examples: 1200
- name: test
num_bytes: 684389
num_examples: 1200
download_size: 14457250
dataset_size: 58180270
- config_name: anli_should_assume_r3_score_eval
features:
- name: idx
list: int32
- name: inputs_pretokenized
dtype: string
- name: is_correct
dtype: bool
- name: targets_pretokenized
dtype: string
- name: weight
dtype: float32
splits:
- name: train
num_bytes: 153943435
num_examples: 301377
- name: validation
num_bytes: 1863122
num_examples: 3600
- name: test
num_bytes: 1862555
num_examples: 3600
download_size: 23996809
dataset_size: 157669112
- config_name: anli_take_the_following_as_truth_r1
features:
- name: answer_choices
list: string
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 10155922
num_examples: 16946
- name: validation
num_bytes: 596175
num_examples: 1000
- name: test
num_bytes: 602483
num_examples: 1000
download_size: 3647827
dataset_size: 11354580
- config_name: anli_take_the_following_as_truth_r1_score_eval
features:
- name: idx
list: int32
- name: inputs_pretokenized
dtype: string
- name: is_correct
dtype: bool
- name: targets_pretokenized
dtype: string
- name: weight
dtype: float32
splits:
- name: train
num_bytes: 27494019
num_examples: 50838
- name: validation
num_bytes: 1611891
num_examples: 3000
- name: test
num_bytes: 1630815
num_examples: 3000
download_size: 5358731
dataset_size: 30736725
- config_name: anli_take_the_following_as_truth_r2
features:
- name: answer_choices
list: string
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 26847342
num_examples: 45460
- name: validation
num_bytes: 593274
num_examples: 1000
- name: test
num_bytes: 595952
num_examples: 1000
download_size: 7011166
dataset_size: 28036568
- config_name: anli_take_the_following_as_truth_r2_score_eval
features:
- name: idx
list: int32
- name: inputs_pretokenized
dtype: string
- name: is_correct
dtype: bool
- name: targets_pretokenized
dtype: string
- name: weight
dtype: float32
splits:
- name: train
num_bytes: 72588670
num_examples: 136380
- name: validation
num_bytes: 1603188
num_examples: 3000
- name: test
num_bytes: 1611222
num_examples: 3000
download_size: 11470246
dataset_size: 75803080
- config_name: anli_take_the_following_as_truth_r3
features:
- name: answer_choices
list: string
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 58926387
num_examples: 100459
- name: validation
num_bytes: 712653
num_examples: 1200
- name: test
num_bytes: 710980
num_examples: 1200
download_size: 14817660
dataset_size: 60350020
- config_name: anli_take_the_following_as_truth_r3_score_eval
features:
- name: idx
list: int32
- name: inputs_pretokenized
dtype: string
- name: is_correct
dtype: bool
- name: targets_pretokenized
dtype: string
- name: weight
dtype: float32
splits:
- name: train
num_bytes: 159133840
num_examples: 301377
- name: validation
num_bytes: 1926017
num_examples: 3600
- name: test
num_bytes: 1920998
num_examples: 3600
download_size: 24605953
dataset_size: 162980855
- config_name: app_reviews_categorize_rating_using_review
features:
- name: answer_choices
list: string
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 84161782
num_examples: 288065
download_size: 16063169
dataset_size: 84161782
- config_name: app_reviews_convert_to_rating
features:
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 56636258
num_examples: 288065
download_size: 15450009
dataset_size: 56636258
- config_name: app_reviews_convert_to_star_rating
features:
- name: answer_choices
list: string
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 82142267
num_examples: 288065
download_size: 15479328
dataset_size: 82142267
- config_name: app_reviews_generate_review
features:
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 56378272
num_examples: 288065
download_size: 13190483
dataset_size: 56378272
- config_name: cnn_dailymail_3.0.0_generate_story
features:
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 720471112
num_examples: 287113
- name: validation
num_bytes: 33618761
num_examples: 13368
- name: test
num_bytes: 28745061
num_examples: 11490
download_size: 494183488
dataset_size: 782834934
- config_name: cnn_dailymail_3.0.0_news_card_view
features:
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 732243635
num_examples: 287113
- name: validation
num_bytes: 34166818
num_examples: 13368
- name: test
num_bytes: 29216132
num_examples: 11490
download_size: 497253563
dataset_size: 795626585
- config_name: cnn_dailymail_3.0.0_news_stock
features:
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 730808072
num_examples: 287113
- name: validation
num_bytes: 34099975
num_examples: 13368
- name: test
num_bytes: 29158682
num_examples: 11490
download_size: 496939280
dataset_size: 794066729
- config_name: cnn_dailymail_3.0.0_spice_up_story
features:
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 731668204
num_examples: 287113
- name: validation
num_bytes: 34140304
num_examples: 13368
- name: test
num_bytes: 29193153
num_examples: 11490
download_size: 495827285
dataset_size: 795001661
- config_name: cnn_dailymail_3.0.0_sum_in_brief
features:
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 713359413
num_examples: 287113
- name: validation
num_bytes: 33281342
num_examples: 13368
- name: test
num_bytes: 28452485
num_examples: 11490
download_size: 495236620
dataset_size: 775093240
- config_name: wiki_hop_original_generate_subject_and_object
features:
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 324405773
num_examples: 43738
- name: validation
num_bytes: 40667716
num_examples: 5129
download_size: 214339064
dataset_size: 365073489
- config_name: wiki_qa_Decide_good_answer
features:
- name: answer_choices
sequence: string
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 6709566
num_examples: 20360
- name: validation
num_bytes: 892236
num_examples: 2733
- name: test
num_bytes: 2011550
num_examples: 6165
download_size: 3332585
dataset_size: 9613352
- config_name: wiki_qa_Direct_Answer_to_Question
features:
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 247130
num_examples: 1040
- name: validation
num_bytes: 33007
num_examples: 140
- name: test
num_bytes: 69123
num_examples: 293
download_size: 223105
dataset_size: 349260
- config_name: wiki_qa_Generate_Question_from_Topic
features:
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 288904
num_examples: 1040
- name: validation
num_bytes: 39404
num_examples: 140
- name: test
num_bytes: 78870
num_examples: 293
download_size: 239887
dataset_size: 407178
- config_name: wiki_qa_Is_This_True_
features:
- name: answer_choices
sequence: string
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 5530835
num_examples: 20360
- name: validation
num_bytes: 732086
num_examples: 2733
- name: test
num_bytes: 1659667
num_examples: 6165
download_size: 3174774
dataset_size: 7922588
- config_name: wiki_qa_Jeopardy_style
features:
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 273177
num_examples: 1040
- name: validation
num_bytes: 37569
num_examples: 140
- name: test
num_bytes: 74673
num_examples: 293
download_size: 237245
dataset_size: 385419
- config_name: wiki_qa_Topic_Prediction_Answer_Only
features:
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 241330
num_examples: 1040
- name: validation
num_bytes: 31552
num_examples: 140
- name: test
num_bytes: 64548
num_examples: 293
download_size: 210145
dataset_size: 337430
- config_name: wiki_qa_Topic_Prediction_Question_Only
features:
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 115109
num_examples: 1040
- name: validation
num_bytes: 14809
num_examples: 140
- name: test
num_bytes: 30624
num_examples: 293
download_size: 65785
dataset_size: 160542
- config_name: wiki_qa_Topic_Prediction_Question_and_Answer_Pair
features:
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 315922
num_examples: 1040
- name: validation
num_bytes: 41981
num_examples: 140
- name: test
num_bytes: 85690
num_examples: 293
download_size: 243365
dataset_size: 443593
- config_name: wiki_qa_automatic_system
features:
- name: answer_choices
sequence: string
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 7509189
num_examples: 20360
- name: validation
num_bytes: 999229
num_examples: 2733
- name: test
num_bytes: 2259517
num_examples: 6165
download_size: 3413064
dataset_size: 10767935
- config_name: wiki_qa_exercise
features:
- name: answer_choices
sequence: string
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 8969100
num_examples: 20360
- name: validation
num_bytes: 1194909
num_examples: 2733
- name: test
num_bytes: 2706993
num_examples: 6165
download_size: 3473289
dataset_size: 12871002
- config_name: wiki_qa_found_on_google
features:
- name: answer_choices
sequence: string
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 6406361
num_examples: 20360
- name: validation
num_bytes: 851514
num_examples: 2733
- name: test
num_bytes: 1927237
num_examples: 6165
download_size: 3286517
dataset_size: 9185112
- config_name: winogrande_winogrande_debiased_Replace
features:
- name: answer_choices
sequence: string
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 2494802
num_examples: 9248
- name: validation
num_bytes: 318674
num_examples: 1267
- name: test
num_bytes: 474866
num_examples: 1767
download_size: 1133981
dataset_size: 3288342
- config_name: winogrande_winogrande_debiased_Replace_score_eval
features:
- name: idx
sequence: int32
- name: inputs_pretokenized
dtype: string
- name: is_correct
dtype: bool
- name: targets_pretokenized
dtype: string
- name: weight
dtype: float32
splits:
- name: train
num_bytes: 4218647
num_examples: 18496
- name: validation
num_bytes: 562090
num_examples: 2534
- name: test
num_bytes: 800721
num_examples: 3534
download_size: 1223476
dataset_size: 5581458
- config_name: winogrande_winogrande_debiased_does_underscore_refer_to
features:
- name: answer_choices
sequence: string
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 2378512
num_examples: 9248
- name: validation
num_bytes: 302729
num_examples: 1267
- name: test
num_bytes: 452544
num_examples: 1767
download_size: 1125118
dataset_size: 3133785
- config_name: winogrande_winogrande_debiased_does_underscore_refer_to_score_eval
features:
- name: idx
sequence: int32
- name: inputs_pretokenized
dtype: string
- name: is_correct
dtype: bool
- name: targets_pretokenized
dtype: string
- name: weight
dtype: float32
splits:
- name: train
num_bytes: 3986067
num_examples: 18496
- name: validation
num_bytes: 530200
num_examples: 2534
- name: test
num_bytes: 756077
num_examples: 3534
download_size: 1212731
dataset_size: 5272344
- config_name: winogrande_winogrande_debiased_fill_in_the_blank
features:
- name: answer_choices
sequence: string
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 2513297
num_examples: 9248
- name: validation
num_bytes: 321208
num_examples: 1267
- name: test
num_bytes: 478398
num_examples: 1767
download_size: 1149729
dataset_size: 3312903
- config_name: winogrande_winogrande_debiased_fill_in_the_blank_score_eval
features:
- name: idx
sequence: int32
- name: inputs_pretokenized
dtype: string
- name: is_correct
dtype: bool
- name: targets_pretokenized
dtype: string
- name: weight
dtype: float32
splits:
- name: train
num_bytes: 4255637
num_examples: 18496
- name: validation
num_bytes: 567158
num_examples: 2534
- name: test
num_bytes: 807785
num_examples: 3534
download_size: 1240495
dataset_size: 5630580
- config_name: winogrande_winogrande_debiased_stand_for
features:
- name: answer_choices
sequence: string
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 2331302
num_examples: 9248
- name: validation
num_bytes: 296295
num_examples: 1267
- name: test
num_bytes: 443594
num_examples: 1767
download_size: 1132226
dataset_size: 3071191
- config_name: winogrande_winogrande_debiased_stand_for_score_eval
features:
- name: idx
sequence: int32
- name: inputs_pretokenized
dtype: string
- name: is_correct
dtype: bool
- name: targets_pretokenized
dtype: string
- name: weight
dtype: float32
splits:
- name: train
num_bytes: 3891647
num_examples: 18496
- name: validation
num_bytes: 517332
num_examples: 2534
- name: test
num_bytes: 738177
num_examples: 3534
download_size: 1218916
dataset_size: 5147156
- config_name: winogrande_winogrande_debiased_underscore_refer_to
features:
- name: answer_choices
sequence: string
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 2362950
num_examples: 9248
- name: validation
num_bytes: 300567
num_examples: 1267
- name: test
num_bytes: 449241
num_examples: 1767
download_size: 1141040
dataset_size: 3112758
- config_name: winogrande_winogrande_debiased_underscore_refer_to_score_eval
features:
- name: idx
sequence: int32
- name: inputs_pretokenized
dtype: string
- name: is_correct
dtype: bool
- name: targets_pretokenized
dtype: string
- name: weight
dtype: float32
splits:
- name: train
num_bytes: 3954943
num_examples: 18496
- name: validation
num_bytes: 525876
num_examples: 2534
- name: test
num_bytes: 749471
num_examples: 3534
download_size: 1228522
dataset_size: 5230290
- config_name: winogrande_winogrande_xl_Replace
features:
- name: answer_choices
sequence: string
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 10741385
num_examples: 40398
- name: validation
num_bytes: 318674
num_examples: 1267
- name: test
num_bytes: 474866
num_examples: 1767
download_size: 3228045
dataset_size: 11534925
- config_name: winogrande_winogrande_xl_Replace_score_eval
features:
- name: idx
sequence: int32
- name: inputs_pretokenized
dtype: string
- name: is_correct
dtype: bool
- name: targets_pretokenized
dtype: string
- name: weight
dtype: float32
splits:
- name: train
num_bytes: 18186622
num_examples: 80796
- name: validation
num_bytes: 562090
num_examples: 2534
- name: test
num_bytes: 800721
num_examples: 3534
download_size: 3525012
dataset_size: 19549433
- config_name: winogrande_winogrande_xl_does_underscore_refer_to
features:
- name: answer_choices
sequence: string
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 10233503
num_examples: 40398
- name: validation
num_bytes: 302729
num_examples: 1267
- name: test
num_bytes: 452544
num_examples: 1767
download_size: 3202869
dataset_size: 10988776
- config_name: winogrande_winogrande_xl_does_underscore_refer_to_score_eval
features:
- name: idx
sequence: int32
- name: inputs_pretokenized
dtype: string
- name: is_correct
dtype: bool
- name: targets_pretokenized
dtype: string
- name: weight
dtype: float32
splits:
- name: train
num_bytes: 17170858
num_examples: 80796
- name: validation
num_bytes: 530200
num_examples: 2534
- name: test
num_bytes: 756077
num_examples: 3534
download_size: 3495469
dataset_size: 18457135
- config_name: winogrande_winogrande_xl_fill_in_the_blank
features:
- name: answer_choices
sequence: string
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 10822162
num_examples: 40398
- name: validation
num_bytes: 321208
num_examples: 1267
- name: test
num_bytes: 478398
num_examples: 1767
download_size: 3251155
dataset_size: 11621768
- config_name: winogrande_winogrande_xl_fill_in_the_blank_score_eval
features:
- name: idx
sequence: int32
- name: inputs_pretokenized
dtype: string
- name: is_correct
dtype: bool
- name: targets_pretokenized
dtype: string
- name: weight
dtype: float32
splits:
- name: train
num_bytes: 18348176
num_examples: 80796
- name: validation
num_bytes: 567158
num_examples: 2534
- name: test
num_bytes: 807785
num_examples: 3534
download_size: 3559359
dataset_size: 19723119
- config_name: winogrande_winogrande_xl_stand_for
features:
- name: answer_choices
sequence: string
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 10027577
num_examples: 40398
- name: validation
num_bytes: 296295
num_examples: 1267
- name: test
num_bytes: 443594
num_examples: 1767
download_size: 3199335
dataset_size: 10767466
- config_name: winogrande_winogrande_xl_stand_for_score_eval
features:
- name: idx
sequence: int32
- name: inputs_pretokenized
dtype: string
- name: is_correct
dtype: bool
- name: targets_pretokenized
dtype: string
- name: weight
dtype: float32
splits:
- name: train
num_bytes: 16759006
num_examples: 80796
- name: validation
num_bytes: 517332
num_examples: 2534
- name: test
num_bytes: 738177
num_examples: 3534
download_size: 3490708
dataset_size: 18014515
- config_name: winogrande_winogrande_xl_underscore_refer_to
features:
- name: answer_choices
sequence: string
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 10164596
num_examples: 40398
- name: validation
num_bytes: 300567
num_examples: 1267
- name: test
num_bytes: 449241
num_examples: 1767
download_size: 3238319
dataset_size: 10914404
- config_name: winogrande_winogrande_xl_underscore_refer_to_score_eval
features:
- name: idx
sequence: int32
- name: inputs_pretokenized
dtype: string
- name: is_correct
dtype: bool
- name: targets_pretokenized
dtype: string
- name: weight
dtype: float32
splits:
- name: train
num_bytes: 17033044
num_examples: 80796
- name: validation
num_bytes: 525876
num_examples: 2534
- name: test
num_bytes: 749471
num_examples: 3534
download_size: 3535036
dataset_size: 18308391
- config_name: wiqa_does_the_supposed_perturbation_have_an_effect
features:
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 16606837
num_examples: 29808
- name: validation
num_bytes: 3646373
num_examples: 6894
- name: test
num_bytes: 1453319
num_examples: 3003
download_size: 7860625
dataset_size: 21706529
- config_name: wiqa_effect_with_label_answer
features:
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 15276174
num_examples: 29808
- name: validation
num_bytes: 3338021
num_examples: 6894
- name: test
num_bytes: 1321769
num_examples: 3003
download_size: 7596498
dataset_size: 19935964
- config_name: wiqa_effect_with_string_answer
features:
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 17442183
num_examples: 29808
- name: validation
num_bytes: 3838951
num_examples: 6894
- name: test
num_bytes: 1538114
num_examples: 3003
download_size: 7965517
dataset_size: 22819248
- config_name: wiqa_what_is_the_final_step_of_the_following_process
features:
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 11055108
num_examples: 29808
- name: validation
num_bytes: 2393488
num_examples: 6894
- name: test
num_bytes: 919963
num_examples: 3003
download_size: 1798047
dataset_size: 14368559
- config_name: wiqa_what_is_the_missing_first_step
features:
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 11524119
num_examples: 29808
- name: validation
num_bytes: 2497447
num_examples: 6894
- name: test
num_bytes: 965820
num_examples: 3003
download_size: 1803559
dataset_size: 14987386
- config_name: wiqa_what_might_be_the_first_step_of_the_process
features:
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 11315905
num_examples: 29808
- name: validation
num_bytes: 2449390
num_examples: 6894
- name: test
num_bytes: 944799
num_examples: 3003
download_size: 1804439
dataset_size: 14710094
- config_name: wiqa_what_might_be_the_last_step_of_the_process
features:
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 11144532
num_examples: 29808
- name: validation
num_bytes: 2414170
num_examples: 6894
- name: test
num_bytes: 928972
num_examples: 3003
download_size: 1814891
dataset_size: 14487674
- config_name: wiqa_which_of_the_following_is_the_supposed_perturbation
features:
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 18957678
num_examples: 29808
- name: validation
num_bytes: 4189565
num_examples: 6894
- name: test
num_bytes: 1693819
num_examples: 3003
download_size: 8179468
dataset_size: 24841062
- config_name: xsum_DOC_boils_down_to_simple_idea_that
features:
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 371586389
num_examples: 204045
- name: validation
num_bytes: 20594956
num_examples: 11332
- name: test
num_bytes: 20687186
num_examples: 11334
download_size: 269300769
dataset_size: 412868531
- config_name: xsum_DOC_given_above_write_one_sentence
features:
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 379138897
num_examples: 204045
- name: validation
num_bytes: 21014340
num_examples: 11332
- name: test
num_bytes: 21106399
num_examples: 11334
download_size: 270338713
dataset_size: 421259636
- config_name: xsum_DOC_how_would_you_rephrase_few_words
features:
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 369954546
num_examples: 204045
- name: validation
num_bytes: 20504409
num_examples: 11332
- name: test
num_bytes: 20596440
num_examples: 11334
download_size: 269019804
dataset_size: 411055395
- config_name: xsum_DOC_tldr
features:
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 362915209
num_examples: 204045
- name: validation
num_bytes: 20113639
num_examples: 11332
- name: test
num_bytes: 20205635
num_examples: 11334
download_size: 268508779
dataset_size: 403234483
- config_name: xsum_DOC_write_summary_of_above
features:
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 371792476
num_examples: 204045
- name: validation
num_bytes: 20606407
num_examples: 11332
- name: test
num_bytes: 20698539
num_examples: 11334
download_size: 269049793
dataset_size: 413097422
- config_name: xsum_article_DOC_summary
features:
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 365810922
num_examples: 204045
- name: validation
num_bytes: 20275221
num_examples: 11332
- name: test
num_bytes: 20365758
num_examples: 11334
download_size: 268441502
dataset_size: 406451901
- config_name: xsum_college_roommate_asked_DOC_so_I_recap
features:
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 384238846
num_examples: 204045
- name: validation
num_bytes: 21297509
num_examples: 11332
- name: test
num_bytes: 21389915
num_examples: 11334
download_size: 271215076
dataset_size: 426926270
- config_name: xsum_read_below_DOC_write_abstract
features:
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 380158330
num_examples: 204045
- name: validation
num_bytes: 21070992
num_examples: 11332
- name: test
num_bytes: 21163190
num_examples: 11334
download_size: 270293998
dataset_size: 422392512
- config_name: xsum_summarize_DOC
features:
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 363565939
num_examples: 204045
- name: validation
num_bytes: 20150985
num_examples: 11332
- name: test
num_bytes: 20242518
num_examples: 11334
download_size: 268400446
dataset_size: 403959442
- config_name: xsum_summarize_this_DOC_summary
features:
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 368965786
num_examples: 204045
- name: validation
num_bytes: 20450887
num_examples: 11332
- name: test
num_bytes: 20542475
num_examples: 11334
download_size: 269385999
dataset_size: 409959148
- config_name: yelp_review_full_based_on_that
features:
- name: answer_choices
sequence: string
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 580583043
num_examples: 650000
- name: test
num_bytes: 44715436
num_examples: 50000
download_size: 340857277
dataset_size: 625298479
- config_name: yelp_review_full_format_rating
features:
- name: answer_choices
sequence: string
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 583826688
num_examples: 650000
- name: test
num_bytes: 44964700
num_examples: 50000
download_size: 341987921
dataset_size: 628791388
- config_name: yelp_review_full_format_score
features:
- name: answer_choices
sequence: string
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 576090819
num_examples: 650000
- name: test
num_bytes: 44367990
num_examples: 50000
download_size: 342372356
dataset_size: 620458809
- config_name: yelp_review_full_format_star
features:
- name: answer_choices
sequence: string
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 572776745
num_examples: 650000
- name: test
num_bytes: 44115095
num_examples: 50000
download_size: 340848272
dataset_size: 616891840
- config_name: yelp_review_full_on_a_scale
features:
- name: answer_choices
sequence: string
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 573518291
num_examples: 650000
- name: test
num_bytes: 44166441
num_examples: 50000
download_size: 342749800
dataset_size: 617684732
- config_name: yelp_review_full_so_i_would
features:
- name: answer_choices
sequence: string
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 573422606
num_examples: 650000
- name: test
num_bytes: 44166265
num_examples: 50000
download_size: 340243303
dataset_size: 617588871
- config_name: yelp_review_full_this_place
features:
- name: answer_choices
sequence: string
- name: inputs_pretokenized
dtype: string
- name: targets_pretokenized
dtype: string
splits:
- name: train
num_bytes: 572825339
num_examples: 650000
- name: test
num_bytes: 44118931
num_examples: 50000
download_size: 341483353
dataset_size: 616944270
configs:
- config_name: adversarial_qa_dbert_answer_the_following_q
data_files:
- split: train
path: adversarial_qa_dbert_answer_the_following_q/train-*
- split: validation
path: adversarial_qa_dbert_answer_the_following_q/validation-*
- config_name: adversarial_qa_dbert_based_on
data_files:
- split: train
path: adversarial_qa_dbert_based_on/train-*
- split: validation
path: adversarial_qa_dbert_based_on/validation-*
- config_name: adversarial_qa_dbert_generate_question
data_files:
- split: train
path: adversarial_qa_dbert_generate_question/train-*
- split: validation
path: adversarial_qa_dbert_generate_question/validation-*
- split: test
path: adversarial_qa_dbert_generate_question/test-*
- config_name: adversarial_qa_dbert_question_context_answer
data_files:
- split: train
path: adversarial_qa_dbert_question_context_answer/train-*
- split: validation
path: adversarial_qa_dbert_question_context_answer/validation-*
- config_name: adversarial_qa_dbert_tell_what_it_is
data_files:
- split: train
path: adversarial_qa_dbert_tell_what_it_is/train-*
- split: validation
path: adversarial_qa_dbert_tell_what_it_is/validation-*
- config_name: adversarial_qa_dbidaf_answer_the_following_q
data_files:
- split: train
path: adversarial_qa_dbidaf_answer_the_following_q/train-*
- split: validation
path: adversarial_qa_dbidaf_answer_the_following_q/validation-*
- config_name: adversarial_qa_dbidaf_based_on
data_files:
- split: train
path: adversarial_qa_dbidaf_based_on/train-*
- split: validation
path: adversarial_qa_dbidaf_based_on/validation-*
- config_name: adversarial_qa_dbidaf_generate_question
data_files:
- split: train
path: adversarial_qa_dbidaf_generate_question/train-*
- split: validation
path: adversarial_qa_dbidaf_generate_question/validation-*
- split: test
path: adversarial_qa_dbidaf_generate_question/test-*
- config_name: adversarial_qa_dbidaf_question_context_answer
data_files:
- split: train
path: adversarial_qa_dbidaf_question_context_answer/train-*
- split: validation
path: adversarial_qa_dbidaf_question_context_answer/validation-*
- config_name: adversarial_qa_dbidaf_tell_what_it_is
data_files:
- split: train
path: adversarial_qa_dbidaf_tell_what_it_is/train-*
- split: validation
path: adversarial_qa_dbidaf_tell_what_it_is/validation-*
- config_name: adversarial_qa_droberta_answer_the_following_q
data_files:
- split: train
path: adversarial_qa_droberta_answer_the_following_q/train-*
- split: validation
path: adversarial_qa_droberta_answer_the_following_q/validation-*
- config_name: adversarial_qa_droberta_based_on
data_files:
- split: train
path: adversarial_qa_droberta_based_on/train-*
- split: validation
path: adversarial_qa_droberta_based_on/validation-*
- config_name: adversarial_qa_droberta_generate_question
data_files:
- split: train
path: adversarial_qa_droberta_generate_question/train-*
- split: validation
path: adversarial_qa_droberta_generate_question/validation-*
- split: test
path: adversarial_qa_droberta_generate_question/test-*
- config_name: adversarial_qa_droberta_question_context_answer
data_files:
- split: train
path: adversarial_qa_droberta_question_context_answer/train-*
- split: validation
path: adversarial_qa_droberta_question_context_answer/validation-*
- config_name: adversarial_qa_droberta_tell_what_it_is
data_files:
- split: train
path: adversarial_qa_droberta_tell_what_it_is/train-*
- split: validation
path: adversarial_qa_droberta_tell_what_it_is/validation-*
- config_name: ag_news_classify
data_files:
- split: train
path: ag_news_classify/train-*
- split: test
path: ag_news_classify/test-*
- config_name: ag_news_classify_question_first
data_files:
- split: train
path: ag_news_classify_question_first/train-*
- split: test
path: ag_news_classify_question_first/test-*
- config_name: ag_news_classify_with_choices
data_files:
- split: train
path: ag_news_classify_with_choices/train-*
- split: test
path: ag_news_classify_with_choices/test-*
- config_name: ag_news_classify_with_choices_question_first
data_files:
- split: train
path: ag_news_classify_with_choices_question_first/train-*
- split: test
path: ag_news_classify_with_choices_question_first/test-*
- config_name: ag_news_recommend
data_files:
- split: train
path: ag_news_recommend/train-*
- split: test
path: ag_news_recommend/test-*
- config_name: ag_news_which_section
data_files:
- split: train
path: ag_news_which_section/train-*
- split: test
path: ag_news_which_section/test-*
- config_name: ag_news_which_section_choices
data_files:
- split: train
path: ag_news_which_section_choices/train-*
- split: test
path: ag_news_which_section_choices/test-*
- config_name: ai2_arc_ARC_Challenge_heres_a_problem
data_files:
- split: train
path: ai2_arc_ARC_Challenge_heres_a_problem/train-*
- split: validation
path: ai2_arc_ARC_Challenge_heres_a_problem/validation-*
- split: test
path: ai2_arc_ARC_Challenge_heres_a_problem/test-*
- config_name: ai2_arc_ARC_Challenge_i_am_hesitating
data_files:
- split: train
path: ai2_arc_ARC_Challenge_i_am_hesitating/train-*
- split: validation
path: ai2_arc_ARC_Challenge_i_am_hesitating/validation-*
- split: test
path: ai2_arc_ARC_Challenge_i_am_hesitating/test-*
- config_name: ai2_arc_ARC_Challenge_multiple_choice
data_files:
- split: train
path: ai2_arc_ARC_Challenge_multiple_choice/train-*
- split: validation
path: ai2_arc_ARC_Challenge_multiple_choice/validation-*
- split: test
path: ai2_arc_ARC_Challenge_multiple_choice/test-*
- config_name: ai2_arc_ARC_Challenge_pick_false_options
data_files:
- split: train
path: ai2_arc_ARC_Challenge_pick_false_options/train-*
- split: validation
path: ai2_arc_ARC_Challenge_pick_false_options/validation-*
- split: test
path: ai2_arc_ARC_Challenge_pick_false_options/test-*
- config_name: ai2_arc_ARC_Challenge_pick_the_most_correct_option
data_files:
- split: train
path: ai2_arc_ARC_Challenge_pick_the_most_correct_option/train-*
- split: validation
path: ai2_arc_ARC_Challenge_pick_the_most_correct_option/validation-*
- split: test
path: ai2_arc_ARC_Challenge_pick_the_most_correct_option/test-*
- config_name: ai2_arc_ARC_Challenge_qa_options
data_files:
- split: train
path: ai2_arc_ARC_Challenge_qa_options/train-*
- split: validation
path: ai2_arc_ARC_Challenge_qa_options/validation-*
- split: test
path: ai2_arc_ARC_Challenge_qa_options/test-*
- config_name: ai2_arc_ARC_Easy_heres_a_problem
data_files:
- split: train
path: ai2_arc_ARC_Easy_heres_a_problem/train-*
- split: validation
path: ai2_arc_ARC_Easy_heres_a_problem/validation-*
- split: test
path: ai2_arc_ARC_Easy_heres_a_problem/test-*
- config_name: ai2_arc_ARC_Easy_i_am_hesitating
data_files:
- split: train
path: ai2_arc_ARC_Easy_i_am_hesitating/train-*
- split: validation
path: ai2_arc_ARC_Easy_i_am_hesitating/validation-*
- split: test
path: ai2_arc_ARC_Easy_i_am_hesitating/test-*
- config_name: ai2_arc_ARC_Easy_multiple_choice
data_files:
- split: train
path: ai2_arc_ARC_Easy_multiple_choice/train-*
- split: validation
path: ai2_arc_ARC_Easy_multiple_choice/validation-*
- split: test
path: ai2_arc_ARC_Easy_multiple_choice/test-*
- config_name: ai2_arc_ARC_Easy_pick_false_options
data_files:
- split: train
path: ai2_arc_ARC_Easy_pick_false_options/train-*
- split: validation
path: ai2_arc_ARC_Easy_pick_false_options/validation-*
- split: test
path: ai2_arc_ARC_Easy_pick_false_options/test-*
- config_name: ai2_arc_ARC_Easy_pick_the_most_correct_option
data_files:
- split: train
path: ai2_arc_ARC_Easy_pick_the_most_correct_option/train-*
- split: validation
path: ai2_arc_ARC_Easy_pick_the_most_correct_option/validation-*
- split: test
path: ai2_arc_ARC_Easy_pick_the_most_correct_option/test-*
- config_name: ai2_arc_ARC_Easy_qa_options
data_files:
- split: train
path: ai2_arc_ARC_Easy_qa_options/train-*
- split: validation
path: ai2_arc_ARC_Easy_qa_options/validation-*
- split: test
path: ai2_arc_ARC_Easy_qa_options/test-*
- config_name: amazon_polarity_Is_this_product_review_positive
data_files:
- split: train
path: amazon_polarity_Is_this_product_review_positive/train-*
- split: test
path: amazon_polarity_Is_this_product_review_positive/test-*
- config_name: amazon_polarity_Is_this_review
data_files:
- split: train
path: amazon_polarity_Is_this_review/train-*
- split: test
path: amazon_polarity_Is_this_review/test-*
- config_name: amazon_polarity_Is_this_review_negative
data_files:
- split: train
path: amazon_polarity_Is_this_review_negative/train-*
- split: test
path: amazon_polarity_Is_this_review_negative/test-*
- config_name: amazon_polarity_User_recommend_this_product
data_files:
- split: train
path: amazon_polarity_User_recommend_this_product/train-*
- split: test
path: amazon_polarity_User_recommend_this_product/test-*
- config_name: amazon_polarity_convey_negative_or_positive_sentiment
data_files:
- split: train
path: amazon_polarity_convey_negative_or_positive_sentiment/train-*
- split: test
path: amazon_polarity_convey_negative_or_positive_sentiment/test-*
- config_name: amazon_polarity_flattering_or_not
data_files:
- split: train
path: amazon_polarity_flattering_or_not/train-*
- split: test
path: amazon_polarity_flattering_or_not/test-*
- config_name: amazon_polarity_negative_or_positive_tone
data_files:
- split: train
path: amazon_polarity_negative_or_positive_tone/train-*
- split: test
path: amazon_polarity_negative_or_positive_tone/test-*
- config_name: anli_GPT_3_style_r1
data_files:
- split: train
path: anli_GPT_3_style_r1/train-*
- split: validation
path: anli_GPT_3_style_r1/validation-*
- split: test
path: anli_GPT_3_style_r1/test-*
- config_name: anli_GPT_3_style_r1_score_eval
data_files:
- split: train
path: anli_GPT_3_style_r1_score_eval/train-*
- split: validation
path: anli_GPT_3_style_r1_score_eval/validation-*
- split: test
path: anli_GPT_3_style_r1_score_eval/test-*
- config_name: anli_GPT_3_style_r2
data_files:
- split: train
path: anli_GPT_3_style_r2/train-*
- split: validation
path: anli_GPT_3_style_r2/validation-*
- split: test
path: anli_GPT_3_style_r2/test-*
- config_name: anli_GPT_3_style_r2_score_eval
data_files:
- split: train
path: anli_GPT_3_style_r2_score_eval/train-*
- split: validation
path: anli_GPT_3_style_r2_score_eval/validation-*
- split: test
path: anli_GPT_3_style_r2_score_eval/test-*
- config_name: anli_GPT_3_style_r3
data_files:
- split: train
path: anli_GPT_3_style_r3/train-*
- split: validation
path: anli_GPT_3_style_r3/validation-*
- split: test
path: anli_GPT_3_style_r3/test-*
- config_name: anli_GPT_3_style_r3_score_eval
data_files:
- split: train
path: anli_GPT_3_style_r3_score_eval/train-*
- split: validation
path: anli_GPT_3_style_r3_score_eval/validation-*
- split: test
path: anli_GPT_3_style_r3_score_eval/test-*
- config_name: anli_MNLI_crowdsource_r1
data_files:
- split: train
path: anli_MNLI_crowdsource_r1/train-*
- split: validation
path: anli_MNLI_crowdsource_r1/validation-*
- split: test
path: anli_MNLI_crowdsource_r1/test-*
- config_name: anli_MNLI_crowdsource_r1_score_eval
data_files:
- split: train
path: anli_MNLI_crowdsource_r1_score_eval/train-*
- split: validation
path: anli_MNLI_crowdsource_r1_score_eval/validation-*
- split: test
path: anli_MNLI_crowdsource_r1_score_eval/test-*
- config_name: anli_MNLI_crowdsource_r2
data_files:
- split: train
path: anli_MNLI_crowdsource_r2/train-*
- split: validation
path: anli_MNLI_crowdsource_r2/validation-*
- split: test
path: anli_MNLI_crowdsource_r2/test-*
- config_name: anli_MNLI_crowdsource_r2_score_eval
data_files:
- split: train
path: anli_MNLI_crowdsource_r2_score_eval/train-*
- split: validation
path: anli_MNLI_crowdsource_r2_score_eval/validation-*
- split: test
path: anli_MNLI_crowdsource_r2_score_eval/test-*
- config_name: anli_MNLI_crowdsource_r3
data_files:
- split: train
path: anli_MNLI_crowdsource_r3/train-*
- split: validation
path: anli_MNLI_crowdsource_r3/validation-*
- split: test
path: anli_MNLI_crowdsource_r3/test-*
- config_name: anli_MNLI_crowdsource_r3_score_eval
data_files:
- split: train
path: anli_MNLI_crowdsource_r3_score_eval/train-*
- split: validation
path: anli_MNLI_crowdsource_r3_score_eval/validation-*
- split: test
path: anli_MNLI_crowdsource_r3_score_eval/test-*
- config_name: anli_always_sometimes_never_r1
data_files:
- split: train
path: anli_always_sometimes_never_r1/train-*
- split: validation
path: anli_always_sometimes_never_r1/validation-*
- split: test
path: anli_always_sometimes_never_r1/test-*
- config_name: anli_always_sometimes_never_r1_score_eval
data_files:
- split: train
path: anli_always_sometimes_never_r1_score_eval/train-*
- split: validation
path: anli_always_sometimes_never_r1_score_eval/validation-*
- split: test
path: anli_always_sometimes_never_r1_score_eval/test-*
- config_name: anli_always_sometimes_never_r2
data_files:
- split: train
path: anli_always_sometimes_never_r2/train-*
- split: validation
path: anli_always_sometimes_never_r2/validation-*
- split: test
path: anli_always_sometimes_never_r2/test-*
- config_name: anli_always_sometimes_never_r2_score_eval
data_files:
- split: train
path: anli_always_sometimes_never_r2_score_eval/train-*
- split: validation
path: anli_always_sometimes_never_r2_score_eval/validation-*
- split: test
path: anli_always_sometimes_never_r2_score_eval/test-*
- config_name: anli_always_sometimes_never_r3
data_files:
- split: train
path: anli_always_sometimes_never_r3/train-*
- split: validation
path: anli_always_sometimes_never_r3/validation-*
- split: test
path: anli_always_sometimes_never_r3/test-*
- config_name: anli_always_sometimes_never_r3_score_eval
data_files:
- split: train
path: anli_always_sometimes_never_r3_score_eval/train-*
- split: validation
path: anli_always_sometimes_never_r3_score_eval/validation-*
- split: test
path: anli_always_sometimes_never_r3_score_eval/test-*
- config_name: anli_based_on_the_previous_passage_r1
data_files:
- split: train
path: anli_based_on_the_previous_passage_r1/train-*
- split: validation
path: anli_based_on_the_previous_passage_r1/validation-*
- split: test
path: anli_based_on_the_previous_passage_r1/test-*
- config_name: anli_based_on_the_previous_passage_r1_score_eval
data_files:
- split: train
path: anli_based_on_the_previous_passage_r1_score_eval/train-*
- split: validation
path: anli_based_on_the_previous_passage_r1_score_eval/validation-*
- split: test
path: anli_based_on_the_previous_passage_r1_score_eval/test-*
- config_name: anli_based_on_the_previous_passage_r2
data_files:
- split: train
path: anli_based_on_the_previous_passage_r2/train-*
- split: validation
path: anli_based_on_the_previous_passage_r2/validation-*
- split: test
path: anli_based_on_the_previous_passage_r2/test-*
- config_name: anli_based_on_the_previous_passage_r2_score_eval
data_files:
- split: train
path: anli_based_on_the_previous_passage_r2_score_eval/train-*
- split: validation
path: anli_based_on_the_previous_passage_r2_score_eval/validation-*
- split: test
path: anli_based_on_the_previous_passage_r2_score_eval/test-*
- config_name: anli_based_on_the_previous_passage_r3
data_files:
- split: train
path: anli_based_on_the_previous_passage_r3/train-*
- split: validation
path: anli_based_on_the_previous_passage_r3/validation-*
- split: test
path: anli_based_on_the_previous_passage_r3/test-*
- config_name: anli_based_on_the_previous_passage_r3_score_eval
data_files:
- split: train
path: anli_based_on_the_previous_passage_r3_score_eval/train-*
- split: validation
path: anli_based_on_the_previous_passage_r3_score_eval/validation-*
- split: test
path: anli_based_on_the_previous_passage_r3_score_eval/test-*
- config_name: anli_can_we_infer_r1
data_files:
- split: train
path: anli_can_we_infer_r1/train-*
- split: validation
path: anli_can_we_infer_r1/validation-*
- split: test
path: anli_can_we_infer_r1/test-*
- config_name: anli_can_we_infer_r1_score_eval
data_files:
- split: train
path: anli_can_we_infer_r1_score_eval/train-*
- split: validation
path: anli_can_we_infer_r1_score_eval/validation-*
- split: test
path: anli_can_we_infer_r1_score_eval/test-*
- config_name: anli_can_we_infer_r2
data_files:
- split: train
path: anli_can_we_infer_r2/train-*
- split: validation
path: anli_can_we_infer_r2/validation-*
- split: test
path: anli_can_we_infer_r2/test-*
- config_name: anli_can_we_infer_r2_score_eval
data_files:
- split: train
path: anli_can_we_infer_r2_score_eval/train-*
- split: validation
path: anli_can_we_infer_r2_score_eval/validation-*
- split: test
path: anli_can_we_infer_r2_score_eval/test-*
- config_name: anli_can_we_infer_r3
data_files:
- split: train
path: anli_can_we_infer_r3/train-*
- split: validation
path: anli_can_we_infer_r3/validation-*
- split: test
path: anli_can_we_infer_r3/test-*
- config_name: anli_can_we_infer_r3_score_eval
data_files:
- split: train
path: anli_can_we_infer_r3_score_eval/train-*
- split: validation
path: anli_can_we_infer_r3_score_eval/validation-*
- split: test
path: anli_can_we_infer_r3_score_eval/test-*
- config_name: anli_claim_true_false_inconclusive_r1_score_eval
data_files:
- split: train
path: anli_claim_true_false_inconclusive_r1_score_eval/train-*
- split: validation
path: anli_claim_true_false_inconclusive_r1_score_eval/validation-*
- split: test
path: anli_claim_true_false_inconclusive_r1_score_eval/test-*
- config_name: anli_claim_true_false_inconclusive_r2
data_files:
- split: train
path: anli_claim_true_false_inconclusive_r2/train-*
- split: validation
path: anli_claim_true_false_inconclusive_r2/validation-*
- split: test
path: anli_claim_true_false_inconclusive_r2/test-*
- config_name: anli_claim_true_false_inconclusive_r2_score_eval
data_files:
- split: train
path: anli_claim_true_false_inconclusive_r2_score_eval/train-*
- split: validation
path: anli_claim_true_false_inconclusive_r2_score_eval/validation-*
- split: test
path: anli_claim_true_false_inconclusive_r2_score_eval/test-*
- config_name: anli_claim_true_false_inconclusive_r3
data_files:
- split: train
path: anli_claim_true_false_inconclusive_r3/train-*
- split: validation
path: anli_claim_true_false_inconclusive_r3/validation-*
- split: test
path: anli_claim_true_false_inconclusive_r3/test-*
- config_name: anli_claim_true_false_inconclusive_r3_score_eval
data_files:
- split: train
path: anli_claim_true_false_inconclusive_r3_score_eval/train-*
- split: validation
path: anli_claim_true_false_inconclusive_r3_score_eval/validation-*
- split: test
path: anli_claim_true_false_inconclusive_r3_score_eval/test-*
- config_name: anli_consider_always_sometimes_never_r1
data_files:
- split: train
path: anli_consider_always_sometimes_never_r1/train-*
- split: validation
path: anli_consider_always_sometimes_never_r1/validation-*
- split: test
path: anli_consider_always_sometimes_never_r1/test-*
- config_name: anli_consider_always_sometimes_never_r1_score_eval
data_files:
- split: train
path: anli_consider_always_sometimes_never_r1_score_eval/train-*
- split: validation
path: anli_consider_always_sometimes_never_r1_score_eval/validation-*
- split: test
path: anli_consider_always_sometimes_never_r1_score_eval/test-*
- config_name: anli_consider_always_sometimes_never_r2
data_files:
- split: train
path: anli_consider_always_sometimes_never_r2/train-*
- split: validation
path: anli_consider_always_sometimes_never_r2/validation-*
- split: test
path: anli_consider_always_sometimes_never_r2/test-*
- config_name: anli_consider_always_sometimes_never_r2_score_eval
data_files:
- split: train
path: anli_consider_always_sometimes_never_r2_score_eval/train-*
- split: validation
path: anli_consider_always_sometimes_never_r2_score_eval/validation-*
- split: test
path: anli_consider_always_sometimes_never_r2_score_eval/test-*
- config_name: anli_consider_always_sometimes_never_r3
data_files:
- split: train
path: anli_consider_always_sometimes_never_r3/train-*
- split: validation
path: anli_consider_always_sometimes_never_r3/validation-*
- split: test
path: anli_consider_always_sometimes_never_r3/test-*
- config_name: anli_does_it_follow_that_r1
data_files:
- split: train
path: anli_does_it_follow_that_r1/train-*
- split: validation
path: anli_does_it_follow_that_r1/validation-*
- split: test
path: anli_does_it_follow_that_r1/test-*
- config_name: anli_does_it_follow_that_r1_score_eval
data_files:
- split: train
path: anli_does_it_follow_that_r1_score_eval/train-*
- split: validation
path: anli_does_it_follow_that_r1_score_eval/validation-*
- split: test
path: anli_does_it_follow_that_r1_score_eval/test-*
- config_name: anli_does_it_follow_that_r2
data_files:
- split: train
path: anli_does_it_follow_that_r2/train-*
- split: validation
path: anli_does_it_follow_that_r2/validation-*
- split: test
path: anli_does_it_follow_that_r2/test-*
- config_name: anli_does_it_follow_that_r2_score_eval
data_files:
- split: train
path: anli_does_it_follow_that_r2_score_eval/train-*
- split: validation
path: anli_does_it_follow_that_r2_score_eval/validation-*
- split: test
path: anli_does_it_follow_that_r2_score_eval/test-*
- config_name: anli_does_it_follow_that_r3
data_files:
- split: train
path: anli_does_it_follow_that_r3/train-*
- split: validation
path: anli_does_it_follow_that_r3/validation-*
- split: test
path: anli_does_it_follow_that_r3/test-*
- config_name: anli_does_it_follow_that_r3_score_eval
data_files:
- split: train
path: anli_does_it_follow_that_r3_score_eval/train-*
- split: validation
path: anli_does_it_follow_that_r3_score_eval/validation-*
- split: test
path: anli_does_it_follow_that_r3_score_eval/test-*
- config_name: anli_does_this_imply_r1
data_files:
- split: train
path: anli_does_this_imply_r1/train-*
- split: validation
path: anli_does_this_imply_r1/validation-*
- split: test
path: anli_does_this_imply_r1/test-*
- config_name: anli_does_this_imply_r1_score_eval
data_files:
- split: train
path: anli_does_this_imply_r1_score_eval/train-*
- split: validation
path: anli_does_this_imply_r1_score_eval/validation-*
- split: test
path: anli_does_this_imply_r1_score_eval/test-*
- config_name: anli_does_this_imply_r2
data_files:
- split: train
path: anli_does_this_imply_r2/train-*
- split: validation
path: anli_does_this_imply_r2/validation-*
- split: test
path: anli_does_this_imply_r2/test-*
- config_name: anli_does_this_imply_r2_score_eval
data_files:
- split: train
path: anli_does_this_imply_r2_score_eval/train-*
- split: validation
path: anli_does_this_imply_r2_score_eval/validation-*
- split: test
path: anli_does_this_imply_r2_score_eval/test-*
- config_name: anli_does_this_imply_r3
data_files:
- split: train
path: anli_does_this_imply_r3/train-*
- split: validation
path: anli_does_this_imply_r3/validation-*
- split: test
path: anli_does_this_imply_r3/test-*
- config_name: anli_does_this_imply_r3_score_eval
data_files:
- split: train
path: anli_does_this_imply_r3_score_eval/train-*
- split: validation
path: anli_does_this_imply_r3_score_eval/validation-*
- split: test
path: anli_does_this_imply_r3_score_eval/test-*
- config_name: anli_guaranteed_possible_impossible_r1
data_files:
- split: train
path: anli_guaranteed_possible_impossible_r1/train-*
- split: validation
path: anli_guaranteed_possible_impossible_r1/validation-*
- split: test
path: anli_guaranteed_possible_impossible_r1/test-*
- config_name: anli_guaranteed_possible_impossible_r1_score_eval
data_files:
- split: train
path: anli_guaranteed_possible_impossible_r1_score_eval/train-*
- split: validation
path: anli_guaranteed_possible_impossible_r1_score_eval/validation-*
- split: test
path: anli_guaranteed_possible_impossible_r1_score_eval/test-*
- config_name: anli_guaranteed_possible_impossible_r2_score_eval
data_files:
- split: train
path: anli_guaranteed_possible_impossible_r2_score_eval/train-*
- split: validation
path: anli_guaranteed_possible_impossible_r2_score_eval/validation-*
- split: test
path: anli_guaranteed_possible_impossible_r2_score_eval/test-*
- config_name: anli_guaranteed_possible_impossible_r3
data_files:
- split: train
path: anli_guaranteed_possible_impossible_r3/train-*
- split: validation
path: anli_guaranteed_possible_impossible_r3/validation-*
- split: test
path: anli_guaranteed_possible_impossible_r3/test-*
- config_name: anli_guaranteed_possible_impossible_r3_score_eval
data_files:
- split: train
path: anli_guaranteed_possible_impossible_r3_score_eval/train-*
- split: validation
path: anli_guaranteed_possible_impossible_r3_score_eval/validation-*
- split: test
path: anli_guaranteed_possible_impossible_r3_score_eval/test-*
- config_name: anli_guaranteed_true_r1_score_eval
data_files:
- split: train
path: anli_guaranteed_true_r1_score_eval/train-*
- split: validation
path: anli_guaranteed_true_r1_score_eval/validation-*
- split: test
path: anli_guaranteed_true_r1_score_eval/test-*
- config_name: anli_guaranteed_true_r2
data_files:
- split: train
path: anli_guaranteed_true_r2/train-*
- split: validation
path: anli_guaranteed_true_r2/validation-*
- split: test
path: anli_guaranteed_true_r2/test-*
- config_name: anli_guaranteed_true_r2_score_eval
data_files:
- split: train
path: anli_guaranteed_true_r2_score_eval/train-*
- split: validation
path: anli_guaranteed_true_r2_score_eval/validation-*
- split: test
path: anli_guaranteed_true_r2_score_eval/test-*
- config_name: anli_guaranteed_true_r3
data_files:
- split: train
path: anli_guaranteed_true_r3/train-*
- split: validation
path: anli_guaranteed_true_r3/validation-*
- split: test
path: anli_guaranteed_true_r3/test-*
- config_name: anli_guaranteed_true_r3_score_eval
data_files:
- split: train
path: anli_guaranteed_true_r3_score_eval/train-*
- split: validation
path: anli_guaranteed_true_r3_score_eval/validation-*
- split: test
path: anli_guaranteed_true_r3_score_eval/test-*
- config_name: anli_justified_in_saying_r1
data_files:
- split: train
path: anli_justified_in_saying_r1/train-*
- split: validation
path: anli_justified_in_saying_r1/validation-*
- split: test
path: anli_justified_in_saying_r1/test-*
- config_name: anli_justified_in_saying_r1_score_eval
data_files:
- split: train
path: anli_justified_in_saying_r1_score_eval/train-*
- split: validation
path: anli_justified_in_saying_r1_score_eval/validation-*
- split: test
path: anli_justified_in_saying_r1_score_eval/test-*
- config_name: anli_justified_in_saying_r2
data_files:
- split: train
path: anli_justified_in_saying_r2/train-*
- split: validation
path: anli_justified_in_saying_r2/validation-*
- split: test
path: anli_justified_in_saying_r2/test-*
- config_name: anli_justified_in_saying_r2_score_eval
data_files:
- split: train
path: anli_justified_in_saying_r2_score_eval/train-*
- split: validation
path: anli_justified_in_saying_r2_score_eval/validation-*
- split: test
path: anli_justified_in_saying_r2_score_eval/test-*
- config_name: anli_justified_in_saying_r3
data_files:
- split: train
path: anli_justified_in_saying_r3/train-*
- split: validation
path: anli_justified_in_saying_r3/validation-*
- split: test
path: anli_justified_in_saying_r3/test-*
- config_name: anli_justified_in_saying_r3_score_eval
data_files:
- split: train
path: anli_justified_in_saying_r3_score_eval/train-*
- split: validation
path: anli_justified_in_saying_r3_score_eval/validation-*
- split: test
path: anli_justified_in_saying_r3_score_eval/test-*
- config_name: anli_must_be_true_r1
data_files:
- split: train
path: anli_must_be_true_r1/train-*
- split: validation
path: anli_must_be_true_r1/validation-*
- split: test
path: anli_must_be_true_r1/test-*
- config_name: anli_must_be_true_r1_score_eval
data_files:
- split: train
path: anli_must_be_true_r1_score_eval/train-*
- split: validation
path: anli_must_be_true_r1_score_eval/validation-*
- split: test
path: anli_must_be_true_r1_score_eval/test-*
- config_name: anli_must_be_true_r2
data_files:
- split: train
path: anli_must_be_true_r2/train-*
- split: validation
path: anli_must_be_true_r2/validation-*
- split: test
path: anli_must_be_true_r2/test-*
- config_name: anli_must_be_true_r2_score_eval
data_files:
- split: train
path: anli_must_be_true_r2_score_eval/train-*
- split: validation
path: anli_must_be_true_r2_score_eval/validation-*
- split: test
path: anli_must_be_true_r2_score_eval/test-*
- config_name: anli_must_be_true_r3_score_eval
data_files:
- split: train
path: anli_must_be_true_r3_score_eval/train-*
- split: validation
path: anli_must_be_true_r3_score_eval/validation-*
- split: test
path: anli_must_be_true_r3_score_eval/test-*
- config_name: anli_should_assume_r1
data_files:
- split: train
path: anli_should_assume_r1/train-*
- split: validation
path: anli_should_assume_r1/validation-*
- split: test
path: anli_should_assume_r1/test-*
- config_name: anli_should_assume_r1_score_eval
data_files:
- split: train
path: anli_should_assume_r1_score_eval/train-*
- split: validation
path: anli_should_assume_r1_score_eval/validation-*
- split: test
path: anli_should_assume_r1_score_eval/test-*
- config_name: anli_should_assume_r2
data_files:
- split: train
path: anli_should_assume_r2/train-*
- split: validation
path: anli_should_assume_r2/validation-*
- split: test
path: anli_should_assume_r2/test-*
- config_name: anli_should_assume_r2_score_eval
data_files:
- split: train
path: anli_should_assume_r2_score_eval/train-*
- split: validation
path: anli_should_assume_r2_score_eval/validation-*
- split: test
path: anli_should_assume_r2_score_eval/test-*
- config_name: anli_should_assume_r3
data_files:
- split: train
path: anli_should_assume_r3/train-*
- split: validation
path: anli_should_assume_r3/validation-*
- split: test
path: anli_should_assume_r3/test-*
- config_name: anli_should_assume_r3_score_eval
data_files:
- split: train
path: anli_should_assume_r3_score_eval/train-*
- split: validation
path: anli_should_assume_r3_score_eval/validation-*
- split: test
path: anli_should_assume_r3_score_eval/test-*
- config_name: anli_take_the_following_as_truth_r1
data_files:
- split: train
path: anli_take_the_following_as_truth_r1/train-*
- split: validation
path: anli_take_the_following_as_truth_r1/validation-*
- split: test
path: anli_take_the_following_as_truth_r1/test-*
- config_name: anli_take_the_following_as_truth_r1_score_eval
data_files:
- split: train
path: anli_take_the_following_as_truth_r1_score_eval/train-*
- split: validation
path: anli_take_the_following_as_truth_r1_score_eval/validation-*
- split: test
path: anli_take_the_following_as_truth_r1_score_eval/test-*
- config_name: anli_take_the_following_as_truth_r2
data_files:
- split: train
path: anli_take_the_following_as_truth_r2/train-*
- split: validation
path: anli_take_the_following_as_truth_r2/validation-*
- split: test
path: anli_take_the_following_as_truth_r2/test-*
- config_name: anli_take_the_following_as_truth_r2_score_eval
data_files:
- split: train
path: anli_take_the_following_as_truth_r2_score_eval/train-*
- split: validation
path: anli_take_the_following_as_truth_r2_score_eval/validation-*
- split: test
path: anli_take_the_following_as_truth_r2_score_eval/test-*
- config_name: anli_take_the_following_as_truth_r3
data_files:
- split: train
path: anli_take_the_following_as_truth_r3/train-*
- split: validation
path: anli_take_the_following_as_truth_r3/validation-*
- split: test
path: anli_take_the_following_as_truth_r3/test-*
- config_name: anli_take_the_following_as_truth_r3_score_eval
data_files:
- split: train
path: anli_take_the_following_as_truth_r3_score_eval/train-*
- split: validation
path: anli_take_the_following_as_truth_r3_score_eval/validation-*
- split: test
path: anli_take_the_following_as_truth_r3_score_eval/test-*
- config_name: app_reviews_categorize_rating_using_review
data_files:
- split: train
path: app_reviews_categorize_rating_using_review/train-*
- config_name: app_reviews_convert_to_rating
data_files:
- split: train
path: app_reviews_convert_to_rating/train-*
- config_name: app_reviews_convert_to_star_rating
data_files:
- split: train
path: app_reviews_convert_to_star_rating/train-*
- config_name: app_reviews_generate_review
data_files:
- split: train
path: app_reviews_generate_review/train-*
- config_name: cnn_dailymail_3.0.0_generate_story
data_files:
- split: train
path: cnn_dailymail_3.0.0_generate_story/train-*
- split: validation
path: cnn_dailymail_3.0.0_generate_story/validation-*
- split: test
path: cnn_dailymail_3.0.0_generate_story/test-*
- config_name: cnn_dailymail_3.0.0_news_card_view
data_files:
- split: train
path: cnn_dailymail_3.0.0_news_card_view/train-*
- split: validation
path: cnn_dailymail_3.0.0_news_card_view/validation-*
- split: test
path: cnn_dailymail_3.0.0_news_card_view/test-*
- config_name: cnn_dailymail_3.0.0_news_stock
data_files:
- split: train
path: cnn_dailymail_3.0.0_news_stock/train-*
- split: validation
path: cnn_dailymail_3.0.0_news_stock/validation-*
- split: test
path: cnn_dailymail_3.0.0_news_stock/test-*
- config_name: cnn_dailymail_3.0.0_spice_up_story
data_files:
- split: train
path: cnn_dailymail_3.0.0_spice_up_story/train-*
- split: validation
path: cnn_dailymail_3.0.0_spice_up_story/validation-*
- split: test
path: cnn_dailymail_3.0.0_spice_up_story/test-*
- config_name: cnn_dailymail_3.0.0_sum_in_brief
data_files:
- split: train
path: cnn_dailymail_3.0.0_sum_in_brief/train-*
- split: validation
path: cnn_dailymail_3.0.0_sum_in_brief/validation-*
- split: test
path: cnn_dailymail_3.0.0_sum_in_brief/test-*
- config_name: wiki_hop_original_generate_subject_and_object
data_files:
- split: train
path: wiki_hop_original_generate_subject_and_object/train-*
- split: validation
path: wiki_hop_original_generate_subject_and_object/validation-*
- config_name: wiki_qa_Decide_good_answer
data_files:
- split: train
path: wiki_qa_Decide_good_answer/train-*
- split: validation
path: wiki_qa_Decide_good_answer/validation-*
- split: test
path: wiki_qa_Decide_good_answer/test-*
- config_name: wiki_qa_Direct_Answer_to_Question
data_files:
- split: train
path: wiki_qa_Direct_Answer_to_Question/train-*
- split: validation
path: wiki_qa_Direct_Answer_to_Question/validation-*
- split: test
path: wiki_qa_Direct_Answer_to_Question/test-*
- config_name: wiki_qa_Generate_Question_from_Topic
data_files:
- split: train
path: wiki_qa_Generate_Question_from_Topic/train-*
- split: validation
path: wiki_qa_Generate_Question_from_Topic/validation-*
- split: test
path: wiki_qa_Generate_Question_from_Topic/test-*
- config_name: wiki_qa_Is_This_True_
data_files:
- split: train
path: wiki_qa_Is_This_True_/train-*
- split: validation
path: wiki_qa_Is_This_True_/validation-*
- split: test
path: wiki_qa_Is_This_True_/test-*
- config_name: wiki_qa_Jeopardy_style
data_files:
- split: train
path: wiki_qa_Jeopardy_style/train-*
- split: validation
path: wiki_qa_Jeopardy_style/validation-*
- split: test
path: wiki_qa_Jeopardy_style/test-*
- config_name: wiki_qa_Topic_Prediction_Answer_Only
data_files:
- split: train
path: wiki_qa_Topic_Prediction_Answer_Only/train-*
- split: validation
path: wiki_qa_Topic_Prediction_Answer_Only/validation-*
- split: test
path: wiki_qa_Topic_Prediction_Answer_Only/test-*
- config_name: wiki_qa_Topic_Prediction_Question_Only
data_files:
- split: train
path: wiki_qa_Topic_Prediction_Question_Only/train-*
- split: validation
path: wiki_qa_Topic_Prediction_Question_Only/validation-*
- split: test
path: wiki_qa_Topic_Prediction_Question_Only/test-*
- config_name: wiki_qa_Topic_Prediction_Question_and_Answer_Pair
data_files:
- split: train
path: wiki_qa_Topic_Prediction_Question_and_Answer_Pair/train-*
- split: validation
path: wiki_qa_Topic_Prediction_Question_and_Answer_Pair/validation-*
- split: test
path: wiki_qa_Topic_Prediction_Question_and_Answer_Pair/test-*
- config_name: wiki_qa_automatic_system
data_files:
- split: train
path: wiki_qa_automatic_system/train-*
- split: validation
path: wiki_qa_automatic_system/validation-*
- split: test
path: wiki_qa_automatic_system/test-*
- config_name: wiki_qa_exercise
data_files:
- split: train
path: wiki_qa_exercise/train-*
- split: validation
path: wiki_qa_exercise/validation-*
- split: test
path: wiki_qa_exercise/test-*
- config_name: wiki_qa_found_on_google
data_files:
- split: train
path: wiki_qa_found_on_google/train-*
- split: validation
path: wiki_qa_found_on_google/validation-*
- split: test
path: wiki_qa_found_on_google/test-*
- config_name: winogrande_winogrande_debiased_Replace
data_files:
- split: train
path: winogrande_winogrande_debiased_Replace/train-*
- split: validation
path: winogrande_winogrande_debiased_Replace/validation-*
- split: test
path: winogrande_winogrande_debiased_Replace/test-*
- config_name: winogrande_winogrande_debiased_Replace_score_eval
data_files:
- split: train
path: winogrande_winogrande_debiased_Replace_score_eval/train-*
- split: validation
path: winogrande_winogrande_debiased_Replace_score_eval/validation-*
- split: test
path: winogrande_winogrande_debiased_Replace_score_eval/test-*
- config_name: winogrande_winogrande_debiased_does_underscore_refer_to
data_files:
- split: train
path: winogrande_winogrande_debiased_does_underscore_refer_to/train-*
- split: validation
path: winogrande_winogrande_debiased_does_underscore_refer_to/validation-*
- split: test
path: winogrande_winogrande_debiased_does_underscore_refer_to/test-*
- config_name: winogrande_winogrande_debiased_does_underscore_refer_to_score_eval
data_files:
- split: train
path: winogrande_winogrande_debiased_does_underscore_refer_to_score_eval/train-*
- split: validation
path: winogrande_winogrande_debiased_does_underscore_refer_to_score_eval/validation-*
- split: test
path: winogrande_winogrande_debiased_does_underscore_refer_to_score_eval/test-*
- config_name: winogrande_winogrande_debiased_fill_in_the_blank
data_files:
- split: train
path: winogrande_winogrande_debiased_fill_in_the_blank/train-*
- split: validation
path: winogrande_winogrande_debiased_fill_in_the_blank/validation-*
- split: test
path: winogrande_winogrande_debiased_fill_in_the_blank/test-*
- config_name: winogrande_winogrande_debiased_fill_in_the_blank_score_eval
data_files:
- split: train
path: winogrande_winogrande_debiased_fill_in_the_blank_score_eval/train-*
- split: validation
path: winogrande_winogrande_debiased_fill_in_the_blank_score_eval/validation-*
- split: test
path: winogrande_winogrande_debiased_fill_in_the_blank_score_eval/test-*
- config_name: winogrande_winogrande_debiased_stand_for
data_files:
- split: train
path: winogrande_winogrande_debiased_stand_for/train-*
- split: validation
path: winogrande_winogrande_debiased_stand_for/validation-*
- split: test
path: winogrande_winogrande_debiased_stand_for/test-*
- config_name: winogrande_winogrande_debiased_stand_for_score_eval
data_files:
- split: train
path: winogrande_winogrande_debiased_stand_for_score_eval/train-*
- split: validation
path: winogrande_winogrande_debiased_stand_for_score_eval/validation-*
- split: test
path: winogrande_winogrande_debiased_stand_for_score_eval/test-*
- config_name: winogrande_winogrande_debiased_underscore_refer_to
data_files:
- split: train
path: winogrande_winogrande_debiased_underscore_refer_to/train-*
- split: validation
path: winogrande_winogrande_debiased_underscore_refer_to/validation-*
- split: test
path: winogrande_winogrande_debiased_underscore_refer_to/test-*
- config_name: winogrande_winogrande_debiased_underscore_refer_to_score_eval
data_files:
- split: train
path: winogrande_winogrande_debiased_underscore_refer_to_score_eval/train-*
- split: validation
path: winogrande_winogrande_debiased_underscore_refer_to_score_eval/validation-*
- split: test
path: winogrande_winogrande_debiased_underscore_refer_to_score_eval/test-*
- config_name: winogrande_winogrande_xl_Replace
data_files:
- split: train
path: winogrande_winogrande_xl_Replace/train-*
- split: validation
path: winogrande_winogrande_xl_Replace/validation-*
- split: test
path: winogrande_winogrande_xl_Replace/test-*
- config_name: winogrande_winogrande_xl_Replace_score_eval
data_files:
- split: train
path: winogrande_winogrande_xl_Replace_score_eval/train-*
- split: validation
path: winogrande_winogrande_xl_Replace_score_eval/validation-*
- split: test
path: winogrande_winogrande_xl_Replace_score_eval/test-*
- config_name: winogrande_winogrande_xl_does_underscore_refer_to
data_files:
- split: train
path: winogrande_winogrande_xl_does_underscore_refer_to/train-*
- split: validation
path: winogrande_winogrande_xl_does_underscore_refer_to/validation-*
- split: test
path: winogrande_winogrande_xl_does_underscore_refer_to/test-*
- config_name: winogrande_winogrande_xl_does_underscore_refer_to_score_eval
data_files:
- split: train
path: winogrande_winogrande_xl_does_underscore_refer_to_score_eval/train-*
- split: validation
path: winogrande_winogrande_xl_does_underscore_refer_to_score_eval/validation-*
- split: test
path: winogrande_winogrande_xl_does_underscore_refer_to_score_eval/test-*
- config_name: winogrande_winogrande_xl_fill_in_the_blank
data_files:
- split: train
path: winogrande_winogrande_xl_fill_in_the_blank/train-*
- split: validation
path: winogrande_winogrande_xl_fill_in_the_blank/validation-*
- split: test
path: winogrande_winogrande_xl_fill_in_the_blank/test-*
- config_name: winogrande_winogrande_xl_fill_in_the_blank_score_eval
data_files:
- split: train
path: winogrande_winogrande_xl_fill_in_the_blank_score_eval/train-*
- split: validation
path: winogrande_winogrande_xl_fill_in_the_blank_score_eval/validation-*
- split: test
path: winogrande_winogrande_xl_fill_in_the_blank_score_eval/test-*
- config_name: winogrande_winogrande_xl_stand_for
data_files:
- split: train
path: winogrande_winogrande_xl_stand_for/train-*
- split: validation
path: winogrande_winogrande_xl_stand_for/validation-*
- split: test
path: winogrande_winogrande_xl_stand_for/test-*
- config_name: winogrande_winogrande_xl_stand_for_score_eval
data_files:
- split: train
path: winogrande_winogrande_xl_stand_for_score_eval/train-*
- split: validation
path: winogrande_winogrande_xl_stand_for_score_eval/validation-*
- split: test
path: winogrande_winogrande_xl_stand_for_score_eval/test-*
- config_name: winogrande_winogrande_xl_underscore_refer_to
data_files:
- split: train
path: winogrande_winogrande_xl_underscore_refer_to/train-*
- split: validation
path: winogrande_winogrande_xl_underscore_refer_to/validation-*
- split: test
path: winogrande_winogrande_xl_underscore_refer_to/test-*
- config_name: winogrande_winogrande_xl_underscore_refer_to_score_eval
data_files:
- split: train
path: winogrande_winogrande_xl_underscore_refer_to_score_eval/train-*
- split: validation
path: winogrande_winogrande_xl_underscore_refer_to_score_eval/validation-*
- split: test
path: winogrande_winogrande_xl_underscore_refer_to_score_eval/test-*
- config_name: wiqa_does_the_supposed_perturbation_have_an_effect
data_files:
- split: train
path: wiqa_does_the_supposed_perturbation_have_an_effect/train-*
- split: validation
path: wiqa_does_the_supposed_perturbation_have_an_effect/validation-*
- split: test
path: wiqa_does_the_supposed_perturbation_have_an_effect/test-*
- config_name: wiqa_effect_with_label_answer
data_files:
- split: train
path: wiqa_effect_with_label_answer/train-*
- split: validation
path: wiqa_effect_with_label_answer/validation-*
- split: test
path: wiqa_effect_with_label_answer/test-*
- config_name: wiqa_effect_with_string_answer
data_files:
- split: train
path: wiqa_effect_with_string_answer/train-*
- split: validation
path: wiqa_effect_with_string_answer/validation-*
- split: test
path: wiqa_effect_with_string_answer/test-*
- config_name: wiqa_what_is_the_final_step_of_the_following_process
data_files:
- split: train
path: wiqa_what_is_the_final_step_of_the_following_process/train-*
- split: validation
path: wiqa_what_is_the_final_step_of_the_following_process/validation-*
- split: test
path: wiqa_what_is_the_final_step_of_the_following_process/test-*
- config_name: wiqa_what_is_the_missing_first_step
data_files:
- split: train
path: wiqa_what_is_the_missing_first_step/train-*
- split: validation
path: wiqa_what_is_the_missing_first_step/validation-*
- split: test
path: wiqa_what_is_the_missing_first_step/test-*
- config_name: wiqa_what_might_be_the_first_step_of_the_process
data_files:
- split: train
path: wiqa_what_might_be_the_first_step_of_the_process/train-*
- split: validation
path: wiqa_what_might_be_the_first_step_of_the_process/validation-*
- split: test
path: wiqa_what_might_be_the_first_step_of_the_process/test-*
- config_name: wiqa_what_might_be_the_last_step_of_the_process
data_files:
- split: train
path: wiqa_what_might_be_the_last_step_of_the_process/train-*
- split: validation
path: wiqa_what_might_be_the_last_step_of_the_process/validation-*
- split: test
path: wiqa_what_might_be_the_last_step_of_the_process/test-*
- config_name: wiqa_which_of_the_following_is_the_supposed_perturbation
data_files:
- split: train
path: wiqa_which_of_the_following_is_the_supposed_perturbation/train-*
- split: validation
path: wiqa_which_of_the_following_is_the_supposed_perturbation/validation-*
- split: test
path: wiqa_which_of_the_following_is_the_supposed_perturbation/test-*
- config_name: xsum_DOC_boils_down_to_simple_idea_that
data_files:
- split: train
path: xsum_DOC_boils_down_to_simple_idea_that/train-*
- split: validation
path: xsum_DOC_boils_down_to_simple_idea_that/validation-*
- split: test
path: xsum_DOC_boils_down_to_simple_idea_that/test-*
- config_name: xsum_DOC_given_above_write_one_sentence
data_files:
- split: train
path: xsum_DOC_given_above_write_one_sentence/train-*
- split: validation
path: xsum_DOC_given_above_write_one_sentence/validation-*
- split: test
path: xsum_DOC_given_above_write_one_sentence/test-*
- config_name: xsum_DOC_how_would_you_rephrase_few_words
data_files:
- split: train
path: xsum_DOC_how_would_you_rephrase_few_words/train-*
- split: validation
path: xsum_DOC_how_would_you_rephrase_few_words/validation-*
- split: test
path: xsum_DOC_how_would_you_rephrase_few_words/test-*
- config_name: xsum_DOC_tldr
data_files:
- split: train
path: xsum_DOC_tldr/train-*
- split: validation
path: xsum_DOC_tldr/validation-*
- split: test
path: xsum_DOC_tldr/test-*
- config_name: xsum_DOC_write_summary_of_above
data_files:
- split: train
path: xsum_DOC_write_summary_of_above/train-*
- split: validation
path: xsum_DOC_write_summary_of_above/validation-*
- split: test
path: xsum_DOC_write_summary_of_above/test-*
- config_name: xsum_article_DOC_summary
data_files:
- split: train
path: xsum_article_DOC_summary/train-*
- split: validation
path: xsum_article_DOC_summary/validation-*
- split: test
path: xsum_article_DOC_summary/test-*
- config_name: xsum_college_roommate_asked_DOC_so_I_recap
data_files:
- split: train
path: xsum_college_roommate_asked_DOC_so_I_recap/train-*
- split: validation
path: xsum_college_roommate_asked_DOC_so_I_recap/validation-*
- split: test
path: xsum_college_roommate_asked_DOC_so_I_recap/test-*
- config_name: xsum_read_below_DOC_write_abstract
data_files:
- split: train
path: xsum_read_below_DOC_write_abstract/train-*
- split: validation
path: xsum_read_below_DOC_write_abstract/validation-*
- split: test
path: xsum_read_below_DOC_write_abstract/test-*
- config_name: xsum_summarize_DOC
data_files:
- split: train
path: xsum_summarize_DOC/train-*
- split: validation
path: xsum_summarize_DOC/validation-*
- split: test
path: xsum_summarize_DOC/test-*
- config_name: xsum_summarize_this_DOC_summary
data_files:
- split: train
path: xsum_summarize_this_DOC_summary/train-*
- split: validation
path: xsum_summarize_this_DOC_summary/validation-*
- split: test
path: xsum_summarize_this_DOC_summary/test-*
- config_name: yelp_review_full_based_on_that
data_files:
- split: train
path: yelp_review_full_based_on_that/train-*
- split: test
path: yelp_review_full_based_on_that/test-*
- config_name: yelp_review_full_format_rating
data_files:
- split: train
path: yelp_review_full_format_rating/train-*
- split: test
path: yelp_review_full_format_rating/test-*
- config_name: yelp_review_full_format_score
data_files:
- split: train
path: yelp_review_full_format_score/train-*
- split: test
path: yelp_review_full_format_score/test-*
- config_name: yelp_review_full_format_star
data_files:
- split: train
path: yelp_review_full_format_star/train-*
- split: test
path: yelp_review_full_format_star/test-*
- config_name: yelp_review_full_on_a_scale
data_files:
- split: train
path: yelp_review_full_on_a_scale/train-*
- split: test
path: yelp_review_full_on_a_scale/test-*
- config_name: yelp_review_full_so_i_would
data_files:
- split: train
path: yelp_review_full_so_i_would/train-*
- split: test
path: yelp_review_full_so_i_would/test-*
- config_name: yelp_review_full_this_place
data_files:
- split: train
path: yelp_review_full_this_place/train-*
- split: test
path: yelp_review_full_this_place/test-*
language:
- lv
---
This is an automatically translated version of [P3 (Public Pool of Prompts)](https://huggingface.co/datasets/bigscience/P3) using [quickmt-en-lv](https://huggingface.co/quickmt/quickmt-en-lv).
### Languages
The data in P3-Latvian-Full are in Latvian (BCP-47 `lv`).
## Dataset Structure
### Data Instances
An example of "train" looks as follows:
```python
{
'answer_choices': ['mobilais tālrunis', 'televīzija', 'ledusskapis', 'lidmašīna'],
'inputs_pretokenized': 'Kura tehnoloģija tika izstrādāta pavisam nesen? Iespējas: - mobilais tālrunis - televizors - ledusskapis - lidmašīna',
'targets_pretokenized': 'mobilais tālrunis'
}
```
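A record like the one above could be fetched with the `datasets` library. The following is a minimal sketch, not an official loader: it assumes the repository id `matiss/P3-Latvian-QuickMT` and takes config names from the `config_name` entries in the card metadata. The helper names `shard_glob` and `load_split` are hypothetical.

```python
def shard_glob(config_name: str, split: str) -> str:
    """Return the glob used by the card's `data_files` entries for one split."""
    # Mirrors entries such as `path: wiki_qa_exercise/validation-*`.
    return f"{config_name}/{split}-*"


def load_split(config_name: str, split: str = "train"):
    """Load one split of one config (needs network access and `datasets`)."""
    # Imported lazily so that `shard_glob` works without `datasets` installed.
    from datasets import load_dataset

    # Assumption: the dataset lives under this repository id on the Hub.
    return load_dataset("matiss/P3-Latvian-QuickMT", config_name, split=split)
```

For instance, `load_split("wiki_qa_exercise", "validation")[0]` should return a single record as a dictionary. Not every config provides every split, so check the metadata for the splits a config lists.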
In the case of rank classification (letting the model select as its prediction the option with the highest log-likelihood), an example looks as follows:
```python
{
'idx': [5, 0],
'inputs_pretokenized': 'Es zinu, ka atbilde uz jautājumu "Ko CBS darīja otro reizi?" ir "1989. gadā CBS Records atkārtoti iekļāva mūzikas izdevējdarbības biznesu, iegādājoties Nashville mūzikas izdevēju Tree International Publishing par vairāk nekā 30 miljoniem ASV dolāru. ". Vai jūs varat man pateikt, kas tas ir?',
'is_correct': True,
'targets_pretokenized': 'atgriezās mūzikas izdevējdarbības biznesā',
'weight': 1.0
}
```
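Rank classification over such rows can be sketched as follows. This is an illustration under stated assumptions, not the evaluation code used for P3: `idx` is read as `[example_id, answer_option_id]`, and the `score` callable stands in for a model's log-likelihood of the target given the input. Both `pick_predictions` and `toy_score` are hypothetical names.

```python
from typing import Callable, Dict, Iterable, Tuple


def pick_predictions(
    rows: Iterable[dict],
    score: Callable[[str, str], float],
) -> Dict[int, int]:
    """Map each example id to the answer option id with the highest score."""
    best: Dict[int, Tuple[float, int]] = {}
    for row in rows:
        example_id, option_id = row["idx"]
        value = score(row["inputs_pretokenized"], row["targets_pretokenized"])
        # Keep the option whose score is the largest seen so far.
        if example_id not in best or value > best[example_id][0]:
            best[example_id] = (value, option_id)
    return {example_id: option for example_id, (_, option) in best.items()}


def toy_score(inputs: str, target: str) -> float:
    """Placeholder scorer (assumption): prefers shorter targets.

    A real setup would return the model's log-likelihood of `target`.
    """
    return -float(len(target))
```

Comparing the chosen option against the row whose `is_correct` is `True` then gives per-example accuracy.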
### Data Fields
The data fields are the same among all splits:
- `answer_choices`: the choices (in natural language) available to the model
- `inputs_pretokenized`: the natural language input fed to the model
- `targets_pretokenized`: the natural language target that the model has to generate
- `idx`: identifier of the (example, answer_option_id) in the case of rank classification
- `weight`: a weight for the example produced by seqio (always set to 1.0 in practice)
- `is_correct`: whether the (example, answer_option_id) is the correct one
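
The fields listed above can be checked programmatically before a record is used. The sketch below assumes that every record carries at least `inputs_pretokenized` and `targets_pretokenized`, and that rank-classification records additionally carry `idx`, `is_correct`, and `weight`. The function names are hypothetical.

```python
BASE_FIELDS = {"inputs_pretokenized", "targets_pretokenized"}
RANK_FIELDS = {"idx", "is_correct", "weight"}


def missing_base_fields(record: dict) -> list:
    """Return the sorted names of the base fields absent from a record."""
    return sorted(BASE_FIELDS - set(record))


def is_rank_classification(record: dict) -> bool:
    """True when the record has all fields used for rank classification."""
    return RANK_FIELDS.issubset(set(record))
```

Here `answer_choices` is treated as optional, since some configs list only the two pretokenized fields in their features.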
Provider: matiss



