Muennighoff/natural-instructions
Source: Hugging Face · Updated 2022-12-23 · Indexed 2024-03-04
Download link:
https://hf-mirror.com/datasets/Muennighoff/natural-instructions
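Resource summary:
This dataset is a preprocessed version of Super-Natural-Instructions, derived from https://github.com/allenai/natural-instructions/tree/master/splits. The same input may appear with different outputs, so records can be deduplicated on the `id` or `inputs` field. The dataset contains many training tasks, covering question and answer generation, classification, text generation, and other task types.
Provider: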
Muennighoff
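Summary of original information
Dataset overview
Basic information
- Language: English (en)
- Multilinguality: monolingual
- Size category: 100M<n<1B
- Task category: other (other)
- Annotation creators: crowdsourced and expert-generated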
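Dataset contents
This dataset is a preprocessed version of Super-Natural-Instructions, derived from AllenAI's natural-instructions project. It may contain duplicate inputs, which can be removed by deduplicating on the `id` or `inputs` field.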
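The deduplication described above can be sketched in plain Python. This is a minimal illustration rather than the dataset author's procedure: the `deduplicate` helper and the toy records are hypothetical, and the `targets` field is an assumed name for the output column. Only `id` and `inputs` are named in the description.

```python
from typing import Dict, Iterable, List


def deduplicate(records: Iterable[Dict[str, str]], key: str = "id") -> List[Dict[str, str]]:
    """Keep the first record seen for each distinct value of `key`."""
    seen = set()
    unique = []
    for record in records:
        value = record[key]
        if value in seen:
            # A record with this key value was already kept; skip the repeat.
            continue
        seen.add(value)
        unique.append(record)
    return unique


# Hypothetical toy records: the same input appears twice with different outputs.
# `targets` is an assumed field name used only for illustration.
sample = [
    {"id": "a-1", "inputs": "What is 2 + 2?", "targets": "4"},
    {"id": "a-1", "inputs": "What is 2 + 2?", "targets": "four"},
    {"id": "b-7", "inputs": "Name a prime number.", "targets": "7"},
]

by_id = deduplicate(sample, key="id")          # one record per `id`
by_inputs = deduplicate(sample, key="inputs")  # one record per input text
```

Keeping the first occurrence is an arbitrary choice. If the alternative outputs matter, grouping them by `inputs` instead of discarding them may be more appropriate.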
Training tasks
The dataset contains many tasks. Some examples are listed below:
- `task001_quoref_question_generation`
- `task002_quoref_answer_generation`
- `task022_cosmosqa_passage_inappropriate_binary`
- `task023_cosmosqa_question_generation`
- `task024_cosmosqa_answer_generation`
- `task025_cosmosqa_incorrect_answer_generation`
- `task026_drop_question_generation`
- `task027_drop_answer_type_generation`
- `task028_drop_answer_generation`
- `task043_essential_terms_answering_incomplete_questions`
- `task044_essential_terms_identifying_essential_words`
- `task045_miscellaneous_sentence_paraphrasing`
- `task046_miscellaneous_question_typing`
- `task047_miscellaneous_answering_science_questions`
- `task059_ropes_story_generation`
- `task060_ropes_question_generation`
- `task061_ropes_answer_generation`
- `task062_bigbench_repeat_copy_logic`
- `task063_first_i_elements`
- `task064_all_elements_except_first_i`
- `task065_timetravel_consistent_sentence_classification`
- `task066_timetravel_binary_consistency_classification`
- `task067_abductivenli_answer_generation`
- `task068_abductivenli_incorrect_answer_generation`
- `task069_abductivenli_classification`
- `task070_abductivenli_incorrect_classification`
- `task071_abductivenli_answer_generation`
- `task072_abductivenli_answer_generation`
- `task073_commonsenseqa_answer_generation`
- `task074_squad1.1_question_generation`
- `task075_squad1.1_answer_generation`
- `task076_splash_correcting_sql_mistake`
- `task077_splash_explanation_to_sql`
- `task078_all_elements_except_last_i`
- `task079_conala_concat_strings`
- `task080_piqa_answer_generation`
- `task081_piqa_wrong_answer_generation`
- `task082_babi_t1_single_supporting_fact_question_generation`
- `task083_babi_t1_single_supporting_fact_answer_generation`
- `task084_babi_t1_single_supporting_fact_identify_relevant_fact`
- `task085_unnatural_addsub_arithmetic`
- `task087_new_operator_addsub_arithmetic`
- `task088_identify_typo_verification`
- `task089_swap_words_verification`
- `task090_equation_learner_algebra`
- `task091_all_elements_from_index_i_to_j`
- `task092_check_prime_classification`
- `task093_conala_normalize_lists`
- `task094_conala_calculate_mean`
- `task095_conala_max_absolute_value`
- `task096_conala_list_index_subtraction`
- `task097_conala_remove_duplicates`
- `task098_conala_list_intersection`
- `task099_reverse_elements_between_index_i_and_j`
- `task100_concatenate_all_elements_from_index_i_to_j`
- `task101_reverse_and_concatenate_all_elements_from_index_i_to_j`
- `task103_facts2story_long_text_generation`
- `task104_semeval_2019_task10_closed_vocabulary_mathematical_answer_generation`
- `task105_story_cloze-rocstories_sentence_generation`
- `task107_splash_question_to_sql`
- `task1087_two_number_sum`
- `task1088_array_of_products`
- `task1089_check_monotonic_array`
- `task108_contextualabusedetection_classification`
- `task109_smsspamcollection_spamsmsdetection`
- `task110_logic2text_sentence_generation`
- `task111_asset_sentence_simplification`
- `task112_asset_simple_sentence_identification`
- `task1135_xcsr_en_commonsense_mc_classification`
- `task113_count_frequency_of_letter`
- `task1146_country_capital`
- `task1147_country_currency`
- `task1148_maximum_ascii_value`
- `task1149_item_check_edible`
- `task114_is_the_given_word_longest`
- `task1150_delete_max_min`
- `task1151_swap_max_min`
- `task115_help_advice_classification`
- `task1167_penn_treebank_coarse_pos_tagging`
- `task1168_brown_coarse_pos_tagging`
- `task116_com2sense_commonsense_reasoning`
- `task1186_nne_hrngo_classification`
- `task1188_count_max_freq_char`
- `task1189_check_char_in_string`
- `task118_semeval_2019_task10_open_vocabulary_mathematical_answer_generation`
- `task1190_add_integer_to_list`
- `task1191_food_veg_nonveg`
- `task1192_food_flavor_profile`
- `task1193_food_course_classification`
- `task1194_kth_largest_element`
- `task1196_atomic_classification_oeffect`
- `task1197_atomic_classification_oreact`
- `task1198_atomic_classification_owant`
- `task1199_atomic_classification_xattr`
- `task119_semeval_2019_task10_geometric_mathematical_answer_generation`
- `task1200_atomic_classification_xeffect`
- `task1201_atomic_classification_xintent`
- `task1202_atomic_classification_xneed`
- `task1203_atomic_classification_xreact`
- `task1204_atomic_classification_hinderedby`
- `task1205_atomic_classification_isafter`
- `task1206_atomic_classification_isbefore`
- `task1207_atomic_classification_atlocation`
- `task1208_atomic_classification_xreason`
- `task1209_atomic_classification_objectuse`
- `task1210_atomic_classification_madeupof`
- `task1211_atomic_classification_hassubevent`
- `task1212_atomic_classification_hasproperty`
- `task1213_atomic_classification_desires`
- `task1214_atomic_classification_xwant`
- `task1215_atomic_classification_capableof`
- `task1216_atomic_classification_causes`
- `task1217_atomic_answer_generation`
- `task122_conala_list_index_addition`
- `task123_conala_sort_dictionary`
- `task124_conala_pair_averages`
- `task125_conala_pair_differences`
- `task126_scan_structured_text_generation_command_action_all`
- `task127_scan_long_text_generation_action_command_all`
- `task1283_hrngo_quality_classification`
- `task1284_hrngo_informativeness_classification`
- `task1285_kpa_keypoint_matching`
- `task1286_openbookqa_question_answering`
- `task1288_glue_mrpc_paraphrasing`
- `task1289_trec_classification`
- `task128_scan_structured_text_generation_command_action_short`
- `task1290_xsum_summarization`
- `task1291_multi_news_summarization`
- `task1292_yelp_review_full_text_categorization`
- `task1293_kilt_tasks_hotpotqa_question_answering`
- `task1294_wiki_qa_answer_verification`
- `task1295_adversarial_qa_question_answering`
- `task1296_wiki_hop_question_answering`
- `task129_scan_long_text_generation_action_command_short`
- `task1308_amazonreview_category_classification`
- `task1309_amazonreview_summary_classification`
- `task130_scan_structured_text_generation_command_action_long`
- `task1310_amazonreview_rating_classification`
- `task1311_amazonreview_rating_classification`
- `task1312_amazonreview_polarity_classification`
- `task1313_amazonreview_polarity_classification`
- `task1314_country_abbreviation`
- `task1315_find_range_array`
- `task1316_remove_duplicates_string`
- `task1317_country_calling_code`
- `task1318_country_national_dish`
- `task1319_country_by_barcode_prefix`
- `task131_scan_long_text_generation_action_command_long`
- `task1320_country_domain_tld`
- `task1321_country_continent`
- `task1322_country_government_type`
- `task1325_qa_zre_question_generation_on_subject_relation`
- `task1326_qa_zre_question_generation_from_answer`
- `task1327_qa_zre_answer_generation_from_question`
- `task1328_qa_zre_relation_generation_from_question`
- `task132_dais_text_modification`
- `task1331_reverse_array`
- `task1332_check_leap_year`
- `task1333_check_validity_date_ddmmyyyy`
- `task1336_peixian_equity_evaluation_corpus_gender_classifier`
- `task1338_peixian_equity_evaluation_corpus_sentiment_classifier`
- `task1339_peixian_equity_evaluation_corpus_text_completion`
- `task1340_msr_text_compression_compression`
- `task1341_msr_text_classification`
- `task1346_glue_cola_grammatical_correctness_classification`
- `task1347_glue_sts-b_similarity_classification`
- `task1354_sent_comp_classification`
- `task1355_sent_comp_summarization`
- `task1359_numer_sense_answer_generation`
- `task1360_numer_sense_multiple_choice_qa_generation`
- `task1361_movierationales_classification`
- `task1364_hans_answer_generation`
- `task1366_healthfact_classification`
- `task1368_healthfact_sentence_generation`
- `task1369_healthfact_sentence_generation`
- `task1378_quarel_correct_answer_generation`
- `task1379_quarel_incorrect_answer_generation`
- `task137_detoxifying-lms_classification_toxicity`
- `task1380_quarel_correct_option_generation`
- `task1381_quarel_incorrect_option_generation`
- `task1382_quarel_write_correct_answer`
- `task1383_quarel_write_incorrect_answer`
- `task1384_deal_or_no_dialog_classification`
- `task1389_hellaswag_completion`
- `task138_detoxifying-lms_classification_fluency`
- `task1398_obqa_question_generation`
- `task1399_obqa_answer_generation`
- `task139_detoxifying-lms_classification_topicality`
- `task1400_obqa_incorrect_answer_generation`
- `task1401_obqa_sentence_generation`
- `task1403_check_validity_date_mmddyyyy`
- `task1404_date_conversion`
- `task1405_find_median`
- `task1406_kth_smallest_element`
- `task140_detoxifying-lms_classification_style`
- `task1412_web_questions_question_answering`
- `task1418_bless_semantic_relation_classification`
- `task1419_mathqa_gain`
- `task141_odd-man-out_classification_category`
- `task1420_mathqa_general`
- `task1421_mathqa_other`
- `task1422_mathqa_physics`
- `task1423_mathqa_geometry`
- `task1424_mathqa_probability`
- `task1425_country_iso_numeric`
- `task1426_country_independence_year`
- `task1427_country_region_in_world`
- `task1428_country_surface_area`
- `task1429_evalution_semantic_relation_classification`
- `task142_odd-man-out_classification_no_category`
- `task1431_head_qa_answer_generation`
- `task1434_head_qa_classification`
- `task143_odd-man-out_classification_generate_category`
- `task1443_string_to_number`
- `task1444_round_power_of_two`
- `task1445_closest_integers`
- `task1446_farthest_integers`
- `task1447_drug_extraction_ade`
- `task1448_disease_entity_extraction_ncbi_dataset`
- `task1449_disease_entity_extraction_bc5cdr_dataset`
- `task144_subjqa_question_answering`
- `task1451_drug_dose_extraction`
- `task1452_location_entity_extraction_btc_corpus`
- `task1453_person_entity_extraction_btc_corpus`
- `task145_afs_argument_



