five

aslawliet/flan2021-full

收藏
Hugging Face2024-04-19 更新2024-06-22 收录
下载链接:
https://hf-mirror.com/datasets/aslawliet/flan2021-full
下载链接
链接失效反馈
官方服务:
资源简介:
--- license: cc-by-4.0 task_categories: - text-generation - text-classification - token-classification - question-answering - zero-shot-classification - translation - summarization language: - en size_categories: - 10M<n<100M --- # Task Name - **FLAN-2021 -> 70** ```json { "ag_news_subset": 108497, "ai2_arc/ARC-Challenge": 829, "ai2_arc/ARC-Easy": 1927, "aeslc": 13187, "anli/r1": 15361, "anli/r2": 41133, "anli/r3": 91048, "bool_q": 8343, "cnn_dailymail": 259607, "coqa": 6456, "cosmos_qa": 22996, "definite_pronoun_resolution": 1079, "drop": 70045, "fix_punct": 25690, "gem/common_gen": 60936, "gem/dart": 56724, "gem/e2e_nlg": 30337, "gem/web_nlg_en": 31899, "gem/wiki_lingua_english_en": 89452, "gigaword": 1853123, "glue/cola": 7594, "glue/mnli": 711413, "glue/mrpc": 3117, "glue/qnli": 94453, "glue/qqp": 329860, "glue/sst2": 61011, "glue/stsb": 5085, "glue/wnli": 600, "hellaswag": 35941, "xsum": 184162, "imdb_reviews/plain_text": 22725, "lambada": 4467, "math_dataset/algebra__linear_1d": 1814247, "multi_news": 40646, "natural_questions_open": 79342, "newsroom": 900966, "openbookqa": 4471, "opinion_abstracts/idebate": 1554, "opinion_abstracts/rotten_tomatoes": 2908, "para_crawl_enes": 27430, "paws_wiki": 44831, "piqa": 14594, "quac": 75448, "samsum": 13232, "sentiment140": 1451736, "snli": 498328, "squad/v1": 79305, "squad/v2": 117979, "story_cloze": 1538, "super_glue/cb": 165, "super_glue/copa": 336, "super_glue/multirc": 24349, "super_glue/record": 90486, "super_glue/rte": 2064, "super_glue/wic": 4783, "super_glue/wsc": 440, "trec": 4679, "trivia_qa": 79623, "true_case": 26581, "unified_qa_science_inst": 560, "winogrande": 36218, "word_segment": 27256, "wmt14_translate/fr-en": 9070285, "wmt16_translate/cs-en": 9066896, "wmt16_translate/de-en": 4124373, "wmt16_translate/fi-en": 1880481, "wmt16_translate/ro-en": 553110, "wmt16_translate/ru-en": 2280872, "wmt16_translate/tr-en": 186016, "yelp_polarity_reviews": 507373 } ```

许可协议:CC BY 4.0 任务类别: - 文本生成 - 文本分类 - 词元(Token)分类 - 问答 - 零样本分类 - 机器翻译 - 文本摘要 语言:英语 样本规模:1000万 < 样本数量 < 1亿 # 任务名称 - **FLAN-2021 共包含70个子数据集** json { "AG新闻子集(ag_news_subset)": 108497, "AI2ARC挑战赛数据集(ai2_arc/ARC-Challenge)": 829, "AI2ARC基础题数据集(ai2_arc/ARC-Easy)": 1927, "AESLC语料库(aeslc)": 13187, "自然语言推理数据集v1(anli/r1)": 15361, "自然语言推理数据集v2(anli/r2)": 41133, "自然语言推理数据集v3(anli/r3)": 91048, "布尔问答数据集(bool_q)": 8343, "CNN/Daily Mail新闻摘要数据集(cnn_dailymail)": 259607, "CoQA对话问答数据集(coqa)": 6456, "Cosmos QA常识问答数据集(cosmos_qa)": 22996, "明确代词消解数据集(definite_pronoun_resolution)": 1079, "DROP阅读理解数据集(drop)": 70045, "标点修正数据集(fix_punct)": 25690, "GEM通用生成数据集(gem/common_gen)": 60936, "GEM Dart数据集(gem/dart)": 56724, "GEM E2E NLG数据集(gem/e2e_nlg)": 30337, "GEM Web NLG英文数据集(gem/web_nlg_en)": 31899, "GEM Wiki Lingua英文数据集(gem/wiki_lingua_english_en)": 89452, "Gigaword语料库(gigaword)": 1853123, "GLUE基准/语言可接受性语料库(glue/cola)": 7594, "GLUE基准/多体裁自然语言推理语料库(glue/mnli)": 711413, "GLUE基准/微软研究释义语料库(glue/mrpc)": 3117, "GLUE基准/问题自然语言推理数据集(glue/qnli)": 94453, "GLUE基准/问题对相似度数据集(glue/qqp)": 329860, "GLUE基准/斯坦福情感树库(glue/sst2)": 61011, "GLUE基准/语义文本相似度基准(glue/stsb)": 5085, "GLUE基准/Winograd Schema自然语言推理数据集(glue/wnli)": 600, "HellaSwag常识推理数据集(hellaswag)": 35941, "XSum摘要数据集(xsum)": 184162, "IMDB影评文本数据集(imdb_reviews/plain_text)": 22725, "Lambada语言建模数据集(lambada)": 4467, "数学数据集/一维线性代数题(math_dataset/algebra__linear_1d)": 1814247, "Multi News多文档摘要数据集(multi_news)": 40646, "开放域自然问答数据集(natural_questions_open)": 79342, "Newsroom新闻摘要数据集(newsroom)": 900966, "OpenBookQA开放书本问答数据集(openbookqa)": 4471, "观点摘要/Idebate辩论数据集(opinion_abstracts/idebate)": 1554, "观点摘要/烂番茄影评数据集(opinion_abstracts/rotten_tomatoes)": 2908, "ParaCrawl英西平行语料库(para_crawl_enes)": 27430, "PAWS-Wiki释义识别数据集(paws_wiki)": 44831, "PIQA物理常识推理数据集(piqa)": 14594, "QuAC对话问答数据集(quac)": 75448, "SAMSum对话摘要数据集(samsum)": 13232, "Sentiment140情感分析数据集(sentiment140)": 1451736, "SNLI斯坦福自然语言推理数据集(snli)": 498328, "SQuAD v1阅读理解数据集(squad/v1)": 79305, "SQuAD v2阅读理解数据集(squad/v2)": 117979, "Story Cloze故事完形填空数据集(story_cloze)": 1538, "SuperGLUE/承诺银行数据集(super_glue/cb)": 165, "SuperGLUE/因果推断数据集(super_glue/copa)": 336, "SuperGLUE/MultiRC多段落阅读理解数据集(super_glue/multirc)": 24349, "SuperGLUE/ReCoRD阅读理解数据集(super_glue/record)": 90486, "SuperGLUE/识别文本蕴涵数据集(super_glue/rte)": 2064, "SuperGLUE/单词义消歧数据集(super_glue/wic)": 4783, "SuperGLUE/Winograd Schema挑战集(super_glue/wsc)": 440, "TREC文本分类数据集(trec)": 4679, "TriviaQA开放域问答数据集(trivia_qa)": 79623, "大小写还原数据集(true_case)": 26581, "Unified QA科学指令数据集(unified_qa_science_inst)": 560, "Winogrande常识推理数据集(winogrande)": 36218, "分词数据集(word_segment)": 27256, "WMT14法英翻译数据集(wmt14_translate/fr-en)": 9070285, "WMT16捷英翻译数据集(wmt16_translate/cs-en)": 9066896, "WMT16德英翻译数据集(wmt16_translate/de-en)": 4124373, "WMT16芬英翻译数据集(wmt16_translate/fi-en)": 1880481, "WMT16罗英翻译数据集(wmt16_translate/ro-en)": 553110, "WMT16俄英翻译数据集(wmt16_translate/ru-en)": 2280872, "WMT16土英翻译数据集(wmt16_translate/tr-en)": 186016, "Yelp极性影评数据集(yelp_polarity_reviews)": 507373 }
提供机构:
aslawliet
原始信息汇总

数据集概述

许可证

  • CC BY 4.0

任务类别

  • 文本生成
  • 文本分类
  • 标记分类
  • 问答
  • 零样本分类
  • 翻译
  • 摘要

语言

  • 英语

数据集大小

  • 10M < n < 100M

数据集详情

  • FLAN-2021 -> 70
    • ag_news_subset: 108497
    • ai2_arc/ARC-Challenge: 829
    • ai2_arc/ARC-Easy: 1927
    • aeslc: 13187
    • anli/r1: 15361
    • anli/r2: 41133
    • anli/r3: 91048
    • bool_q: 8343
    • cnn_dailymail: 259607
    • coqa: 6456
    • cosmos_qa: 22996
    • definite_pronoun_resolution: 1079
    • drop: 70045
    • fix_punct: 25690
    • gem/common_gen: 60936
    • gem/dart: 56724
    • gem/e2e_nlg: 30337
    • gem/web_nlg_en: 31899
    • gem/wiki_lingua_english_en: 89452
    • gigaword: 1853123
    • glue/cola: 7594
    • glue/mnli: 711413
    • glue/mrpc: 3117
    • glue/qnli: 94453
    • glue/qqp: 329860
    • glue/sst2: 61011
    • glue/stsb: 5085
    • glue/wnli: 600
    • hellaswag: 35941
    • xsum: 184162
    • imdb_reviews/plain_text: 22725
    • lambada: 4467
    • math_dataset/algebra__linear_1d: 1814247
    • multi_news: 40646
    • natural_questions_open: 79342
    • newsroom: 900966
    • openbookqa: 4471
    • opinion_abstracts/idebate: 1554
    • opinion_abstracts/rotten_tomatoes: 2908
    • para_crawl_enes: 27430
    • paws_wiki: 44831
    • piqa: 14594
    • quac: 75448
    • samsum: 13232
    • sentiment140: 1451736
    • snli: 498328
    • squad/v1: 79305
    • squad/v2: 117979
    • story_cloze: 1538
    • super_glue/cb: 165
    • super_glue/copa: 336
    • super_glue/multirc: 24349
    • super_glue/record: 90486
    • super_glue/rte: 2064
    • super_glue/wic: 4783
    • super_glue/wsc: 440
    • trec: 4679
    • trivia_qa: 79623
    • true_case: 26581
    • unified_qa_science_inst: 560
    • winogrande: 36218
    • word_segment: 27256
    • wmt14_translate/fr-en: 9070285
    • wmt16_translate/cs-en: 9066896
    • wmt16_translate/de-en: 4124373
    • wmt16_translate/fi-en: 1880481
    • wmt16_translate/ro-en: 553110
    • wmt16_translate/ru-en: 2280872
    • wmt16_translate/tr-en: 186016
    • yelp_polarity_reviews: 507373
搜集汇总
数据集介绍
main_image_url
以上内容由遇见数据集搜集并总结生成
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作