aslawliet/flan2021-full
收藏Hugging Face2024-04-19 更新2024-06-22 收录
下载链接:
https://hf-mirror.com/datasets/aslawliet/flan2021-full
下载链接
链接失效反馈官方服务:
资源简介:
---
license: cc-by-4.0
task_categories:
- text-generation
- text-classification
- token-classification
- question-answering
- zero-shot-classification
- translation
- summarization
language:
- en
size_categories:
- 10M<n<100M
---
# Task Name
- **FLAN-2021 -> 70**
```json
{
"ag_news_subset": 108497,
"ai2_arc/ARC-Challenge": 829,
"ai2_arc/ARC-Easy": 1927,
"aeslc": 13187,
"anli/r1": 15361,
"anli/r2": 41133,
"anli/r3": 91048,
"bool_q": 8343,
"cnn_dailymail": 259607,
"coqa": 6456,
"cosmos_qa": 22996,
"definite_pronoun_resolution": 1079,
"drop": 70045,
"fix_punct": 25690,
"gem/common_gen": 60936,
"gem/dart": 56724,
"gem/e2e_nlg": 30337,
"gem/web_nlg_en": 31899,
"gem/wiki_lingua_english_en": 89452,
"gigaword": 1853123,
"glue/cola": 7594,
"glue/mnli": 711413,
"glue/mrpc": 3117,
"glue/qnli": 94453,
"glue/qqp": 329860,
"glue/sst2": 61011,
"glue/stsb": 5085,
"glue/wnli": 600,
"hellaswag": 35941,
"xsum": 184162,
"imdb_reviews/plain_text": 22725,
"lambada": 4467,
"math_dataset/algebra__linear_1d": 1814247,
"multi_news": 40646,
"natural_questions_open": 79342,
"newsroom": 900966,
"openbookqa": 4471,
"opinion_abstracts/idebate": 1554,
"opinion_abstracts/rotten_tomatoes": 2908,
"para_crawl_enes": 27430,
"paws_wiki": 44831,
"piqa": 14594,
"quac": 75448,
"samsum": 13232,
"sentiment140": 1451736,
"snli": 498328,
"squad/v1": 79305,
"squad/v2": 117979,
"story_cloze": 1538,
"super_glue/cb": 165,
"super_glue/copa": 336,
"super_glue/multirc": 24349,
"super_glue/record": 90486,
"super_glue/rte": 2064,
"super_glue/wic": 4783,
"super_glue/wsc": 440,
"trec": 4679,
"trivia_qa": 79623,
"true_case": 26581,
"unified_qa_science_inst": 560,
"winogrande": 36218,
"word_segment": 27256,
"wmt14_translate/fr-en": 9070285,
"wmt16_translate/cs-en": 9066896,
"wmt16_translate/de-en": 4124373,
"wmt16_translate/fi-en": 1880481,
"wmt16_translate/ro-en": 553110,
"wmt16_translate/ru-en": 2280872,
"wmt16_translate/tr-en": 186016,
"yelp_polarity_reviews": 507373
}
```
许可协议:CC BY 4.0
任务类别:
- 文本生成
- 文本分类
- 词元(Token)分类
- 问答
- 零样本分类
- 机器翻译
- 文本摘要
语言:英语
样本规模:1000万 < 样本数量 < 1亿
# 任务名称
- **FLAN-2021 共包含70个子数据集**
json
{
"AG新闻子集(ag_news_subset)": 108497,
"AI2ARC挑战赛数据集(ai2_arc/ARC-Challenge)": 829,
"AI2ARC基础题数据集(ai2_arc/ARC-Easy)": 1927,
"AESLC语料库(aeslc)": 13187,
"自然语言推理数据集v1(anli/r1)": 15361,
"自然语言推理数据集v2(anli/r2)": 41133,
"自然语言推理数据集v3(anli/r3)": 91048,
"布尔问答数据集(bool_q)": 8343,
"CNN/Daily Mail新闻摘要数据集(cnn_dailymail)": 259607,
"CoQA对话问答数据集(coqa)": 6456,
"Cosmos QA常识问答数据集(cosmos_qa)": 22996,
"明确代词消解数据集(definite_pronoun_resolution)": 1079,
"DROP阅读理解数据集(drop)": 70045,
"标点修正数据集(fix_punct)": 25690,
"GEM通用生成数据集(gem/common_gen)": 60936,
"GEM Dart数据集(gem/dart)": 56724,
"GEM E2E NLG数据集(gem/e2e_nlg)": 30337,
"GEM Web NLG英文数据集(gem/web_nlg_en)": 31899,
"GEM Wiki Lingua英文数据集(gem/wiki_lingua_english_en)": 89452,
"Gigaword语料库(gigaword)": 1853123,
"GLUE基准/语言可接受性语料库(glue/cola)": 7594,
"GLUE基准/多体裁自然语言推理语料库(glue/mnli)": 711413,
"GLUE基准/微软研究释义语料库(glue/mrpc)": 3117,
"GLUE基准/问题自然语言推理数据集(glue/qnli)": 94453,
"GLUE基准/问题对相似度数据集(glue/qqp)": 329860,
"GLUE基准/斯坦福情感树库(glue/sst2)": 61011,
"GLUE基准/语义文本相似度基准(glue/stsb)": 5085,
"GLUE基准/Winograd Schema自然语言推理数据集(glue/wnli)": 600,
"HellaSwag常识推理数据集(hellaswag)": 35941,
"XSum摘要数据集(xsum)": 184162,
"IMDB影评文本数据集(imdb_reviews/plain_text)": 22725,
"Lambada语言建模数据集(lambada)": 4467,
"数学数据集/一维线性代数题(math_dataset/algebra__linear_1d)": 1814247,
"Multi News多文档摘要数据集(multi_news)": 40646,
"开放域自然问答数据集(natural_questions_open)": 79342,
"Newsroom新闻摘要数据集(newsroom)": 900966,
"OpenBookQA开放书本问答数据集(openbookqa)": 4471,
"观点摘要/Idebate辩论数据集(opinion_abstracts/idebate)": 1554,
"观点摘要/烂番茄影评数据集(opinion_abstracts/rotten_tomatoes)": 2908,
"ParaCrawl英西平行语料库(para_crawl_enes)": 27430,
"PAWS-Wiki释义识别数据集(paws_wiki)": 44831,
"PIQA物理常识推理数据集(piqa)": 14594,
"QuAC对话问答数据集(quac)": 75448,
"SAMSum对话摘要数据集(samsum)": 13232,
"Sentiment140情感分析数据集(sentiment140)": 1451736,
"SNLI斯坦福自然语言推理数据集(snli)": 498328,
"SQuAD v1阅读理解数据集(squad/v1)": 79305,
"SQuAD v2阅读理解数据集(squad/v2)": 117979,
"Story Cloze故事完形填空数据集(story_cloze)": 1538,
"SuperGLUE/承诺银行数据集(super_glue/cb)": 165,
"SuperGLUE/因果推断数据集(super_glue/copa)": 336,
"SuperGLUE/MultiRC多段落阅读理解数据集(super_glue/multirc)": 24349,
"SuperGLUE/ReCoRD阅读理解数据集(super_glue/record)": 90486,
"SuperGLUE/识别文本蕴涵数据集(super_glue/rte)": 2064,
"SuperGLUE/单词义消歧数据集(super_glue/wic)": 4783,
"SuperGLUE/Winograd Schema挑战集(super_glue/wsc)": 440,
"TREC文本分类数据集(trec)": 4679,
"TriviaQA开放域问答数据集(trivia_qa)": 79623,
"大小写还原数据集(true_case)": 26581,
"Unified QA科学指令数据集(unified_qa_science_inst)": 560,
"Winogrande常识推理数据集(winogrande)": 36218,
"分词数据集(word_segment)": 27256,
"WMT14法英翻译数据集(wmt14_translate/fr-en)": 9070285,
"WMT16捷英翻译数据集(wmt16_translate/cs-en)": 9066896,
"WMT16德英翻译数据集(wmt16_translate/de-en)": 4124373,
"WMT16芬英翻译数据集(wmt16_translate/fi-en)": 1880481,
"WMT16罗英翻译数据集(wmt16_translate/ro-en)": 553110,
"WMT16俄英翻译数据集(wmt16_translate/ru-en)": 2280872,
"WMT16土英翻译数据集(wmt16_translate/tr-en)": 186016,
"Yelp极性影评数据集(yelp_polarity_reviews)": 507373
}
提供机构:
aslawliet
原始信息汇总
数据集概述
许可证
- CC BY 4.0
任务类别
- 文本生成
- 文本分类
- 标记分类
- 问答
- 零样本分类
- 翻译
- 摘要
语言
- 英语
数据集大小
- 10M < n < 100M
数据集详情
- FLAN-2021 -> 70
ag_news_subset: 108497ai2_arc/ARC-Challenge: 829ai2_arc/ARC-Easy: 1927aeslc: 13187anli/r1: 15361anli/r2: 41133anli/r3: 91048bool_q: 8343cnn_dailymail: 259607coqa: 6456cosmos_qa: 22996definite_pronoun_resolution: 1079drop: 70045fix_punct: 25690gem/common_gen: 60936gem/dart: 56724gem/e2e_nlg: 30337gem/web_nlg_en: 31899gem/wiki_lingua_english_en: 89452gigaword: 1853123glue/cola: 7594glue/mnli: 711413glue/mrpc: 3117glue/qnli: 94453glue/qqp: 329860glue/sst2: 61011glue/stsb: 5085glue/wnli: 600hellaswag: 35941xsum: 184162imdb_reviews/plain_text: 22725lambada: 4467math_dataset/algebra__linear_1d: 1814247multi_news: 40646natural_questions_open: 79342newsroom: 900966openbookqa: 4471opinion_abstracts/idebate: 1554opinion_abstracts/rotten_tomatoes: 2908para_crawl_enes: 27430paws_wiki: 44831piqa: 14594quac: 75448samsum: 13232sentiment140: 1451736snli: 498328squad/v1: 79305squad/v2: 117979story_cloze: 1538super_glue/cb: 165super_glue/copa: 336super_glue/multirc: 24349super_glue/record: 90486super_glue/rte: 2064super_glue/wic: 4783super_glue/wsc: 440trec: 4679trivia_qa: 79623true_case: 26581unified_qa_science_inst: 560winogrande: 36218word_segment: 27256wmt14_translate/fr-en: 9070285wmt16_translate/cs-en: 9066896wmt16_translate/de-en: 4124373wmt16_translate/fi-en: 1880481wmt16_translate/ro-en: 553110wmt16_translate/ru-en: 2280872wmt16_translate/tr-en: 186016yelp_polarity_reviews: 507373
搜集汇总
数据集介绍

以上内容由遇见数据集搜集并总结生成



