five

Chain-of-Thought collection

收藏
www.kaggle.com2023-06-19 更新2025-03-23 收录
下载链接:
https://www.kaggle.com/konradb/chain-of-thought-collection
下载链接
链接失效反馈
官方服务:
资源简介:
Dataset accompanying the paper "The CoT Collection: Improving Zero-shot and Few-shot Learning of Language Models via Chain-of-Thought Fine-Tuning", including 1.88M CoT rationales extracted across 1,060 tasks" - https://arxiv.org/abs/2305.14045 **From the release repo** https://github.com/kaistAI/CoT-Collection: Large Language Models (LLMs) have shown enhanced capabilities of solving novel tasks by reasoning step-by-step known as Chain-of-Thought (CoT) reasoning; how can we instill the same capability of reasoning step-by-step on unseen tasks into LMs that possess less than <100B parameters? To address this question, we first introduce the CoT Collection, a new instruction-tuning dataset that augments 1.88 million CoT rationales across 1,060 tasks. We show that continually fine-tuning Flan-T5 (3B & 11B) with the CoT Collection enables the 3B & 11B LMs to perform CoT better on unseen tasks, leading to an improvement in the average zero-shot accuracy on 27 datasets of the BIG-Bench-Hard benchmark by +4.34% and +2.44%, respectively. Furthermore, we show that instruction tuning with CoT allows LMs to possess stronger few-shot learning capabilities, resulting in an improvement of +2.97% and +2.37% on 4 domain-specific tasks over Flan-T5 (3B & 11B), respectively.

随论文《The CoT Collection: 通过思维链微调提升大语言模型的零样本和少样本学习》附带的语料库,包含跨越1,060项任务的188万条思维链(CoT)解释。该语料库旨在通过增强Flan-T5(3B & 11B参数规模)模型,赋予其逐步推理的能力,以解决参数规模低于100亿的未见过的新任务。为此,我们首先提出了CoT Collection,这是一个新的指令微调数据集,它丰富了1.88百万条思维链解释。研究表明,通过持续微调Flan-T5模型,能够显著提升其在未见任务上的思维链推理能力,进而使得3B和11B参数规模的模型在BIG-Bench-Hard基准测试的27个数据集上的零样本平均准确率分别提升了4.34%和2.44%。此外,我们还发现,结合思维链的指令微调能够增强大语言模型的少样本学习能力,相对于Flan-T5(3B & 11B),在4个特定领域的任务上分别提升了2.97%和2.37%。
提供机构:
www.kaggle.com
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作