jaeyong2/ja-persona-cot-inst

Name: jaeyong2/ja-persona-cot-inst
Creator: jaeyong2
Published: 2024-10-26 03:20:07
License: 暂无描述

Hugging Face2024-10-26 更新2024-12-14 收录

下载链接：

https://hf-mirror.com/datasets/jaeyong2/ja-persona-cot-inst

下载链接

链接失效反馈

官方服务：

资源简介：

该数据集包含两个特征：content和text，数据类型均为字符串。数据集分为一个训练集，包含645,000个样本，总大小为2,480,381,909字节。数据集的语言为日语，许可证为cc-by-nc-sa-4.0。使用方式是通过HuggingFace的`load_dataset`函数加载数据集。开发过程包括从另一个数据集加载问题，并使用Qwen/Qwen2-72B-Instruct模型生成带有COT的答案。研究得到了TPU Research Cloud program的支持。

This dataset includes two features: content and text, both of which are of string type. The dataset is divided into a training set containing 645,000 samples, with a total size of 2,480,381,909 bytes. The dataset is in Japanese and uses the CC-BY-NC-SA-4.0 license. The development process of the dataset involves loading the question dataset from jaeyong2/persona-inst and using the Qwen/Qwen2-72B-Instruct model to generate answers with COT (Chain of Thought).

提供机构：

jaeyong2

5,000+

优质数据集

54 个

任务类型

进入经典数据集