DeL-TaiseiOzaki/Tengentoppa-sft-base-v1.0

Source: Hugging Face. Updated 2024-11-27. Indexed 2024-12-14.
Download link:
https://hf-mirror.com/datasets/DeL-TaiseiOzaki/Tengentoppa-sft-base-v1.0
Description:
This is a supervised learning dataset created by integrating 12 Japanese instruction-following datasets. It draws on several kinds of sources, including conversational data, question answering, and reasoning tasks, and is intended mainly for model fine-tuning. The data is in JSON format, and each data point has three fields: instruction, input, and output. Each sub-dataset has its own source and characteristics. Before using the dataset, check the license of each source dataset, and be aware of data quality, potential masking, and possible context loss in the conversational data.
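As a rough illustration of the record layout described above, the sketch below parses a small JSON sample and turns each record into a (prompt, response) pair. Only the three field names (instruction, input, output) come from the description. The sample values, the helper names `validate_record` and `build_prompt`, and the way instruction and input are joined are assumptions made for this example, not part of the dataset.

```python
import json

# Hypothetical records: the field names follow the dataset description,
# but the values are invented for illustration only.
SAMPLE_JSON = """
[
  {"instruction": "次の文を英語に翻訳してください。",
   "input": "今日は良い天気です。",
   "output": "The weather is nice today."},
  {"instruction": "日本の首都はどこですか。",
   "input": "",
   "output": "日本の首都は東京です。"}
]
"""

REQUIRED_FIELDS = ("instruction", "input", "output")


def validate_record(record):
    """Return True if all three expected fields are present as strings."""
    return all(isinstance(record.get(key), str) for key in REQUIRED_FIELDS)


def build_prompt(record):
    """Join instruction and optional input into one prompt.

    The blank-line separator is an assumed template, not one
    prescribed by the dataset.
    """
    if record["input"].strip():
        return f"{record['instruction']}\n\n{record['input']}"
    return record["instruction"]


records = json.loads(SAMPLE_JSON)
valid = [r for r in records if validate_record(r)]
pairs = [(build_prompt(r), r["output"]) for r in valid]

for prompt, response in pairs:
    print(repr(prompt), "->", repr(response))
```

Records with an empty input field fall back to the instruction alone, which is one common way to handle optional context in instruction-tuning data. Filtering with `validate_record` before training is a cheap guard against malformed entries in a merged dataset.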
Provider:
DeL-TaiseiOzaki