NemoSheng/codefuse_luban_sharegpt
收藏Hugging Face2024-07-18 更新2024-07-22 收录
下载链接:
https://hf-mirror.com/datasets/NemoSheng/codefuse_luban_sharegpt
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含对话和工具信息,主要用于训练和测试模型。对话部分由from和value两个字段组成,均为字符串类型。工具部分也是字符串类型。数据集分为训练集和测试集,训练集包含10214个样本,测试集包含259个样本。数据集的下载大小为38910497字节,总大小为57538624字节。
This dataset contains conversations and tools information, primarily used for training and testing models. The conversation part consists of two fields: from and value, both of which are string types. The tools part is also of string type. The dataset is divided into a training set and a test set, with the training set containing 10214 samples and the test set containing 259 samples. The download size of the dataset is 38910497 bytes, and the total size is 57538624 bytes.
提供机构:
NemoSheng
原始信息汇总
数据集概述
数据集信息
特征
- conversations:
- from: 字符串类型
- value: 字符串类型
- tools: 字符串类型
数据分割
- train:
- 字节数: 56056181.0
- 样本数: 10214
- test:
- 字节数: 1482443.0
- 样本数: 259
数据大小
- 下载大小: 38910497
- 数据集大小: 57538624.0
配置
- config_name: default
- data_files:
- train: data/train-*
- test: data/test-*
- data_files:



