youjunhyeok/llama3_train
收藏Hugging Face2024-06-17 更新2024-06-12 收录
下载链接:
https://hf-mirror.com/datasets/youjunhyeok/llama3_train
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含多个特征,包括对话、来源和文本。对话特征是一个列表,包含两个子特征:from和value,均为字符串类型。数据集包含一个训练集,大小为2468110466字节,包含1122566个示例。数据集的下载大小为1166783245字节,总大小为2468110466字节。数据集的配置文件指定了默认配置,数据文件路径为data/train-*。
The dataset contains multiple features, including conversations, source, and text. The conversations feature is a list containing two sub-features: from and value, both of which are of string type. The dataset includes a training set with a size of 2468110466 bytes, containing 1122566 examples. The download size of the dataset is 1166783245 bytes, and the total size is 2468110466 bytes. The datasets configuration file specifies the default configuration, with data file paths as data/train-*.
提供机构:
youjunhyeok
原始信息汇总
数据集概述
数据集特征
- conversations
- from: 数据类型为字符串
- value: 数据类型为字符串
- src: 数据类型为字符串
- text: 数据类型为字符串
数据集分割
- train
- num_bytes: 2468110466 字节
- num_examples: 1122566 条记录
数据集大小
- download_size: 1166783245 字节
- dataset_size: 2468110466 字节
配置
- config_name: default
- data_files
- split: train
- path: data/train-*
- split: train
- data_files



