luca0621/multi-RLHF-processed-llama3B-dataset-with-1000-rewards
收藏Hugging Face2024-11-29 更新2024-12-14 收录
下载链接:
https://hf-mirror.com/datasets/luca0621/multi-RLHF-processed-llama3B-dataset-with-1000-rewards
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含三个主要特征:查询(query)、响应(response)和奖励(reward)。查询和响应为字符串类型,奖励为浮点数类型。数据集分为训练集和测试集,训练集包含4000个样本,测试集包含1000个样本。数据集的下载大小为1840456字节,总大小为5738026字节。配置部分指定了数据文件的路径,训练集数据文件路径为data/train-*,测试集数据文件路径为data/test-*。
The dataset contains three main features: query, response, and reward. The query and response are of string type, while the reward is of float64 type. The dataset is divided into a training set and a test set, with the training set containing 4000 samples and the test set containing 1000 samples. The download size of the dataset is 1840456 bytes, and the total size is 5738026 bytes. The configuration section specifies the paths to the data files, with the training set data files located at data/train-* and the test set data files at data/test-*.
提供机构:
luca0621



