five

cornfieldrm/pair-preference-dataset-700K_subset-4-of-4_llama3-8b-it_0.25_sft_conf-0.8_slic

收藏
Hugging Face2024-06-05 更新2024-06-12 收录
下载链接:
https://hf-mirror.com/datasets/cornfieldrm/pair-preference-dataset-700K_subset-4-of-4_llama3-8b-it_0.25_sft_conf-0.8_slic
下载链接
链接失效反馈
官方服务:
资源简介:
--- dataset_info: features: - name: messages list: - name: content dtype: string - name: role dtype: string splits: - name: train num_bytes: 237120693.0 num_examples: 70660 download_size: 116641962 dataset_size: 237120693.0 configs: - config_name: default data_files: - split: train path: data/train-* ---

The dataset includes a feature named messages, which is a list containing elements with two fields: content and role, both of which are of string type. The dataset is split into a train set with 70,660 examples and a total byte size of 237,120,693.0. The download size of the dataset is 116,641,962 bytes, while the actual size of the dataset is 237,120,693.0 bytes. The dataset has a default configuration that includes a data file named train with the path data/train-*.
提供机构:
cornfieldrm
原始信息汇总

数据集概述

数据特征

  • messages: 包含以下子特征
    • content: 数据类型为字符串
    • role: 数据类型为字符串

数据分割

  • train:
    • 字节数: 237120693.0
    • 样本数: 70660

数据集大小

  • 下载大小: 116641962
  • 数据集大小: 237120693.0

配置信息

  • default:
    • 数据文件路径: data/train-*
二维码
社区交流群
二维码
科研交流群
商业服务