ZixuanKe/fingpt_convfinqa_sup_sample_from_policy_v1.1_stepwise_dpo_binarized

Name: ZixuanKe/fingpt_convfinqa_sup_sample_from_policy_v1.1_stepwise_dpo_binarized
Creator: ZixuanKe
Published: 2024-11-25 20:01:03
License: 暂无描述

Hugging Face2024-11-25 更新2024-12-14 收录

下载链接：

https://hf-mirror.com/datasets/ZixuanKe/fingpt_convfinqa_sup_sample_from_policy_v1.1_stepwise_dpo_binarized

下载链接

链接失效反馈

官方服务：

资源简介：

该数据集包含多个特征，如提示、被拒绝的响应、被选中的响应、理由以及与llama3模型相关的长度信息。数据集分为训练集和验证集，训练集包含28286个样本，验证集包含1362个样本。数据集的下载大小为29559377字节，总大小为181271301字节。

The dataset includes multiple features such as prompt, rejected response, chosen response, justification, and length information related to the llama3 model. The dataset is divided into a training set and a validation set, with 28286 and 1362 samples respectively. The download size of the dataset is 29559377 bytes, and the total size is 181271301 bytes.

提供机构：

ZixuanKe

5,000+

优质数据集

54 个

任务类型

进入经典数据集