ZixuanKe/fingpt_convfinqa_sup_sample_from_policy_v1.1_stepwise_dpo_binarized
收藏Hugging Face2024-11-25 更新2024-12-14 收录
下载链接:
https://hf-mirror.com/datasets/ZixuanKe/fingpt_convfinqa_sup_sample_from_policy_v1.1_stepwise_dpo_binarized
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含多个特征,如提示、被拒绝的响应、被选中的响应、理由以及与llama3模型相关的长度信息。数据集分为训练集和验证集,训练集包含28286个样本,验证集包含1362个样本。数据集的下载大小为29559377字节,总大小为181271301字节。
The dataset includes multiple features such as prompt, rejected response, chosen response, justification, and length information related to the llama3 model. The dataset is divided into a training set and a validation set, with 28286 and 1362 samples respectively. The download size of the dataset is 29559377 bytes, and the total size is 181271301 bytes.
提供机构:
ZixuanKe



