ZixuanKe/fingpt_convfinqa_sup_sample_from_policy_v1.1_stepwise_dpo_chunk_5
收藏Hugging Face2024-11-23 更新2024-12-14 收录
下载链接:
https://hf-mirror.com/datasets/ZixuanKe/fingpt_convfinqa_sup_sample_from_policy_v1.1_stepwise_dpo_chunk_5
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含1414个训练样本,每个样本包含四个字符串类型的特征:prompt(提示)、rejected(被拒绝的响应)、chosen(被选中的响应)和justification(理由)。数据集总大小为8778104字节,下载大小为1448219字节。数据仅包含一个训练集分割。
The dataset contains 1414 training samples, each with four string-type features: prompt, rejected, chosen, and justification. The total size of the dataset is 8778104 bytes, with a download size of 1448219 bytes. The data includes only a training split.
提供机构:
ZixuanKe



