ZixuanKe/flare_finqa_sup_sample_from_policy_v1.1_stepwise_dpo_chunk_18
收藏Hugging Face2024-11-26 更新2024-12-14 收录
下载链接:
https://hf-mirror.com/datasets/ZixuanKe/flare_finqa_sup_sample_from_policy_v1.1_stepwise_dpo_chunk_18
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含四个特征:提示(prompt)、被拒绝的响应(rejected)、被选中的响应(chosen)和理由(justification)。数据集仅包含一个训练集(train),共有1642个样本,文件大小为9065695字节。下载大小为1763699字节。该数据集可能用于训练模型以区分被拒绝和被选中的响应,并可能涉及某种形式的理由或解释。
The dataset contains four features: prompt, rejected, chosen, and justification. It includes only a training set (train) with 1642 examples, and the file size is 9065695 bytes. The download size is 1763699 bytes. This dataset is likely used for training models to distinguish between rejected and chosen responses, possibly involving some form of justification or explanation.
提供机构:
ZixuanKe



