ZixuanKe/flare_finqa_sup_sample_from_policy_v1.1_stepwise_dpo_chunk_6

Name: ZixuanKe/flare_finqa_sup_sample_from_policy_v1.1_stepwise_dpo_chunk_6
Creator: ZixuanKe
Published: 2024-11-26 04:37:40
License: Not specified

Source: Hugging Face. Updated: 2024-11-26. Indexed: 2024-12-14.

Download link:

https://hf-mirror.com/datasets/ZixuanKe/flare_finqa_sup_sample_from_policy_v1.1_stepwise_dpo_chunk_6

Description:

This dataset is intended for model training. Each sample contains four fields: a user prompt (prompt), a rejected response (rejected), a chosen response (chosen), and a justification explaining why the chosen response was selected (justification). The dataset has a single training split with 1,642 samples.
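As a rough illustration of how these fields might be consumed, the sketch below loads the training split and reduces each record to a prompt/chosen/rejected triple. It assumes the Hugging Face `datasets` package is installed, that the split is named `train`, and that the columns carry exactly the names listed in the description. The helpers `to_preference_pair` and `load_train_split` are hypothetical names introduced here and are not part of the dataset or any library.

```python
from typing import Any, Dict

# Column names taken from the dataset description; the value types are
# an assumption (the listing does not state them).
EXPECTED_FIELDS = ("prompt", "rejected", "chosen", "justification")

DATASET_ID = "ZixuanKe/flare_finqa_sup_sample_from_policy_v1.1_stepwise_dpo_chunk_6"


def to_preference_pair(record: Dict[str, Any]) -> Dict[str, Any]:
    """Keep only the prompt/chosen/rejected triple from one record.

    Raises KeyError if any of the four described fields is absent.
    """
    missing = [name for name in EXPECTED_FIELDS if name not in record]
    if missing:
        raise KeyError(f"record is missing fields: {missing}")
    return {
        "prompt": record["prompt"],
        "chosen": record["chosen"],
        "rejected": record["rejected"],
    }


def load_train_split():
    """Download the training split (needs network access and `datasets`)."""
    # Imported lazily so the helper above works without the package installed.
    from datasets import load_dataset

    return load_dataset(DATASET_ID, split="train")
```

A typical use would be `pairs = [to_preference_pair(r) for r in load_train_split()]`. The justification field is dropped here only because a plain preference pair does not need it; keep it if the explanation text is useful for analysis.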

Provided by:

ZixuanKe
