five

jihuny/tldr_1.4b_10k_gopt_policy_unnorm_penul_mean_1_iter0

收藏
Hugging Face2025-11-03 更新2025-11-15 收录
下载链接:
https://hf-mirror.com/datasets/jihuny/tldr_1.4b_10k_gopt_policy_unnorm_penul_mean_1_iter0
下载链接
链接失效反馈
官方服务:
资源简介:
该数据集包含了多个字段,如提示文本(prompt)、索引(prompt_idx)、选中的文本(chosen)、被拒绝的文本(rejected)等。每个字段都有其特定的数据类型。数据集被划分为训练集,包含了一定数量的示例。此外,还包含了文本的分数、对数概率、完成原因等信息。数据集还包含了嵌入向量的差异等信息,以及一些评分和排名数据。

The dataset includes multiple fields such as prompt text (prompt), index (prompt_idx), chosen text (chosen), rejected text (rejected), etc., each with its specific data type. The dataset is split into a training set containing a certain number of examples. In addition, it includes information such as scores, log probabilities, completion reasons for the texts, as well as differences in embedding vectors, and some scoring and ranking data.
提供机构:
jihuny
二维码
社区交流群
二维码
科研交流群
商业服务