jihuny/tldr_1.4b_10k_gopt_policy_unnorm_penul_mean_1e-6_iter0
收藏Hugging Face2025-11-03 更新2025-11-15 收录
下载链接:
https://hf-mirror.com/datasets/jihuny/tldr_1.4b_10k_gopt_policy_unnorm_penul_mean_1e-6_iter0
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含了一系列文本相关的特征,如提示文本(prompt)、选择的文本(chosen)和被拒绝的文本(rejected)等,以及它们的相关索引、分数、对数概率和完成原因等。此外,还包括了嵌入向量的差异和最优分数等信息。数据集被划分为训练集,大小约为180MB。
The dataset includes a series of text-related features such as prompt text, chosen text, and rejected text, along with their respective indices, scores, log probabilities, and completion reasons. It also includes information on the difference in embedding vectors and optimal scores. The dataset is split into a training set and is approximately 180MB in size.
提供机构:
jihuny



