jihuny/tldr_1.4b_10k_gopt_policy_unnorm_penul_last_1_iter0

Name: jihuny/tldr_1.4b_10k_gopt_policy_unnorm_penul_last_1_iter0
Creator: jihuny
Published: 2025-11-03 23:12:22
License: not specified

Source: Hugging Face. Updated: 2025-11-03. Indexed: 2025-11-15.

Download link:

https://hf-mirror.com/datasets/jihuny/tldr_1.4b_10k_gopt_policy_unnorm_penul_last_1_iter0

Description:

The dataset contains fields for the prompt text, the chosen text, the rejected text, their indices, scores, log probabilities, finish reasons, and the difference in embedding vectors. It consists of a training split with 10,000 samples, totaling 182,638,849 bytes (about 182.6 MB). The README does not explicitly describe the dataset's purpose or content.
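Since the README does not list exact column names, a reasonable first step is to load the training split and inspect its schema. The following is a minimal sketch, not an official recipe. It assumes the Hugging Face `datasets` library is installed, that the repository is publicly readable, and that the training split uses the conventional name `train`. The helper names `load_train_split` and `average_bytes_per_sample` are hypothetical and introduced here only for illustration.

```python
# Figures taken from the listing above.
REPO_ID = "jihuny/tldr_1.4b_10k_gopt_policy_unnorm_penul_last_1_iter0"
NUM_SAMPLES = 10_000
TOTAL_BYTES = 182_638_849


def average_bytes_per_sample(total_bytes: int, num_samples: int) -> float:
    """Rough per-sample size, useful for estimating memory before loading."""
    return total_bytes / num_samples


def load_train_split(repo_id: str = REPO_ID):
    """Load the training split (hypothetical helper).

    Assumption: the split is named "train"; the listing only says
    "training set". Requires network access and `pip install datasets`.
    """
    from datasets import load_dataset  # imported lazily so the module loads without it

    return load_dataset(repo_id, split="train")


# Example usage (requires network access):
#   ds = load_train_split()
#   print(ds.column_names)  # discover the actual field names
#   print(ds.features)      # inspect field types, e.g. the embedding-difference vectors
#   print(len(ds))          # the listing reports 10,000 samples
```

At roughly 18 KB per sample on average, most of the size presumably comes from the embedding-difference vectors rather than the text fields, though this should be confirmed against `ds.features`.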

Provider:

jihuny