jihuny/tldr_1.4b_10k_gopt_policy_norm_penul_last_1e-6_iter0

Name: jihuny/tldr_1.4b_10k_gopt_policy_norm_penul_last_1e-6_iter0
Creator: jihuny
Published: 2025-11-03 12:59:54
License: 暂无描述

Hugging Face2025-11-03 更新2025-11-15 收录

下载链接：

https://hf-mirror.com/datasets/jihuny/tldr_1.4b_10k_gopt_policy_norm_penul_last_1e-6_iter0

下载链接

链接失效反馈

官方服务：

资源简介：

该数据集是一个文本处理相关的数据集，包含文本提示(prompt)、选择的文本(chosen)、被拒绝的文本(rejected)等信息，以及这些文本的索引、评分、对数概率和完成原因等特征。数据集分为训练集(train)，共有10000个示例，数据集总大小为182,617,752字节。

This dataset is related to text processing, containing features such as text prompts (prompt), chosen text (chosen), rejected text (rejected), and other information including indices, scores, log probabilities, and finish reasons for these texts. The dataset is split into a training set (train) with a total of 10,000 examples, and the overall size of the dataset is 182,617,752 bytes.

提供机构：

jihuny