five

jihuny/tldr_1.4b_10k_gopt_policy_norm_penul_last_1e-6_iter0

收藏
Hugging Face2025-11-03 更新2025-11-15 收录
下载链接:
https://hf-mirror.com/datasets/jihuny/tldr_1.4b_10k_gopt_policy_norm_penul_last_1e-6_iter0
下载链接
链接失效反馈
官方服务:
资源简介:
该数据集是一个文本处理相关的数据集,包含文本提示(prompt)、选择的文本(chosen)、被拒绝的文本(rejected)等信息,以及这些文本的索引、评分、对数概率和完成原因等特征。数据集分为训练集(train),共有10000个示例,数据集总大小为182,617,752字节。

This dataset is related to text processing, containing features such as text prompts (prompt), chosen text (chosen), rejected text (rejected), and other information including indices, scores, log probabilities, and finish reasons for these texts. The dataset is split into a training set (train) with a total of 10,000 examples, and the overall size of the dataset is 182,617,752 bytes.
提供机构:
jihuny
二维码
社区交流群
二维码
科研交流群
商业服务