InsultedByMathematics/all-online_alpha_1e-4_beta_3e-3
收藏Hugging Face2025-02-04 更新2025-02-15 收录
下载链接:
https://hf-mirror.com/datasets/InsultedByMathematics/all-online_alpha_1e-4_beta_3e-3
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含了一系列提示(prompt)和对应的多个响应(response),以及与这些响应相关的对数概率和奖励值。此外,还包含了关于响应的令牌数量和模型来源的信息。数据集适用于训练自然语言处理模型,尤其是那些涉及文本生成和对话系统的模型。数据集分为训练集,大小约为718MB。
The dataset consists of a series of prompts and their corresponding multiple responses, along with log probabilities and reward values associated with these responses. It also includes information about the number of tokens in the responses and the source of the model. The dataset is suitable for training natural language processing models, particularly those involving text generation and conversational systems. The dataset is split into a training set, which is approximately 718MB in size.
提供机构:
InsultedByMathematics



