zjhhhh/whole_sw_maxlen_8192_rescale_mean_beta_50.0_multi_expand_tokenized
收藏Hugging Face2025-10-01 更新2025-10-25 收录
下载链接:
https://hf-mirror.com/datasets/zjhhhh/whole_sw_maxlen_8192_rescale_mean_beta_50.0_multi_expand_tokenized
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含了一系列文本和数值字段,主要用于记录某种文本交互过程中的用户输入、系统响应、以及一些统计指标。具体包括提示文本、需求、不同选择的响应文本、基础响应文本、当前响应文本、各种统计均值和众数值、选择的文本和相应的奖励、拒绝的文本和相应的奖励等。数据集分为训练集和测试集,可用于模型训练和评估。
The dataset consists of a series of text and numeric fields primarily used to record user inputs, system responses, and some statistical metrics in a text interaction process. It includes prompt text, requirements, response texts for different selections, base response texts, current response texts, various statistical mean and majority values, chosen texts and corresponding rewards, rejected texts and corresponding rewards, etc. The dataset is split into training and test sets for model training and evaluation.
提供机构:
zjhhhh



