zhengbang0707/REFUEL-Ultrainteract-Llama-3-Armo-iter_2_TWise
收藏Hugging Face2025-04-08 更新2025-04-12 收录
下载链接:
https://hf-mirror.com/datasets/zhengbang0707/REFUEL-Ultrainteract-Llama-3-Armo-iter_2_TWise
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含了两个部分:选中的部分(chosen)和拒绝的部分(reject),每个部分都包含了内容(content)和角色(role)信息。此外,还有相关的序列化整数标记和奖励值。数据集分为训练集和测试集,提供了对应的文件路径配置。
The dataset consists of two parts: the chosen part and the reject part, each containing content and role information. In addition, there are related serialized integer tokens and reward values. The dataset is divided into training and test sets, with corresponding file path configurations provided.
提供机构:
zhengbang0707



