zhengbang0707/REFUEL-Ultrainteract-Llama-3-Armo-iter_2_TWise_30k
收藏Hugging Face2025-04-09 更新2025-04-12 收录
下载链接:
https://hf-mirror.com/datasets/zhengbang0707/REFUEL-Ultrainteract-Llama-3-Armo-iter_2_TWise_30k
下载链接
链接失效反馈官方服务:
资源简介:
这是一个包含文本内容、角色、序列标记、掩码序列和奖励值的数据集,分为训练集和测试集。每个样本包含被选中的文本和被拒绝的文本,以及相应的角色、标记序列、掩码序列和奖励值。
This dataset includes text content, roles, sequence tokens, mask sequences, and reward values, split into training and testing sets. Each sample contains chosen and rejected text, along with corresponding roles, token sequences, mask sequences, and reward values.
提供机构:
zhengbang0707



