Humor Reward Model
Source: arXiv. Indexed: 2025-09-30
Download link:
https://huggingface.co/mohameddhiab/humor-no-humor
Overview:
This resource scores how humorous a piece of generated text is and serves as a reward model for reinforcement learning. It is applied in settings where humor is one of the objectives being optimized, and its task is to model the reward signal used during training.
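As a rough illustration of what "humor as one of the reinforcement learning objectives" can mean in practice, the sketch below combines several per-objective scores into a single scalar reward with a weighted average. Everything in it is an assumption for illustration: the function `combined_reward`, the weights, and both toy scorers are hypothetical, and the listing does not describe the actual model's input or output format. In real use, the placeholder humor scorer would be replaced by a call to the reward model at the link above.

```python
from typing import Callable, Dict

# Hypothetical type: any function that maps a text to a score in [0, 1].
Scorer = Callable[[str], float]


def combined_reward(
    text: str,
    scorers: Dict[str, Scorer],
    weights: Dict[str, float],
) -> float:
    """Return the weighted average of per-objective scores for one text."""
    total_weight = sum(weights[name] for name in scorers)
    if total_weight <= 0:
        raise ValueError("weights must sum to a positive value")
    weighted_sum = sum(weights[name] * scorers[name](text) for name in scorers)
    return weighted_sum / total_weight


def toy_humor_score(text: str) -> float:
    """Placeholder for a humor reward model (assumption, not the real model)."""
    return 1.0 if "joke" in text.lower() else 0.0


def toy_length_score(text: str) -> float:
    """Placeholder second objective: rewards longer texts, capped at 10 words."""
    return min(len(text.split()) / 10.0, 1.0)


# Illustrative objectives and weights; the values are arbitrary.
scorers = {"humor": toy_humor_score, "length": toy_length_score}
weights = {"humor": 0.7, "length": 0.3}

reward = combined_reward("Here is a joke about compilers", scorers, weights)
print(f"combined reward: {reward:.2f}")
```

A weighted average keeps the combined reward in the same range as the individual scores, which makes the weights easy to interpret; other combination schemes are equally possible.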



