five

yenshanchen/DPO-joke-preference-dataset

收藏
Hugging Face2025-12-08 更新2025-12-20 收录
下载链接:
https://hf-mirror.com/datasets/yenshanchen/DPO-joke-preference-dataset
下载链接
链接失效反馈
官方服务:
资源简介:
# Joke Preference Dataset This dataset contains jokes with like counts and reasoning annotations, prepared for **preference learning (DPO)**. It provides **train** and **validation** splits. ## Columns - `joke` : the raw joke text - `like_count` : the number of likes - `post_id` : the original post identifier - `reasoning` : full reasoning / analysis - `stripped_reasoning` : reasoning cleaned of any like counts or metadata When loaded as preference pairs, each example contains: - `chosen` : text of the joke chosen as higher preference - `rejected` : text of the lower-preference joke - `score_chosen` : like count of chosen joke - `score_rejected` : like count of rejected joke - `source_post_id` : post id of the group ## Modes - `raw` : chosen/rejected = joke only - `reasoning` : chosen/rejected = `Joke:\n{joke}\n\nReasoning:\n{reasoning}` - `reasoning_stripped` : chosen/rejected = `Joke:\n{joke}\n\nReasoning:\n{stripped_reasoning}` ## Example Usage ```python from datasets import load_dataset # Load HF Dataset splits train_ds = load_dataset("yenshanchen/DPO-joke-preference-dataset", split="train") val_ds = load_dataset("yenshanchen/DPO-joke-preference-dataset", split="validation") # Import your dataloader from dataloader import JokePairsIterable # Build preference pair datasets train_pairs = JokePairsIterable(train_ds, mode="reasoning_stripped") val_pairs = JokePairsIterable(val_ds, mode="reasoning_stripped") print(f"Number of training pairs: {len(train_pairs)}") print(f"Number of validation pairs: {len(val_pairs)}") # Iterate over a few examples for i, pair in enumerate(train_pairs): print(pair) if i >= 2: break ``` --- license: apache-2.0 ---
提供机构:
yenshanchen
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作