THU-KEG/WildFB

Name: THU-KEG/WildFB
Creator: THU-KEG
Published: 2026-02-26 02:20:49
License: 暂无描述

Hugging Face2026-02-26 更新2026-03-29 收录

下载链接：

https://hf-mirror.com/datasets/THU-KEG/WildFB

下载链接

链接失效反馈

官方服务：

资源简介：

--- license: mit task_categories: - text-classification - reinforcement-learning language: - en - zh multilinguality: - multilingual size_categories: - 100K<n<1M source_datasets: - WildChat-4.8M pretty_name: WildFB dataset_info: features: - name: id dtype: string - name: history dtype: string - name: text dtype: string - name: messages dtype: list - name: user_feedback dtype: string - name: label dtype: int splits: train: num_examples: ~181000 test: num_examples: 5000 --- # WildFB Dataset **WildFB** (Wild Feedback) is a high-quality dataset of **186k instances** filtered and refined from [WildChat-4.8M](https://huggingface.co/datasets/allenai/WildChat-4.8M). Each instance is labeled with a **4-level ordinal satisfaction score** extracted from in-the-wild human-LLM interactions. ## Dataset Details WildFB addresses the challenge of training reward models without expensive human-annotated preference pairs. Instead, it extracts **implicit reward signals** from user follow-up queries in real-world conversations. ### Label Distribution The dataset uses a 4-point ordinal scale based on user satisfaction: | Label | Level | Description | |-------|-------|-------------| | 1 | CLEARLY NEGATIVE | User expresses rejection, strong dissatisfaction, or abandonment | | 2 | CORRECTION | User provides error corrections or points out mistakes | | 3 | POSITIVE ENGAGEMENT | User continues conversation with positive engagement | | 4 | CLEAR SATISFACTION | User expresses thanks, praise, or clear satisfaction | ### Dataset Statistics - **Total Instances:** 186,000+ - **Train Split:** ~181,000 - **Test Split:** 5,000 - **Source:** WildChat-4.8M (filtered and refined) - **Languages:** Primarily English, with multilingual support ## Data Generation Pipeline WildFB is constructed through an **automated 8-step pipeline**: 1. **Preprocessing** - Convert WildChat parquet files to JSONL format 2. **Prompt Generation** - Generate preference classification prompts 3. **Response Generation** - Generate classification responses using LLM API 4. **Filtering & Parsing** - Extract and validate user feedback labels 5. **Conversation Merging** - Reconstruct full conversation contexts 6. **Hindsight Mining** - Recover hidden positive signals from neutral-looking contexts 7. **Refusal Validation** - Filter noise where users penalize correct safety refusals 8. **Train/Test Split** - Create 5000-sample test set ### Key Features - **Implicit Feedback Mining** - Recovers positive signals from contexts that appear neutral but indicate satisfaction - **Refusal Validation** - Removes noise where users unjustifiably penalize correct safety refusals by the model - **Topic-Aware Filtering** - Ensures diverse coverage across different conversation topics ## Use Cases WildFB is primarily designed for: 1. **Reward Model Training** - Train ordinal regression models via CORAL-like approach 2. **Quality Assessment** - Benchmark for conversation quality evaluation ## Dataset Structure ```json { "id": "uuid", "history": [ {"role": "user", "content": "..."}, {"role": "assistant", "content": "..."}, ... ], "text": "Full conversation text...", "messages": [ {"role": "user", "content": "..."}, {"role": "assistant", "content": "..."} ], "user_feedback": "thank you!", "label": 4 } ``` ## Usage Example ```python from datasets import load_dataset # Load the dataset dataset = load_dataset("THU-KEG/WildFB") # Access training data train_data = dataset["train"] # Example instance instance = train_data[0] print(f"Label: {instance['label']} (1-4)") print(f"User Feedback: {instance['user_feedback']}") print(f"Messages: {instance['messages']}") ``` ## Source Data WildFB is adapted from the [WildChat-4.8M](https://huggingface.co/datasets/allenai/WildChat-4.8M) dataset, which contains millions of real-world human-LLM conversations collected from the WildChat platform. ## Data Collection & Processing For detailed information on the data collection pipeline and filtering methodology, please refer to: 📚 **[WildReward GitHub Repository](https://github.com/THU-KEG/WildReward)** The repository contains: - Complete pipeline implementation (`collect_rm_data/`) - Detailed documentation for each processing step - Quality control and filtering strategies ## License This dataset is released under the **MIT License**. The original WildChat dataset may have its own license terms that users should comply with. ## Citation ```bibtex @misc{peng2026wildrewardlearningrewardmodels, title={WildReward: Learning Reward Models from In-the-Wild Human Interactions}, author={Hao Peng and Yunjia Qi and Xiaozhi Wang and Zijun Yao and Lei Hou and Juanzi Li}, year={2026}, eprint={2602.08829}, archivePrefix={arXiv}, primaryClass={cs.CL}, url={https://arxiv.org/abs/2602.08829}, } ``` ## Acknowledgments - WildChat dataset for providing the raw conversation data - The WildReward project for the data processing pipeline --- **Note:** This is a filtered and processed version of WildChat-4.8M. Please refer to the [WildReward GitHub repository](https://github.com/THU-KEG/WildReward) for complete pipeline details and methodology.

提供机构：

THU-KEG

5,000+

优质数据集

54 个

任务类型

进入经典数据集