five

Salvor-Hardin/Search-Distill-5K-ReAct

收藏
Hugging Face2026-03-22 更新2026-03-29 收录
下载链接:
https://hf-mirror.com/datasets/Salvor-Hardin/Search-Distill-5K-ReAct
下载链接
链接失效反馈
官方服务:
资源简介:
--- language: - en license: apache-2.0 size_categories: - 1K-10K task_categories: - text-generation - question-answering tags: - react - search-distillation - reasoning - rag - slm configs: - config_name: default data_files: - split: train path: train.parquet --- # TraceMind-Search-5K **TraceMind-Search-5K** is a high-quality dataset featuring 5,000+ samples of search-augmented reasoning traces. It is specifically designed to train Small Language Models (SLMs) in the **ReAct (Reasoning + Acting)** pattern, focusing on precise information retrieval, noise filtering, and grounded synthesis. ## Dataset Highlights - **Total Samples**: 4,997 entries in Parquet format. - **File**: `train.parquet` (2.7 MB). - **Pattern**: Strict adherence to the ReAct framework: `User` -> `Thought (pre-search)` -> `<search>` -> `<|results|>` -> `Thought (post-search)` -> `Answer`. - **Reasoning Depth**: Each entry includes an internal monologue that justifies the search query and critiques the retrieved results. - **Synthesized Facts**: Contains up-to-date reasoning on synthetic "future" events (2025-2026) to test grounding performance. ## Methodology: Search Distillation The dataset was generated using a "Search Distillation" methodology: 1. **Thought (Pre-search)**: The model identifies knowledge gaps and formulates a search strategy. 2. **Search**: Concise, keyword-based queries are generated. 3. **Synthesis**: The model reviews multiple sources, explicitly dismisses irrelevant "noise," and synthesizes a final answer. 4. **Grounding**: The final answer is strictly grounded in the provided `<|results|>` block. ## Use Cases - **RAG Alignment**: Fine-tune models to better utilize external context and filter out search noise. - **Tool-Use Training**: Train models to generate effective search queries. - **SLM Reasoning**: Improve the "chain-of-thought" capabilities of smaller models through high-quality distillation traces. ## Dataset Structure The data is provided in a single `.jsonl` file with a `text` field containing the full trace: ```json {"text": "User: ... \nAssistant: Thought: ... \n<search>...</search> \n<|results|> ... <|/results|> \nAssistant: Thought: ... \nAnswer: ..."} ``` ## Dataset Generation This dataset consists of synthetic ReAct (Reasoning + Acting) traces generated using the open-weights **`moonshotai/kimi-k2-instruct-0905`** model. The model was specifically prompted to produce high-quality noise-filtered trajectories for training Small Language Models (SLMs). ## License This dataset and its curated structure are released under the **Apache License 2.0**. The source data is derived from Kimi K2, which operates under Moonshot AI's Modified MIT License, allowing for broad open-source and commercial downstream use.
提供机构:
Salvor-Hardin
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作