Salvor-Hardin/Search-Distill-5K-ReAct

Name: Salvor-Hardin/Search-Distill-5K-ReAct
Creator: Salvor-Hardin
Published: 2026-03-22 08:10:10
License: 暂无描述

Hugging Face2026-03-22 更新2026-03-29 收录

下载链接：

https://hf-mirror.com/datasets/Salvor-Hardin/Search-Distill-5K-ReAct

下载链接

链接失效反馈

官方服务：

资源简介：

--- language: - en license: apache-2.0 size_categories: - 1K-10K task_categories: - text-generation - question-answering tags: - react - search-distillation - reasoning - rag - slm configs: - config_name: default data_files: - split: train path: train.parquet --- # TraceMind-Search-5K **TraceMind-Search-5K** is a high-quality dataset featuring 5,000+ samples of search-augmented reasoning traces. It is specifically designed to train Small Language Models (SLMs) in the **ReAct (Reasoning + Acting)** pattern, focusing on precise information retrieval, noise filtering, and grounded synthesis. ## Dataset Highlights - **Total Samples**: 4,997 entries in Parquet format. - **File**: `train.parquet` (2.7 MB). - **Pattern**: Strict adherence to the ReAct framework: `User` -> `Thought (pre-search)` -> `<search>` -> `<|results|>` -> `Thought (post-search)` -> `Answer`. - **Reasoning Depth**: Each entry includes an internal monologue that justifies the search query and critiques the retrieved results. - **Synthesized Facts**: Contains up-to-date reasoning on synthetic "future" events (2025-2026) to test grounding performance. ## Methodology: Search Distillation The dataset was generated using a "Search Distillation" methodology: 1. **Thought (Pre-search)**: The model identifies knowledge gaps and formulates a search strategy. 2. **Search**: Concise, keyword-based queries are generated. 3. **Synthesis**: The model reviews multiple sources, explicitly dismisses irrelevant "noise," and synthesizes a final answer. 4. **Grounding**: The final answer is strictly grounded in the provided `<|results|>` block. ## Use Cases - **RAG Alignment**: Fine-tune models to better utilize external context and filter out search noise. - **Tool-Use Training**: Train models to generate effective search queries. - **SLM Reasoning**: Improve the "chain-of-thought" capabilities of smaller models through high-quality distillation traces. ## Dataset Structure The data is provided in a single `.jsonl` file with a `text` field containing the full trace: ```json {"text": "User: ... \nAssistant: Thought: ... \n<search>...</search> \n<|results|> ... <|/results|> \nAssistant: Thought: ... \nAnswer: ..."} ``` ## Dataset Generation This dataset consists of synthetic ReAct (Reasoning + Acting) traces generated using the open-weights **`moonshotai/kimi-k2-instruct-0905`** model. The model was specifically prompted to produce high-quality noise-filtered trajectories for training Small Language Models (SLMs). ## License This dataset and its curated structure are released under the **Apache License 2.0**. The source data is derived from Kimi K2, which operates under Moonshot AI's Modified MIT License, allowing for broad open-source and commercial downstream use.

提供机构：

Salvor-Hardin

5,000+

优质数据集

54 个

任务类型

进入经典数据集