five

timchen0618/browsecomp-plus-selected-tools-analysis-v1

收藏
Hugging Face2026-04-09 更新2026-04-12 收录
下载链接:
https://hf-mirror.com/datasets/timchen0618/browsecomp-plus-selected-tools-analysis-v1
下载链接
链接失效反馈
官方服务:
资源简介:
--- license: mit tags: - browsecomp-plus - search-agent - selected-tool-calls --- # BrowseComp-Plus: Selected Tools Analysis Side-by-side view of selected tool calls from a reference trajectory alongside the new agent trajectory conditioned on those steps. **Retrieval model**: Qwen3-Embedding-8B **Agent model**: gpt-oss-120b **Run**: traj_summary_ext_selected_tools_gpt-oss-120b_seed0 ## Columns | Column | Description | |--------|-------------| | `query_id` | Query identifier | | `rationale` | GPT rationale for why these k steps were selected from the reference trajectory | | `selected_indices` | Step indices selected from the original reference trajectory | | `k_requested` | Number of steps requested to select | | `k_effective` | Number of steps actually selected | | `excerpt` | Concatenated selected steps from reference trajectory (reasoning + tool calls + results) | | `new_trajectory` | New agent trajectory conditioned on selected tool calls (system prompt omitted) | | `direct_answer` | True if agent answered directly with NO tool calls | | `tool_call_counts` | JSON dict of tool type → count for the new trajectory | | `status` | Trajectory completion status | ## Key Statistic **Direct-answer rate: 560/830 = 67.5%** 67.5% of trajectories had zero tool calls — the agent answered directly from the selected tool call context alone, without performing any new searches.
提供机构:
timchen0618
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作