five

DinoDS/retrieval_grounding

收藏
Hugging Face2026-04-28 更新2026-05-03 收录
下载链接:
https://hf-mirror.com/datasets/DinoDS/retrieval_grounding
下载链接
链接失效反馈
官方服务:
资源简介:
该数据集是一个专注于检索-接地的预览数据集,基于Dino Data的四个能力切片构建:搜索触发检测、接地搜索集成、历史搜索触发和历史搜索集成。目标是训练或检查助手在两个相关问题上的行为:1) 决定何时需要检索或历史查找;2) 生成基于提供证据或先前线程上下文的接地回答。数据集包含80行数据,分为训练集(72行)、验证集(4行)和测试集(4行),语言为英语。每行是一个扁平化的助手训练示例,包含任务和路由元数据。重要列包括sample_id、source_lane、user_message、assistant_response等。数据集可用于检索触发分类实验、接地答案微调、历史感知助手行为研究等。

This dataset is a focused retrieval-grounding preview built from four Dino Data capability slices: search trigger detection, grounded search integration, history search trigger, and history search integration. The goal is to train or inspect assistant behavior around two connected problems: 1) deciding when retrieval or history lookup is needed, and 2) generating answers that stay grounded to supplied evidence or prior thread context. The dataset contains 80 rows, divided into train (72 rows), validation (4 rows), and test (4 rows) sets, with English as the language. Each row is a flattened assistant-training example with task and routing metadata. Important columns include sample_id, source_lane, user_message, assistant_response, etc. The dataset can be used for retrieval-trigger classification experiments, grounded answer fine-tuning, history-aware assistant behavior studies, etc.
提供机构:
DinoDS
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作