five

zilliz/natural_questions-context-relevance-with-think

收藏
Hugging Face2026-01-06 更新2026-02-07 收录
下载链接:
https://hf-mirror.com/datasets/zilliz/natural_questions-context-relevance-with-think
下载链接
链接失效反馈
官方服务:
资源简介:
该数据集用于训练RAG系统中的语义高亮模型,包含查询-上下文对及其相关性标注,帮助识别文档中与查询语义相关的部分。关键特征包括上下文跨度、相关性标签和思考过程。数据集基于Open Provence项目的训练数据集,并进行了修改和重新标注。

This dataset is used for training the `zilliz/semantic-highlight-bilingual-v1` model for semantic highlighting in RAG (Retrieval-Augmented Generation) systems. It contains query-context pairs with relevance annotations for context spans. The annotations help identify which parts of a document are semantically relevant to a query, even when they dont contain exact keyword matches. Key features include context spans, relevance labels, and think process. The dataset is derived from `hotchpotch/natural-questions-context-relevance`, which is based on training datasets from the Open Provence project, with modifications and re-annotation.
提供机构:
zilliz
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作