five

ZDZR/BRINK-Wikidata5m

收藏
Hugging Face2026-03-18 更新2026-03-29 收录
下载链接:
https://hf-mirror.com/datasets/ZDZR/BRINK-Wikidata5m
下载链接
链接失效反馈
官方服务:
资源简介:
--- pretty_name: BRINK-Wikidata5m task_categories: - question-answering - text-generation language: - en tags: - knowledge-graph - rag - reasoning - benchmark - kgqa - incomplete-knowledge license: mit size_categories: - unknown --- # BRINK-Wikidata5m BRINK (Benchmark for Reasoning under Incomplete Knowledge) is a benchmark for evaluating Knowledge Graph–based Retrieval-Augmented Generation (KG-RAG) under incomplete knowledge. Unlike standard KGQA benchmarks, BRINK is designed so that each question cannot be answered by directly retrieving a single explicit supporting triple. Instead, the answer must be inferred from alternative reasoning paths that remain in the graph after the directly supporting fact is removed. BRINK-Wikidata5m is the Wikidata5m split of BRINK, designed to evaluate reasoning under incomplete knowledge in a larger and more realistic graph environment. 🌐 **Website:** https://github.com/boschresearch/brink 📄 **Paper:** https://arxiv.org/abs/2508.08344 ## Construction BRINK is constructed by mining high-confidence logical rules from the original knowledge graph and using rule groundings to generate question-answer pairs. For each instance, the directly supporting triple is removed while preserving sufficient alternative evidence for inference. ## Intended Use This dataset is intended for evaluating scalable KGQA and KG-RAG systems on large real-world knowledge graphs under incomplete knowledge conditions. ## Citation If you use this dataset, please cite: ```bibtex @inproceedings{zhou2026breaks, title={What Breaks Knowledge Graph based RAG? Benchmarking and Empirical Insights into Reasoning under Incomplete Knowledge}, author={Zhou, Dongzhuoran and Zhu, Yuqicheng and Wang, Xiaxia and Zhou, Hongkuan and He, Yuan and Chen, Jiaoyan and Staab, Steffen and Kharlamov, Evgeny}, booktitle={The 19th Conference of the European Chapter of the Association for Computational Linguistics: EACL}, year={2026} }
提供机构:
ZDZR
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作