johnnyboycurtis/Philosophical-Triplets-Retrieval
收藏Hugging Face2025-12-15 更新2025-12-20 收录
下载链接:
https://hf-mirror.com/datasets/johnnyboycurtis/Philosophical-Triplets-Retrieval
下载链接
链接失效反馈官方服务:
资源简介:
该数据集名为Philosophical-Triplets-Retrieval,专为训练和评估密集检索模型而设计,特别是在复杂且主题密集的领域(如哲学)中用于检索增强生成(RAG)系统。数据集包含高质量的训练三元组(查询/锚点、正例段落、难负例段落),这些三元组是从基础哲学著作中提取和合成的。数据集创建方法强调源文本的严格遵循、上下文的丰富性以及难负例的挖掘,以确保数据集的质量和相关性。数据集适用于训练嵌入模型、评估检索性能以及领域适应。
This dataset, named Philosophical-Triplets-Retrieval, is designed for training and evaluating dense retrieval models, specifically for Retrieval Augmented Generation (RAG) systems in complex, subject-matter-heavy domains like philosophy. It consists of high-quality training triplets (Query/Anchor, Positive Passage, Hard Negative Passage) extracted and synthesized from foundational philosophical works. The dataset creation methodology emphasizes source adherence, context richness, and hard negative mining to ensure the datasets quality and relevance. It is intended for use in training embedding models, evaluating retrieval performance, and domain adaptation.
提供机构:
johnnyboycurtis



