ConflictQA

Name: ConflictQA
Creator: maas
Published: 2025-12-05 16:40:33
License: 暂无描述

魔搭社区2025-12-05 更新2025-07-05 收录

下载链接：

https://modelscope.cn/datasets/osunlp/ConflictQA

下载链接

链接失效反馈

官方服务：

资源简介：

license: apache-2.0 task_categories: - question-answering language: - en pretty_name: conflictQA size_categories: - 10K<n<100K configs: - config_name: ConflictQA-popQA-chatgpt data_files: - split: test path: "./conflictQA-popQA-chatgpt.json" --- # Dataset Card for ConflcitQA ## Dataset Description - **Repository:** https://github.com/OSU-NLP-Group/LLM-Knowledge-Conflict - **Paper:** https://arxiv.org/abs/2305.13300 - **Point of Contact:** Point of Contact: [Jian Xie](mailto:jianx0321@gmail.com) ## Citation If our paper or related resources prove valuable to your research, we kindly ask for citation. Please feel free to contact us with any inquiries. ```bib @inproceedings{Xie2024KnowledgeConflict, title={Adaptive Chameleon or Stubborn Sloth: Revealing the Behavior of Large Language Models in Knowledge Conflicts}, author={Xie, Jian and Zhang, Kai and Chen, Jiangjie and Lou, Renze and Su, Yu}, booktitle={Proceedings of ICLR}, year={2024}, } ``` # ConflcitQA We provide conflictQA-[based large language model], which utilizes large language models guided parametric memory. ```json {"question": "What is George Rankin's occupation?", "popularity": 142, "ground_truth": ["politician", "political leader", "political figure", "polit.", "pol"], "memory_answer": "George Rankin's occupation is a professional photographer.", "parametric_memory": "As a professional photographer, George Rankin...", "counter_answer": "George Rankin's occupation is political figure.", "counter_memory": "George Rankin has been actively involved in politics for over a decade...", "parametric_memory_aligned_evidence": "George Rankin has a website showcasing his photography portfolio...", "counter_memory_aligned_evidence": "George Rankin Major General George James Rankin..."} ``` ```python #loading dataset from datasets import load_dataset # you can choose dataset "ConflictQA-popQA-[PLACEHOLDER]", and the [PLACEHOLDER] is in ["chatgpt","gpt4","palm2","llama2-7b","llama2-70b","qwen7b","vicuna7b","vicuna33b"]. dataset = load_dataset("osunlp/ConflictQA",'ConflictQA-popQA-chatgpt') ``` # Data Fields - "question": The question in natural language - "popularity": The monthly page views on Wikipedia for the given question - "ground_truth": The factual answer to the question, which may include multiple possible answers - "memory_answer": The answer provided by the LLM to the question - "parametric_memory": The supportive evidence from LLM's parametric memory for the answer - "counter_answer": The answer contradicting the "memory_answer" - "counter_memory": The generation-based evidence supporting the counter_answer - "parametric_memory_aligned_evidence": Additional evidence supporting the "memory_answer", which could be generated or derived from Wikipedia/human annotation - "counter_memory_aligned_evidence": Additional evidence supporting the "counter_answer", either generated or sourced from Wikipedia/human annotation

许可证：Apache-2.0 任务类别： - 问答（question-answering）语言： - 英语展示名称：ConflictQA 规模类别： - 10000 < 样本量 < 100000 配置项： - 配置名称：ConflictQA-popQA-chatgpt 数据文件：拆分方式：测试集（test）路径："./conflictQA-popQA-chatgpt.json" # ConflictQA数据集卡片 ## 数据集描述 - **仓库地址**：https://github.com/OSU-NLP-Group/LLM-Knowledge-Conflict - **论文地址**：https://arxiv.org/abs/2305.13300 - **联系人**：[Jian Xie](mailto:jianx0321@gmail.com) ## 引用说明若您的研究工作得益于本文或相关资源，恳请引用本工作。如有任何疑问，欢迎随时与我们联系。 bib @inproceedings{Xie2024KnowledgeConflict, title={Adaptive Chameleon or Stubborn Sloth: Revealing the Behavior of Large Language Models in Knowledge Conflicts}, author={Xie, Jian and Zhang, Kai and Chen, Jiangjie and Lou, Renze and Su, Yu}, booktitle={Proceedings of ICLR}, year={2024}, } ## ConflictQA数据集我们提供了基于大语言模型（Large Language Model）的ConflictQA数据集变体，该数据集依托受参数记忆引导的大语言模型构建。 json {"question": "What is George Rankin's occupation?", "popularity": 142, "ground_truth": ["politician", "political leader", "political figure", "polit.", "pol"], "memory_answer": "George Rankin's occupation is a professional photographer.", "parametric_memory": "As a professional photographer, George Rankin...", "counter_answer": "George Rankin's occupation is political figure.", "counter_memory": "George Rankin has been actively involved in politics for over a decade...", "parametric_memory_aligned_evidence": "George Rankin has a website showcasing his photography portfolio...", "counter_memory_aligned_evidence": "George Rankin Major General George James Rankin..."} python #loading dataset from datasets import load_dataset # you can choose dataset "ConflictQA-popQA-[PLACEHOLDER]", and the [PLACEHOLDER] is in ["chatgpt","gpt4","palm2","llama2-7b","llama2-70b","qwen7b","vicuna7b","vicuna33b"]. dataset = load_dataset("osunlp/ConflictQA",'ConflictQA-popQA-chatgpt') ## 数据字段说明 - "question"：自然语言形式的问题 - "popularity"：对应问题相关条目在维基百科的月度页面浏览量 - "ground_truth"：该问题的事实性标准答案，可包含多个可选答案 - "memory_answer"：大语言模型针对该问题生成的回答 - "parametric_memory"：大语言模型为其生成的memory_answer提供的支撑性证据 - "counter_answer"：与memory_answer相矛盾的对立答案 - "counter_memory"：用于支撑counter_answer的生成式证据 - "parametric_memory_aligned_evidence"：用于支撑memory_answer的额外证据，可由大语言模型生成或源自维基百科/人工标注 - "counter_memory_aligned_evidence"：用于支撑counter_answer的额外证据，可由大语言模型生成或源自维基百科/人工标注

提供机构：

maas

创建时间：

2025-07-04

5,000+

优质数据集

54 个

任务类型

进入经典数据集