five

ConflictQA

收藏
魔搭社区2025-12-05 更新2025-07-05 收录
下载链接:
https://modelscope.cn/datasets/osunlp/ConflictQA
下载链接
链接失效反馈
官方服务:
资源简介:
license: apache-2.0 task_categories: - question-answering language: - en pretty_name: conflictQA size_categories: - 10K<n<100K configs: - config_name: ConflictQA-popQA-chatgpt data_files: - split: test path: "./conflictQA-popQA-chatgpt.json" --- # Dataset Card for ConflcitQA ## Dataset Description - **Repository:** https://github.com/OSU-NLP-Group/LLM-Knowledge-Conflict - **Paper:** https://arxiv.org/abs/2305.13300 - **Point of Contact:** Point of Contact: [Jian Xie](mailto:jianx0321@gmail.com) ## Citation If our paper or related resources prove valuable to your research, we kindly ask for citation. Please feel free to contact us with any inquiries. ```bib @inproceedings{Xie2024KnowledgeConflict, title={Adaptive Chameleon or Stubborn Sloth: Revealing the Behavior of Large Language Models in Knowledge Conflicts}, author={Xie, Jian and Zhang, Kai and Chen, Jiangjie and Lou, Renze and Su, Yu}, booktitle={Proceedings of ICLR}, year={2024}, } ``` # ConflcitQA We provide conflictQA-[based large language model], which utilizes large language models guided parametric memory. ```json {"question": "What is George Rankin's occupation?", "popularity": 142, "ground_truth": ["politician", "political leader", "political figure", "polit.", "pol"], "memory_answer": "George Rankin's occupation is a professional photographer.", "parametric_memory": "As a professional photographer, George Rankin...", "counter_answer": "George Rankin's occupation is political figure.", "counter_memory": "George Rankin has been actively involved in politics for over a decade...", "parametric_memory_aligned_evidence": "George Rankin has a website showcasing his photography portfolio...", "counter_memory_aligned_evidence": "George Rankin Major General George James Rankin..."} ``` ```python #loading dataset from datasets import load_dataset # you can choose dataset "ConflictQA-popQA-[PLACEHOLDER]", and the [PLACEHOLDER] is in ["chatgpt","gpt4","palm2","llama2-7b","llama2-70b","qwen7b","vicuna7b","vicuna33b"]. dataset = load_dataset("osunlp/ConflictQA",'ConflictQA-popQA-chatgpt') ``` # Data Fields - "question": The question in natural language - "popularity": The monthly page views on Wikipedia for the given question - "ground_truth": The factual answer to the question, which may include multiple possible answers - "memory_answer": The answer provided by the LLM to the question - "parametric_memory": The supportive evidence from LLM's parametric memory for the answer - "counter_answer": The answer contradicting the "memory_answer" - "counter_memory": The generation-based evidence supporting the counter_answer - "parametric_memory_aligned_evidence": Additional evidence supporting the "memory_answer", which could be generated or derived from Wikipedia/human annotation - "counter_memory_aligned_evidence": Additional evidence supporting the "counter_answer", either generated or sourced from Wikipedia/human annotation

许可证:Apache-2.0 任务类别: - 问答(question-answering) 语言: - 英语 展示名称:ConflictQA 规模类别: - 10000 < 样本量 < 100000 配置项: - 配置名称:ConflictQA-popQA-chatgpt 数据文件: 拆分方式:测试集(test) 路径:"./conflictQA-popQA-chatgpt.json" # ConflictQA数据集卡片 ## 数据集描述 - **仓库地址**:https://github.com/OSU-NLP-Group/LLM-Knowledge-Conflict - **论文地址**:https://arxiv.org/abs/2305.13300 - **联系人**:[Jian Xie](mailto:jianx0321@gmail.com) ## 引用说明 若您的研究工作得益于本文或相关资源,恳请引用本工作。如有任何疑问,欢迎随时与我们联系。 bib @inproceedings{Xie2024KnowledgeConflict, title={Adaptive Chameleon or Stubborn Sloth: Revealing the Behavior of Large Language Models in Knowledge Conflicts}, author={Xie, Jian and Zhang, Kai and Chen, Jiangjie and Lou, Renze and Su, Yu}, booktitle={Proceedings of ICLR}, year={2024}, } ## ConflictQA数据集 我们提供了基于大语言模型(Large Language Model)的ConflictQA数据集变体,该数据集依托受参数记忆引导的大语言模型构建。 json {"question": "What is George Rankin's occupation?", "popularity": 142, "ground_truth": ["politician", "political leader", "political figure", "polit.", "pol"], "memory_answer": "George Rankin's occupation is a professional photographer.", "parametric_memory": "As a professional photographer, George Rankin...", "counter_answer": "George Rankin's occupation is political figure.", "counter_memory": "George Rankin has been actively involved in politics for over a decade...", "parametric_memory_aligned_evidence": "George Rankin has a website showcasing his photography portfolio...", "counter_memory_aligned_evidence": "George Rankin Major General George James Rankin..."} python #loading dataset from datasets import load_dataset # you can choose dataset "ConflictQA-popQA-[PLACEHOLDER]", and the [PLACEHOLDER] is in ["chatgpt","gpt4","palm2","llama2-7b","llama2-70b","qwen7b","vicuna7b","vicuna33b"]. dataset = load_dataset("osunlp/ConflictQA",'ConflictQA-popQA-chatgpt') ## 数据字段说明 - "question":自然语言形式的问题 - "popularity":对应问题相关条目在维基百科的月度页面浏览量 - "ground_truth":该问题的事实性标准答案,可包含多个可选答案 - "memory_answer":大语言模型针对该问题生成的回答 - "parametric_memory":大语言模型为其生成的memory_answer提供的支撑性证据 - "counter_answer":与memory_answer相矛盾的对立答案 - "counter_memory":用于支撑counter_answer的生成式证据 - "parametric_memory_aligned_evidence":用于支撑memory_answer的额外证据,可由大语言模型生成或源自维基百科/人工标注 - "counter_memory_aligned_evidence":用于支撑counter_answer的额外证据,可由大语言模型生成或源自维基百科/人工标注
提供机构:
maas
创建时间:
2025-07-04
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作