Neural Conversational QA

Name: Neural Conversational QA
Creator: OpenDataLab
Published: 2026-05-24 09:30:17
License: 暂无描述

OpenDataLab2026-05-24 更新2024-05-09 收录

下载链接：

https://opendatalab.org.cn/OpenDataLab/Neural_Conversational_QA

下载链接

链接失效反馈

官方服务：

资源简介：

神经对话 QA 任务（如 ShARC）要求系统根据给定段落的内容回答问题。在研究最近关于 ShARC QA 任务的最先进模型时，我们发现模型学习数据集中虚假线索/模式的迹象。此外，为利用这些模式而构建的基于启发式的程序具有与神经模型相比的性能。在本文中，我们分享了我们对 ShARC 语料库中四种模式以及神经模型如何利用它们的发现。受上述发现的启发，我们创建并共享了一个修改后的数据集，该数据集的虚假模式比原始数据集更少，从而使模型能够更好地学习。

Neural conversational QA tasks (e.g., ShARC) require systems to answer questions based on the content of given paragraphs. When investigating recent state-of-the-art models for the ShARC QA task, we observed signs that models learn spurious cues/patterns within the dataset. Furthermore, heuristic-based programs built to exploit these patterns achieved performance comparable to that of neural models. In this work, we share our findings regarding four patterns present in the ShARC corpus and how neural models exploit them. Motivated by these findings, we created and shared a modified dataset with fewer spurious patterns than the original dataset, enabling models to learn more effectively.

提供机构：

OpenDataLab

创建时间：

2022-06-28

搜集汇总

数据集介绍