five

SUBJQA

收藏
arXiv2020-10-06 更新2024-06-21 收录
下载链接:
https://github.com/megagonlabs/SubjQA
下载链接
链接失效反馈
官方服务:
资源简介:
SUBJQA是一个基于客户评论的英语问答数据集,由哥本哈根大学计算机科学系等机构创建,包含超过10,000个跨6个领域的示例,涵盖产品和服务的评价。数据集通过最新的意见提取和矩阵分解技术构建,特别关注问题和答案中的主观性标注。SUBJQA旨在解决自然语言处理中主观性表达的问题,特别是在问答系统中的应用,帮助研究者开发能够理解和处理主观性内容的模型。

SUBJQA is an English question answering dataset based on customer reviews, created by institutions including the Department of Computer Science at the University of Copenhagen and others. It contains over 10,000 examples spanning six domains, covering product and service reviews. The dataset is constructed using state-of-the-art opinion extraction and matrix factorization techniques, with particular focus on subjective annotation in both questions and answers. SUBJQA aims to address the issue of subjective expressions in natural language processing, especially for applications in question answering systems, and helps researchers develop models capable of understanding and processing subjective content.
提供机构:
哥本哈根大学计算机科学系
创建时间:
2020-04-29
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作