five

MCScript

收藏
arXiv2018-03-14 更新2024-06-21 收录
下载链接:
http://www.sfb1102.uni-saarland.de/?page_id=2582
下载链接
链接失效反馈
官方服务:
资源简介:
MCScript是由德国萨尔兰大学创建的一个大型数据集,专注于评估机器理解中使用常识知识的能力。该数据集包含约2,100个叙述文本和约14,000个关于这些文本的问题,特别关注日常活动的故事,如去电影院或园艺,并要求使用常识知识,特别是脚本知识来回答问题。数据集通过众包收集,经过手动验证和过滤,确保高质量。MCScript不仅为自然语言理解社区提供了挑战性的测试案例,还作为2018年SemEval的一个共享任务的基础,旨在评估和推动机器理解技术的发展。

MCScript is a large-scale dataset developed by Saarland University in Germany, focusing on evaluating machines' ability to utilize common-sense knowledge for machine comprehension. The dataset contains approximately 2,100 narrative texts and around 14,000 questions related to these texts, with a particular emphasis on stories of daily activities such as going to the cinema or gardening, and requires the use of common-sense knowledge, especially script knowledge, to answer the questions. It was collected via crowdsourcing, and then manually validated and filtered to ensure high data quality. Beyond offering challenging test cases for the natural language understanding community, MCScript also served as the foundation for a shared task at SemEval 2018, which aims to evaluate and promote the advancement of machine comprehension technologies.
提供机构:
萨尔兰大学
创建时间:
2018-03-14
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作