five

NLU-Evaluation-Data

收藏
arXiv2019-03-26 更新2024-06-21 收录
下载链接:
https://github.com/xliuhw/NLU-Evaluation-Data
下载链接
链接失效反馈
官方服务:
资源简介:
本数据集名为NLU-Evaluation-Data,由赫瑞瓦特大学创建,包含25716条多领域(21个领域)的用户语音数据,涵盖64种意图和54种实体类型。数据集通过亚马逊Mechanical Turk收集,涉及多种场景,如设置闹钟、播放音乐等。创建过程包括数据收集、意图和实体类型标注,以及多轮验证以确保数据质量。该数据集主要用于评估和比较不同自然语言理解服务在构建对话代理中的性能,旨在解决如何选择和优化NLU服务的问题。

This dataset, named NLU-Evaluation-Data, was created by Heriot-Watt University. It contains 25,716 instances of user speech data spanning 21 distinct domains, encompassing 64 intents and 54 entity types. The dataset was collected through Amazon Mechanical Turk, covering diverse scenarios including setting alarms, playing music, and other daily use cases. Its development pipeline includes data collection, annotation of intents and entity types, and multi-stage validation to guarantee data quality. This dataset is primarily designed to evaluate and compare the performance of various natural language understanding (NLU) services when constructing conversational agents, with the goal of addressing the challenge of selecting and optimizing NLU services.
提供机构:
赫瑞瓦特大学
创建时间:
2019-03-14
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作