SNIPS (SNIPS Natural Language Understanding benchmark)
收藏OpenDataLab2026-05-24 更新2024-05-09 收录
下载链接:
https://opendatalab.org.cn/OpenDataLab/SNIPS
下载链接
链接失效反馈官方服务:
资源简介:
SNIPS 自然语言理解基准是一个包含 16,000 多个众包查询的数据集,分布在 7 个不同复杂度的用户意图中:SearchCreativeWork(例如,Find me the I,机器人电视节目)、GetWeather(例如,马萨诸塞州波士顿现在有风吗?) , BookRestaurant(例如,我想在明天晚上在巴黎预订一家评价很高的餐厅),PlayMusic(例如,在 Spotify 上播放 Beyoncé 的最后一首曲目),AddToPlaylist(例如,将钻石添加到我的旅行播放列表),RateBook(例如,给 Of 6 颗星Mice and Men),SearchScreeningEvent(例如,查看神奇女侠在巴黎的放映时间)。训练集包含 13,084 个话语,验证集和测试集各包含 700 个话语,每个意图有 100 个查询。
The SNIPS Natural Language Understanding benchmark is a dataset consisting of over 16,000 crowdsourced queries across 7 user intents with varying levels of complexity: SearchCreativeWork (e.g., "Find me the I, Robot television show"), GetWeather (e.g., "Is it windy in Boston, Massachusetts right now?"), BookRestaurant (e.g., "I would like to book a highly-rated restaurant in Paris for tomorrow evening"), PlayMusic (e.g., "Play Beyoncé's last track on Spotify"), AddToPlaylist (e.g., "Add Diamonds to my travel playlist"), RateBook (e.g., "Give Of Mice and Men a 6-star rating"), and SearchScreeningEvent (e.g., "Check the showtimes for Wonder Woman in Paris"). The training set includes 13,084 utterances, while the validation and test sets each contain 700 utterances, with 100 queries per intent.
提供机构:
OpenDataLab
创建时间:
2022-08-16
搜集汇总
数据集介绍

背景与挑战
背景概述
SNIPS数据集是一个自然语言理解基准数据集,包含超过16,000个众包查询,覆盖7个不同复杂度的用户意图,如搜索创意作品、获取天气等。该数据集主要用于意图识别和音频零样本学习任务,训练集包含13,084个话语,验证集和测试集各700个话语,每个意图有100个查询,适用于模型训练和评估。
以上内容由遇见数据集搜集并总结生成



