Alexa TaskBot Dataset

Name: Alexa TaskBot Dataset
Creator: Alexa Prize TaskBot challenge
License: Not specified

Source: arXiv (indexed 2025-09-30)

Download link:

https://github.com/rafaelhferreira/cta_rating_prediction

Description:

This dataset contains real human-assistant conversations and the corresponding user ratings collected during the Alexa Prize TaskBot Challenge, which focuses on developing conversational task assistants (CTAs) for the cooking and DIY domains. Only about 10% of users provided a rating, which makes the data harder to analyze. Conversations average 8 to 9 turns, with substantial variation in length. The dataset comprises 1,681 conversations, split into training, validation, and test sets at a stated ratio of 90/10/10 (as given in the source, although these figures sum to 110). The task is to predict the user's rating from the conversation.
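The split described above can be illustrated with a minimal sketch. The stated ratio 90/10/10 sums to 110, so the sketch treats the three numbers as relative weights and normalizes them. That reading is an assumption, not the dataset authors' documented procedure. The function name split_indices, the seed, and the use of integer conversation indices are likewise hypothetical and chosen only for illustration.

```python
import random


def split_indices(n, weights=(90, 10, 10), seed=0):
    """Shuffle range(n) and cut it into train/val/test by relative weights.

    Assumption: the weights are relative (they need not sum to 100),
    so (90, 10, 10) is read as 90/110, 10/110, 10/110.
    """
    total = sum(weights)
    order = list(range(n))
    # A seeded local generator keeps the split reproducible.
    random.Random(seed).shuffle(order)
    # Integer cut points guarantee the three parts cover all n items.
    cut1 = n * weights[0] // total
    cut2 = n * (weights[0] + weights[1]) // total
    return order[:cut1], order[cut1:cut2], order[cut2:]


train, val, test = split_indices(1681)
print(len(train), len(val), len(test))  # prints: 1375 153 153
```

Under this reading, the 1,681 conversations would divide into 1,375 training, 153 validation, and 153 test conversations. The actual split files in the linked repository should be treated as authoritative.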

Provider:

Alexa Prize TaskBot challenge
