Spoken-CoQA

arXiv | Updated 2022-04-30 | Indexed 2024-08-06

Download link:

http://arxiv.org/abs/2204.14272v1


Resource overview:

Spoken-CoQA is a dataset for spoken conversational question answering, created by Tencent. It contains more than 40,000 question-answer pairs drawn from 4,000 conversations. By pairing speech with text, it targets the difficulty machines have in following complex conversational flow. The dataset supports multi-turn dialogue and is intended for cross-modal information integration, with the aim of improving system performance on speech and language processing tasks. Application areas include voice assistants and chatbots, where fine-grained multimodal representations are meant to improve how well machines understand and respond to spoken dialogue.

Provider:

Tencent

Created:

2022-04-30

Collected summary

Dataset introduction