five

非任务导向对话系统中文语料库

收藏
arXiv2018-05-15 更新2024-06-21 收录
下载链接:
http://ai.tencent.com/ailab/upload/PapersUploads/A_Manually_Annotated_Chinese_Corpus_for_Non-task-oriented_Dialogue_System
下载链接
链接失效反馈
官方服务:
资源简介:
非任务导向对话系统中文语料库是由腾讯AI实验室创建的大规模语料库,包含超过27K独特提示和82K响应,数据来源于社交媒体。该数据集通过定义一个5级评分方案进行人工标注,旨在提高对话系统的响应选择质量。数据集的应用领域主要在于训练和评估对话系统,解决对话系统中响应质量不一的问题。

The Chinese non-task-oriented dialogue system corpus is a large-scale corpus created by Tencent AI Lab. It contains over 27K unique prompts and 82K responses, with data sourced from social media. This corpus was manually annotated via a predefined 5-level scoring scheme, aiming to improve the quality of response selection for dialogue systems. Its main applications are training and evaluating dialogue systems, to address the issue of inconsistent response quality in such systems.
提供机构:
腾讯AI实验室
创建时间:
2018-05-15
二维码
社区交流群
二维码
科研交流群
商业服务