StickerInt
Source: arXiv | Updated: 2024-03-09 | Indexed: 2024-08-06
Download link:
http://arxiv.org/abs/2403.05427v1
Description:
The StickerInt dataset was jointly created by Harbin Institute of Technology, Shenzhen and Harbin Institute of Technology to study the use of stickers as replies in online chat. It contains 1,578 Chinese conversations totaling 12,644 utterances. The conversations were collected from widely used social platforms such as WeChat and underwent rigorous preprocessing to protect user privacy and ensure data quality. In StickerInt, stickers are used not only to supplement textual expressions but also to reply directly to preceding utterances, which makes the conversations more vivid and engaging. The main application area is intelligent dialogue systems, where stickers are intended to improve emotional expression and user experience.
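Provided by:
Harbin Institute of Technology, Shenzhen; Harbin Institute of Technology; The Chinese University of Hong Kong; Jilin University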
Created:
2024-03-09



