MUSIED
收藏arXiv2022-11-25 更新2024-06-21 收录
下载链接:
https://github.com/myeclipse/MUSIED
下载链接
链接失效反馈官方服务:
资源简介:
MUSIED是一个大规模的中文事件检测数据集,由美团集团创建,专注于从多源异构非正式文本中识别有意义的事件。该数据集包含来自领先食品服务电子商务平台的用户评论、文本对话和电话对话,共计11,381个文档。MUSIED的创建过程经过精心设计,以确保数据的多样性和真实性,特别关注文本非正式性和多源异质性。该数据集的应用领域主要集中在食品服务行业,旨在通过自动识别和分类事件触发词,提高食品安全事件的监测和管理效率。
MUSIED is a large-scale Chinese event detection dataset created by Meituan Group, which focuses on identifying meaningful events from multi-source heterogeneous informal texts. This dataset contains a total of 11,381 documents including user reviews, text dialogues and phone conversations from a leading food service e-commerce platform. The creation process of MUSIED is meticulously designed to ensure the diversity and authenticity of the data, with particular attention paid to the informality and multi-source heterogeneity of the texts. The application scenarios of this dataset are mainly focused on the food service industry, with the aim of improving the efficiency of monitoring and management of food safety incidents by automatically identifying and classifying event triggers.
提供机构:
美团集团
创建时间:
2022-11-25



