手动标注的客户评论基准数据集
收藏arXiv2023-11-06 更新2024-08-06 收录
下载链接:
http://arxiv.org/abs/2311.02702v1
下载链接
链接失效反馈官方服务:
资源简介:
该数据集由北卡罗来纳大学夏洛特分校计算机科学系创建,专注于从餐厅、酒店和美发沙龙三个领域的客户评论中提取非典型方面。数据集包含约114条评论,这些评论经过手动标注,以识别与各领域核心业务不直接相关的非典型特征。创建过程涉及使用spaCy工具筛选罕见词汇,并人工审核以确定这些词汇是否指代非典型方面。该数据集旨在支持开发能够识别和推荐具有潜在惊喜元素的服务或产品的推荐系统,从而增强用户体验。
This dataset was created by the Department of Computer Science at the University of North Carolina at Charlotte. It focuses on extracting atypical aspects from customer reviews across three domains: restaurants, hotels, and hair salons. The dataset contains approximately 114 reviews, which have been manually annotated to identify atypical features that are not directly related to the core business of their respective domains. The creation process involved using the spaCy tool to filter rare terms, followed by manual review to determine whether these terms refer to atypical aspects. This dataset is intended to support the development of recommendation systems that can identify and recommend services or products with potentially surprising elements, thereby enhancing user experience.
提供机构:
北卡罗来纳大学夏洛特分校计算机科学系
创建时间:
2023-11-06



