
Same Sentiment Classification Train/Dev/Test Pair IDs

Source: NIAID Data Ecosystem (indexed 2026-03-13)
Download link:
https://zenodo.org/record/5495792
Description:
This "dataset" only includes the compiled pairings of the Yelp Business Review Dataset. To get access to the actual review texts, please follow the instructions on the Yelp Dataset webpage.

The data format is JSON Lines.

Python load example:

    import pandas as pd

    traindev_df = pd.read_json("df_traindev.jsonl", lines=True)
    test_df = pd.read_json("df_test.jsonl", lines=True)

    # example access to single business/review id
    s1_bid = test_df.iloc[0]["sent1_business_id"]
    s1_rid = test_df.iloc[0]["sent1_review_id"]
    s2_bid = test_df.iloc[0]["sent2_business_id"]
    s2_rid = test_df.iloc[0]["sent2_review_id"]
    label = test_df.iloc[0]["is_same_side"]

See documentation at:
- Yelp Dataset Schemata (only business.json and review.json were used)
- Yelp Business Category Hierarchy (download the JSON file as all_category_list.json)

For details on how the data was compiled and used in our experiments, please refer to our code repository. Other derived data splits can be reproduced deterministically by using the same random seed as in our experiments.
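Because the pair files carry only IDs, the review texts have to be joined in from the Yelp data separately. The following is a minimal sketch of one way to do that, not the authors' own pipeline. It assumes the Yelp review records expose `review_id` and `text` fields; the helper name `attach_review_texts`, the output columns `sent1_text` and `sent2_text`, and the toy rows are all made up for illustration.

```python
import pandas as pd


def attach_review_texts(pairs: pd.DataFrame, reviews: pd.DataFrame) -> pd.DataFrame:
    """Return a copy of `pairs` with sent1_text / sent2_text columns added.

    Hypothetical helper: looks up each review ID in `reviews`, which is
    assumed to have `review_id` and `text` columns.
    """
    # Build a review_id -> text lookup once; the index must be unique for map().
    text_by_id = reviews.drop_duplicates("review_id").set_index("review_id")["text"]
    out = pairs.copy()
    # IDs that are missing from `reviews` end up as NaN rather than raising.
    out["sent1_text"] = out["sent1_review_id"].map(text_by_id)
    out["sent2_text"] = out["sent2_review_id"].map(text_by_id)
    return out


# Tiny in-memory stand-ins with made-up IDs, texts, and labels.
pairs = pd.DataFrame({
    "sent1_business_id": ["b1", "b2"],
    "sent1_review_id": ["r1", "r3"],
    "sent2_business_id": ["b1", "b2"],
    "sent2_review_id": ["r2", "r4"],
    "is_same_side": [True, False],
})
reviews = pd.DataFrame({
    "review_id": ["r1", "r2", "r3", "r4"],
    "business_id": ["b1", "b1", "b2", "b2"],
    "text": ["Great food.", "Loved it.", "Friendly staff.", "Terrible service."],
})

pairs_with_text = attach_review_texts(pairs, reviews)
print(pairs_with_text[["sent1_text", "sent2_text", "is_same_side"]])
```

With the real files, `pairs` would come from `pd.read_json("df_test.jsonl", lines=True)` and `reviews` from the Yelp review file loaded the same way. That file is large, so reading it in chunks (the `chunksize` argument of `pd.read_json`) and keeping only the needed IDs may be preferable.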
Created:
2022-06-14