用于推荐系统和协同过滤研究的Jester数据集
收藏帕依提提2024-03-04 收录
下载链接:
https://www.payititi.com/opendatasets/show-2013.html
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含3个子数据集:dataset 1、dataset 3、dataset 4,如下所述: The text for each of the 100 Dataset 1 jokes can be downloaded here: jester_dataset_1_joke_texts.zip (92KB) Format: The ratings data: Format: The text of the 150 Dataset 3 jokes: jester_dataset_2/3_joke_texts.zip (29KB) Format: The Ratings Data, Save to disk, then unzip: jester_dataset_3.zip (6MB) Format: Note that the ratings are real values ranging from -10.00 to +10.00. As of May 2009, the jokes {7, 8, 13, 15, 16, 17, 18, 19} are the "gauge set" (as discussed in the Eigentaste paper) and the jokes {1, 2, 3, 4, 5, 6, 9, 10, 11, 12, 14, 20, 27, 31, 43, 51, 52, 61, 73, 80, 100, 116} were removed (i.e. they are never displayed or rated). The text of the jokes: jester_dataset_4_joke_texts.zip (30KB) Format: The Ratings data: Save to disk, then unzip: jester_dataset_4.zip (1.4MB) Format: Note that the ratings are real values ranging from -10.00 to +10.00. The jokes {1, 2, 3, 4, 5, 6, 9, 10, 11, 12, 14, 20, 27, 31, 43, 51, 52, 61, 73, 80, 100, 116} have been removed (i.e. they are never displayed or rated). As of April 2015, 8 jokes were added. For further information please contact: Ken Goldberg goldberg at berkeley dot edu Prof of IEOR and EECS UC Berkeley (510) 643-9565 (phone)
提供机构:
帕依提提



