
Genre2Movies Dataset

Indexed from paperswithcode.com on 2025-01-21
Download link:
https://paperswithcode.com/dataset/genre2movies
Description:
Genre annotations for movies

The file genre2movies.csv contains genre-movie tuples based on Wikidata annotations (https://www.wikidata.org/).

Data

Each line in genre2movies.csv represents one genre-movie tuple. The first entry is the genre and the second entry is the movie name. There are 83,670 genre-movie tuples.

Joining with the Movielens 20M dataset

The movies considered are from the Movielens 20M corpus: https://grouplens.org/datasets/movielens/20m/

The movie names in genre2movies.csv match the movie 'titles' in Movielens 20M.

Compositions

The directory "compositions" contains movies assigned to compositions of genres. The compositions are of the form: "genre A and genre B", "genre A and not genre B", "genre A and genre B and genre C", "genre A and genre B and not genre C". These assignments have been automatically generated from genre2movies.csv.

We try to generate genre compositions that are useful. For example, for a "genre A and genre B" composition we ensure that genre B is not a subgenre of genre A, because the intersection of a superset with a subset is identical to the subset and does not form a new concept.
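The description above can be illustrated with a minimal Python sketch. It is not the authors' generation code, and it rests on several assumptions: that genre2movies.csv is comma-separated with standard CSV quoting and no header row, that "subgenre" can be approximated by set containment of movie titles (the source does not say how subgenres are detected), and that checking containment in both directions is acceptable. The function names and the sample rows are hypothetical and exist only for illustration. The Movielens side assumes the movieId,title,genres header used by movies.csv.

```python
import csv
import io
from collections import defaultdict


def load_genre_index(lines):
    """Build {genre: set of movie titles} from genre,movie rows.

    Assumption: comma-separated, standard CSV quoting, no header row.
    """
    index = defaultdict(set)
    for row in csv.reader(lines):
        if len(row) < 2:
            continue  # skip blank or malformed rows
        index[row[0]].add(row[1])
    return dict(index)


def title_to_movie_id(lines):
    """Map Movielens titles to movieId (assumes a movieId,title,genres header)."""
    return {row["title"]: int(row["movieId"]) for row in csv.DictReader(lines)}


def is_subgenre(index, child, parent):
    """Assumption: treat 'child is a subgenre of parent' as set containment."""
    return index[child] <= index[parent]


def compose_and(index, a, b):
    """Movies in 'a and b', or None when one genre contains the other."""
    if is_subgenre(index, a, b) or is_subgenre(index, b, a):
        return None  # the intersection would equal the smaller genre
    return index[a] & index[b]


def compose_and_not(index, a, b):
    """Movies in 'a and not b'."""
    return index[a] - index[b]


# Hypothetical sample rows for illustration; not copied from the real files.
GENRE_ROWS = io.StringIO(
    "animation,Toy Story (1995)\n"
    "animation,Balto (1995)\n"
    "comedy,Toy Story (1995)\n"
    'comedy,"American President, The (1995)"\n'
    'romantic comedy,"American President, The (1995)"\n'
)
MOVIE_ROWS = io.StringIO(
    "movieId,title,genres\n"
    "1,Toy Story (1995),Adventure|Animation|Children|Comedy|Fantasy\n"
    '11,"American President, The (1995)",Comedy|Drama|Romance\n'
)

index = load_genre_index(GENRE_ROWS)
ids = title_to_movie_id(MOVIE_ROWS)

print(compose_and(index, "comedy", "animation"))
print(compose_and(index, "comedy", "romantic comedy"))
print(compose_and_not(index, "comedy", "animation"))
print(sorted(ids[t] for t in index["comedy"] if t in ids))
```

In this sketch, "comedy and romantic comedy" is rejected because every sample romantic comedy is already a comedy, which mirrors the superset/subset argument in the description. Three-genre compositions would follow the same pattern by chaining the set operations.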

Provided by:
Papers with Code