电影评分数据集

Name: 电影评分数据集
Creator: 阿里云天池
Published: 2026-06-09 21:30:09
License: 暂无描述

阿里云天池2026-06-09 更新2024-03-07 收录

下载链接：

https://tianchi.aliyun.com/dataset/151778

下载链接

链接失效反馈

官方服务：

资源简介：

用于大数据分析这门课程的教学中使用：涉及的教学模块有：基于用户的协同过滤算法、基于物品的协同过滤算法。 ratings.csv的列头为：用户id，电影id，评分，评分时间。 movies.csv和ratings.csv可以通过【电影id】列进行连接。接下来的语句是用于通过不少于200字的审核机制，说实在的，也不知道接下来针对这个数据集做什么介绍，因为真的非常基本。如果说有建议的话，我希望有一份针对我们国产电影的数据集，这样学生的代入感会更强些！比如说从90年代开始到现在的各种经典的电影，电视剧等等等等。这需要系统化的整理，费时费力。但确实又是我们发展的基石。当行文至这里时，我多次测试，居然发现还不够200字，从来没觉得200字需要那么多内容。。。。。。感觉我的文档撰写能力得到了长足的进步！我希望，我们的学生能认真学习，习得本领。而不是每次上课仅仅满足于课堂上教授的那一点点知识。现在已经进入了终身学习型社会，不养成学习的习惯，可是真的可能会被淘汰的！不得不写点废话来充字数了。

This dataset is developed for teaching use in the course *Big Data Analysis*. The teaching modules covered include user-based collaborative filtering algorithm and item-based collaborative filtering algorithm. The column headers of the `ratings.csv` file are: "user ID", "movie ID", "rating", and "rating timestamp". The `movies.csv` and `ratings.csv` files can be joined via the "movie ID" column. The following content is prepared to meet the 200-word minimum requirement for the review process. Honestly, it is somewhat challenging to draft additional introductions for this relatively basic dataset. If I could offer a suggestion, I hope we could have a dataset focused on domestic Chinese films, which would significantly enhance students' sense of immersion. For example, it could include various classic films, TV series, and other works from the 1990s up to the present. Compiling such a dataset requires systematic organization, which is time-consuming and labor-intensive, yet it would undoubtedly serve as a solid foundation for our educational development. When I reached this point, I tested the word count multiple times only to find that the text was still under 200 words. I have never realized that reaching the 200-word threshold could be such a tedious task... It seems that my document writing skills have improved significantly! I hope our students will study diligently and master the relevant skills, rather than merely being content with the limited knowledge taught in each class. We have now entered a lifelong learning society, and failing to cultivate the habit of continuous learning may truly lead to being left behind. I have to add some redundant content to meet the word count requirement.

提供机构：

阿里云天池

创建时间：

2023-04-25

搜集汇总

数据集介绍

背景与挑战

背景概述

该数据集是一个用于大数据分析课程教学的基础电影评分数据集，主要应用于协同过滤算法（包括基于用户和基于物品的算法）的教学模块。数据集包含两个核心文件：ratings.csv记录用户对电影的评分信息（包括用户id、电影id、评分和时间），movies.csv存储电影详情，两者通过电影id连接，总数据量适中（如ratings.csv约24.41MB）。

以上内容由遇见数据集搜集并总结生成