five

Shree2428/letterboxd-10m-movies-ratings-2025

收藏
Hugging Face2026-01-18 更新2026-03-29 收录
下载链接:
https://hf-mirror.com/datasets/Shree2428/letterboxd-10m-movies-ratings-2025
下载链接
链接失效反馈
官方服务:
资源简介:
--- license: cc-by-nc-sa-4.0 task_categories: - tabular-regression language: - en tags: - movies - ratings - recommendation-system - film - dataset pretty_name: Letterboxd Movies Ratings 10M (2025) size_categories: - 10M<n<100M --- # Letterboxd Movie Ratings 10M (Popular Users - Dec 2025) ## Description This dataset contains approximately **10.4 million movie ratings** collected in December 2025. It focuses on the activity of the 6,519 most popular users from Letterboxd at the time of december 2025 ## Dataset Statistics ### General Metrics | Metric | Value | | :--- | :--- | | **Total Interactions** | 10,462,257 | | **Unique Users** | 6,519 | | **Unique Movies** | 281,045 | | **Matrix Density** | 0.5710% | | **Global Mean Rating** | 3.28 (std: 1.06) | ### Rating Distribution The ratings follow a typical distribution skewed towards positive values (3.0 - 4.0). | Rating | Count | Share | | :--- | :--- | :--- | | **0.5** | 190,095 | 1.8% | | **1.0** | 361,082 | 3.5% | | **1.5** | 365,872 | 3.5% | | **2.0** | 859,407 | 8.2% | | **2.5** | 979,630 | 9.4% | | **3.0** | 1,981,222 | 18.9% | | **3.5** | 1,899,613 | 18.2% | | **4.0** | 2,101,217 | 20.1% | | **4.5** | 875,288 | 8.4% | | **5.0** | 848,831 | 8.1% | ### Activity Statistics (Per User & Per Movie) These statistics illustrate the activity volume. Note the heavy long-tail distribution for movies (75% of movies have 8 ratings or fewer within this specific user group). | Statistic | Ratings per User | Ratings per Movie | | :--- | :--- | :--- | | **Mean** | 1,604.9 | 37.2 | | **Std Dev** | 1,517.4 | 205.9 | | **Min** | 13 | 1 | | **25% (Q1)** | 601 | 1 | | **50% (Median)** | 1,092 | 2 | | **75% (Q3)** | 2,032 | 8 | | **Max** | 9,990 | 5,541 | ## Data Schema The dataset is provided in two formats: **Parquet** (recommended for performance) and **CSV** (for universal compatibility). * **`user`** (uint16): Anonymized numerical identifier for the user. * **`movie_id`** (uint32): Letterboxd internal identifier (slug-based ID) of the movie. * **`title`** (string): English title of the movie. * **`rating`** (float32): Numerical rating on a scale of 0.5 to 5.0. ## Citation and Attribution If you use this dataset in your research or project, please cite it as follows: > [Titouan Mokrani], *Letterboxd Movies Rating 10M (2025)*, available on Hugging Face. ## License and Disclaimer This dataset is distributed under the **CC-BY-NC-SA 4.0** license. * **Non-Commercial:** Use limited to research, education and personnals projects. * **Ownership:** Original data belongs to Letterboxd and its community contributors.
提供机构:
Shree2428
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作