Shree2428/letterboxd-10m-movies-ratings-2025
收藏Hugging Face2026-01-18 更新2026-03-29 收录
下载链接:
https://hf-mirror.com/datasets/Shree2428/letterboxd-10m-movies-ratings-2025
下载链接
链接失效反馈官方服务:
资源简介:
---
license: cc-by-nc-sa-4.0
task_categories:
- tabular-regression
language:
- en
tags:
- movies
- ratings
- recommendation-system
- film
- dataset
pretty_name: Letterboxd Movies Ratings 10M (2025)
size_categories:
- 10M<n<100M
---
# Letterboxd Movie Ratings 10M (Popular Users - Dec 2025)
## Description
This dataset contains approximately **10.4 million movie ratings** collected in December 2025. It focuses on the activity of the 6,519 most popular users from Letterboxd at the time of december 2025
## Dataset Statistics
### General Metrics
| Metric | Value |
| :--- | :--- |
| **Total Interactions** | 10,462,257 |
| **Unique Users** | 6,519 |
| **Unique Movies** | 281,045 |
| **Matrix Density** | 0.5710% |
| **Global Mean Rating** | 3.28 (std: 1.06) |
### Rating Distribution
The ratings follow a typical distribution skewed towards positive values (3.0 - 4.0).
| Rating | Count | Share |
| :--- | :--- | :--- |
| **0.5** | 190,095 | 1.8% |
| **1.0** | 361,082 | 3.5% |
| **1.5** | 365,872 | 3.5% |
| **2.0** | 859,407 | 8.2% |
| **2.5** | 979,630 | 9.4% |
| **3.0** | 1,981,222 | 18.9% |
| **3.5** | 1,899,613 | 18.2% |
| **4.0** | 2,101,217 | 20.1% |
| **4.5** | 875,288 | 8.4% |
| **5.0** | 848,831 | 8.1% |
### Activity Statistics (Per User & Per Movie)
These statistics illustrate the activity volume. Note the heavy long-tail distribution for movies (75% of movies have 8 ratings or fewer within this specific user group).
| Statistic | Ratings per User | Ratings per Movie |
| :--- | :--- | :--- |
| **Mean** | 1,604.9 | 37.2 |
| **Std Dev** | 1,517.4 | 205.9 |
| **Min** | 13 | 1 |
| **25% (Q1)** | 601 | 1 |
| **50% (Median)** | 1,092 | 2 |
| **75% (Q3)** | 2,032 | 8 |
| **Max** | 9,990 | 5,541 |
## Data Schema
The dataset is provided in two formats: **Parquet** (recommended for performance) and **CSV** (for universal compatibility).
* **`user`** (uint16): Anonymized numerical identifier for the user.
* **`movie_id`** (uint32): Letterboxd internal identifier (slug-based ID) of the movie.
* **`title`** (string): English title of the movie.
* **`rating`** (float32): Numerical rating on a scale of 0.5 to 5.0.
## Citation and Attribution
If you use this dataset in your research or project, please cite it as follows:
> [Titouan Mokrani], *Letterboxd Movies Rating 10M (2025)*, available on Hugging Face.
## License and Disclaimer
This dataset is distributed under the **CC-BY-NC-SA 4.0** license.
* **Non-Commercial:** Use limited to research, education and personnals projects.
* **Ownership:** Original data belongs to Letterboxd and its community contributors.
提供机构:
Shree2428



