nuriyev/lichess-elite
收藏Hugging Face2025-12-23 更新2026-03-29 收录
下载链接:
https://hf-mirror.com/datasets/nuriyev/lichess-elite
下载链接
链接失效反馈官方服务:
资源简介:
---
license: cc0-1.0
task_categories:
- text-generation
- reinforcement-learning
language:
- en
tags:
- chess
- games
- lichess
- pgn
- openings
pretty_name: Lichess Elite Database
size_categories:
- 10M<n<100M
---
# Lichess Elite Database
A curated collection of high-level chess games from [Lichess.org](https://lichess.org), extracted from the [Lichess Elite Database](https://database.nikonoel.fr/) by nikonoel.
## Dataset Description
This dataset contains **26.3 million games** played by strong players on Lichess, filtered to include only:
- **White player**: rated 2400+ (2500+ from Dec 2021)
- **Black player**: rated 2200+ (2300+ from Dec 2021)
- **Time controls**: Rapid and Blitz (bullet excluded)
- **Variant**: Standard chess only
### Source
Original data compiled by [nikonoel](https://lichess.org/@/nikonoel) from the [Lichess Open Database](https://database.lichess.org/).
## Dataset Structure
| Column | Type | Description |
|--------|------|-------------|
| `event` | string | Game event (e.g., "Rated Blitz game") |
| `site` | string | Lichess game URL |
| `white` | string | White player username |
| `black` | string | Black player username |
| `result` | string | Game result: `1-0`, `0-1`, `1/2-1/2` |
| `utcdate` | string | Game date (YYYY.MM.DD) |
| `utctime` | string | Game start time (HH:MM:SS) |
| `whiteelo` | int | White player's rating |
| `blackelo` | int | Black player's rating |
| `whiteratingdiff` | string | White's rating change |
| `blackratingdiff` | string | Black's rating change |
| `eco` | string | ECO opening code |
| `opening` | string | Opening name |
| `timecontrol` | string | Time control (e.g., "180+0", "600+0") |
| `termination` | string | How game ended (Normal, Time forfeit, etc.) |
| `moves` | string | Full game in SAN notation |
## Use Cases
- **Opening preparation**: Study how strong players handle specific openings
- **Chess engine training**: Train or fine-tune chess models
- **Game analysis**: Statistical analysis of high-level play
- **Move prediction**: Sequence modeling for next-move prediction
- **Style analysis**: Study playing patterns of titled players
## Usage
```python
from datasets import load_dataset
dataset = load_dataset("nuriyev/lichess-elite")
# Filter by opening
sicilian = dataset["train"].filter(lambda x: x["eco"].startswith("B"))
# Filter by rating
super_gm = dataset["train"].filter(lambda x: x["whiteelo"] >= 2700)
```
## Citation
If you use this dataset, please credit the original source:
```bibtex
@misc{lichess_elite_database,
author = {nikonoel},
title = {Lichess Elite Database},
year = {2020-2025},
url = {https://database.nikonoel.fr/}
}
```
## License
The original Lichess game data is released under [CC0 1.0](https://creativecommons.org/publicdomain/zero/1.0/).
## Acknowledgments
- [nikonoel](https://lichess.org/@/nikonoel) for curating and maintaining the Lichess Elite Database
- [Lichess.org](https://lichess.org) for making game data freely available
license: CC0 1.0
task_categories:
- 文本生成
- 强化学习
language:
- 英语
tags:
- 国际象棋
- 游戏
- Lichess(lichess)
- 可移植对局格式(Portable Game Notation,PGN)
- 开局
pretty_name: Lichess精英数据库
size_categories:
- 1000万 < 数据量 < 1亿
# Lichess精英数据库
本数据集为经nikonoel从[Lichess精英数据库](https://database.nikonoel.fr/)提取、源自[Lichess.org](https://lichess.org)的高规格国际象棋对局精心整理合集。
## 数据集概览
本数据集包含**2630万局对局**,均来自Lichess平台的顶尖棋手,仅收录符合以下条件的对局:
- **执白方**:等级分≥2400(2021年12月起调整为≥2500)
- **执黑方**:等级分≥2200(2021年12月起调整为≥2300)
- **时间控制规则**:仅收录快棋(Rapid)与超快棋(Blitz)对局,排除子弹棋(Bullet)
- **对局变体**:仅收录标准国际象棋对局
## 数据来源
原始数据由[nikonoel](https://lichess.org/@/nikonoel)从[Lichess公开数据库](https://database.lichess.org/)汇编整理。
## 数据集结构
| 列名 | 数据类型 | 描述 |
|------|----------|------|
| `event` | 字符串 | 对局赛事类型(例如:"评级超快棋对局") |
| `site` | 字符串 | Lichess对局链接 |
| `white` | 字符串 | 执白方用户名 |
| `black` | 字符串 | 执黑方用户名 |
| `result` | 字符串 | 对局结果:`1-0`(白胜)、`0-1`(黑胜)、`1/2-1/2`(和棋) |
| `utcdate` | 字符串 | 对局日期(格式:YYYY.MM.DD) |
| `utctime` | 字符串 | 对局开始时间(格式:HH:MM:SS) |
| `whiteelo` | 整数 | 执白方等级分 |
| `blackelo` | 整数 | 执黑方等级分 |
| `whiteratingdiff` | 字符串 | 执白方对局后等级分变化 |
| `blackratingdiff` | 字符串 | 执黑方对局后等级分变化 |
| `eco` | 字符串 | 国际象棋开局分类编码(Encyclopedia of Chess Openings,ECO) |
| `opening` | 字符串 | 开局名称 |
| `timecontrol` | 字符串 | 时间控制规则(例如:"180+0"、"600+0") |
| `termination` | 字符串 | 对局结束方式(正常结束、超时判负等) |
| `moves` | 字符串 | 标准代数记谱法(Standard Algebraic Notation,SAN)格式的完整对局着法序列 |
## 应用场景
- **开局筹备**:研究顶尖棋手针对特定开局的对局策略
- **国际象棋引擎训练**:用于训练或微调国际象棋模型
- **对局分析**:针对高水平对局开展统计研究
- **着法预测**:用于下一步着法预测的序列建模任务
- **棋风分析**:研究职业棋手的对局风格特征
## 使用示例
python
from datasets import load_dataset
dataset = load_dataset("nuriyev/lichess-elite")
# 按开局特征过滤数据集
sicilian = dataset["train"].filter(lambda x: x["eco"].startswith("B"))
# 按等级分过滤,筛选特级大师对局
super_gm = dataset["train"].filter(lambda x: x["whiteelo"] >= 2700)
## 引用规范
若使用本数据集,请注明原始来源:
bibtex
@misc{lichess_elite_database,
author = {nikonoel},
title = {Lichess精英数据库},
year = {2020-2025},
url = {https://database.nikonoel.fr/}
}
## 许可协议
原始Lichess对局数据遵循[CC0 1.0 通用公共领域奉献协议](https://creativecommons.org/publicdomain/zero/1.0/)发布。
## 致谢
- 感谢[nikonoel](https://lichess.org/@/nikonoel)整理并维护Lichess精英数据库
- 感谢[Lichess.org](https://lichess.org)免费开放对局数据
提供机构:
nuriyev



