five

aaayyyhhh/UsersCodeforcesSubmissionsEnd2024

收藏
Hugging Face2026-03-24 更新2026-03-29 收录
下载链接:
https://hf-mirror.com/datasets/aaayyyhhh/UsersCodeforcesSubmissionsEnd2024
下载链接
链接失效反馈
官方服务:
资源简介:
--- datasets: - name: usersCodeforcesSubmissionsEnd2024 size: 200MB task_categories: - other languages: - en licenses: - cc-by-4.0 tags: - codeforces - competitive-programming - submissions pretty_name: Codeforces Users Submissions (End of 2024) description: > This dataset contains anonymized submission data of ≈15,000 Codeforces users, spanning from the inception of Codeforces to the end of November 2024. download_size: 100MB dataset_size: 1.2GB license: cc-by-4.0 language: - en tags: - codeforces - submissions size_categories: - 10M<n<100M --- # Codeforces Users Submissions Dataset (End of 2024) This project provides the usersCodeforcesSubmissionsEnd2024.csv file, containing anonymized submission data of approximately 15,000 active Codeforces users. The dataset includes all submissions from the inception of Codeforces up to the end of November 2024. It is designed to support AI and data-driven projects. (All data was collected using the open Codeforces API) ## Dataset Overview The file contains **17,607,999 rows** with the following columns: - **`handle`**: An anonymized and shuffled user nickname (e.g., `user{i}`). - **`rating_at_submission`**: User's rating at the time of submission. - **`problem_rating`**: Problem difficulty rating. - **`id_of_submission_task`**: Unique problem identifier on Codeforces. - **`verdict`**: Result of the submission (e.g., `OK`, `WRONG_ANSWER`). - **`time`**: Time of submission (in seconds since the Unix epoch). ## Purpose of the Dataset 1. **AI Development**: This dataset can be used to create intelligent systems to enhance user learning on Codeforces. _(Example: The author of this dataset is currently working on the project "Codeforces User Analysis System for Generating Individual Training Recommendations," which aims to recommend tasks to users based on their weaknesses.)_ 2. **Time Saving**: Collecting such data manually can be time-consuming (it took ≈7 hours for this dataset). By providing it in a ready-to-use format, we aim to save your time and effort. 3. **Reduce Server Load**: This dataset minimizes repetitive data scraping, thereby reducing the load on Codeforces servers. ## License This dataset is shared under the [CC BY 4.0 License](https://creativecommons.org/licenses/by/4.0/). You are free to use it for your projects with proper attribution. ## How to Use 1. Download the `usersCodeforcesSubmissionsEnd2024.zip` file. 2. Unzip the file to access the `usersCodeforcesSubmissionsEnd2024.csv` dataset: - On Linux/macOS: Use the `unzip` command in the terminal. - On Windows: Right-click the file and select "Extract All." 3. Load the CSV file into your favorite data analysis tool: ```python import pandas as pd df = pd.read_csv("usersCodeforcesSubmissionsEnd2024.csv") # Good luck with your projects :)

--- 数据集基本信息: - 数据集名称:usersCodeforcesSubmissionsEnd2024 - 原始包大小:200MB - 任务类别:其他 - 支持语言:英语 - 许可协议:知识共享署名4.0(CC BY 4.0) - 标签:Codeforces、程序设计竞赛、提交记录 - 展示名称:Codeforces 用户提交记录数据集(2024年末) - 数据集描述:本数据集包含约15000名Codeforces用户的匿名化提交数据,时间跨度从Codeforces平台创立之初至2024年11月末。 - 下载包大小:100MB - 解压后数据集存储大小:1.2GB 全局许可协议:知识共享署名4.0(CC BY 4.0) 全局支持语言:英语 全局标签:Codeforces、提交记录 数据规模分类:1000万<n<1亿条数据 --- # Codeforces 用户提交记录数据集(2024年末) 本项目提供`usersCodeforcesSubmissionsEnd2024.csv`文件,其中包含约15000名活跃Codeforces用户的匿名化提交数据。数据集涵盖了从Codeforces平台创立至2024年11月末的全部用户提交记录,旨在为人工智能与数据驱动类项目提供支撑(所有数据均通过公开的Codeforces API采集)。 ## 数据集概览 该文件包含**17607999行**数据,具备以下字段: - **`handle`**:经过匿名化与打乱处理的用户昵称(例如`user{i}`格式)。 - **`rating_at_submission`**:用户提交代码时的账号评分。 - **`problem_rating`**:题目难度评分。 - **`id_of_submission_task`**:Codeforces平台上的唯一题目标识符。 - **`verdict`**:提交结果(例如`OK`代表通过,`WRONG_ANSWER`代表答案错误)。 - **`time`**:提交时间(以Unix纪元以来的秒数为单位)。 ## 数据集用途 1. **人工智能开发**:本数据集可用于构建智能系统,以优化Codeforces平台上的用户学习体验。 (示例:本数据集的作者正在推进“面向个性化训练推荐的Codeforces用户分析系统”项目,该系统旨在根据用户的知识短板为其匹配合适的编程题目。) 2. **节省时间成本**:手动采集此类数据集需耗费大量时间(本数据集的采集过程耗时约7小时)。本项目以可直接使用的格式提供数据,旨在为您节省精力与时间。 3. **减轻服务器负载**:本数据集可减少重复的数据爬取行为,从而降低Codeforces平台的服务器负载。 ## 许可协议 本数据集采用[CC BY 4.0许可协议](https://creativecommons.org/licenses/by/4.0/)进行共享。您可在注明原作者的前提下,自由将其用于各类项目中。 ## 使用方法 1. 下载`usersCodeforcesSubmissionsEnd2024.zip`文件。 2. 解压该文件以获取`usersCodeforcesSubmissionsEnd2024.csv`数据集: - 在Linux/macOS系统中:在终端中使用`unzip`命令进行解压。 - 在Windows系统中:右键点击文件并选择“全部提取”。 3. 将CSV文件加载至您偏好的数据分析工具中: python import pandas as pd df = pd.read_csv("usersCodeforcesSubmissionsEnd2024.csv") 祝你的项目顺利 :)
提供机构:
aaayyyhhh
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作