CodeSCAN: ScreenCast ANalysis for Video Programming Tutorials
收藏NIAID Data Ecosystem2026-05-02 收录
下载链接:
https://zenodo.org/record/10939236
下载链接
链接失效反馈官方服务:
资源简介:
CodeSCAN is the first large-scale and diverse dataset of coding screenshots with pixel-perfect annotations. It features:
24 popular programming languages (according to Github)
100 random repositories per language (with MIT, BSD-3 or WTFPL License), i.e. 2.400 repositories in total
Per repository we use 5 files, i.e. 12.000 files in total
~100 different themes and 25 different fonts
Diverse layouts changes, such as menu bar visibility, sidebar position, output window content, etc.
Numerous realistic interactions such as searching, typing and selecting within a file, etc.
Check our project page for details. The dataset is for academic research use only.
创建时间:
2024-10-04



