five

zzsi/deep-scores-v2

收藏
Hugging Face2026-02-25 更新2026-03-29 收录
下载链接:
https://hf-mirror.com/datasets/zzsi/deep-scores-v2
下载链接
链接失效反馈
官方服务:
资源简介:
--- license: cc-by-4.0 task_categories: - object-detection language: - en tags: - music - optical-music-recognition - omr - sheet-music - symbol-detection size_categories: - 100K<n<1M --- # DeepScoresV2 — Complete A HuggingFace-formatted mirror of the **complete** version of the [DeepScoresV2](https://zenodo.org/records/4012193) dataset for music object detection. ## Dataset description DeepScoresV2 is a large-scale dataset of synthetically rendered music score pages annotated with bounding boxes for musical symbols. The **complete** version contains **255,385 images** with **151 million annotated instances** across 135 symbol classes. Each image is a full score page rendered from MuseScore across 5 music fonts (beethoven, emmentaler, gonville, gutenberg1939, lilyjazz). Annotations use COCO format: `bbox` is `[x, y, width, height]` in pixel coordinates. ## Format ```python { "image_id": int, "file_name": str, "font": str, # music font used to render "image": PIL.Image, "width": int, "height": int, "annotation_set": str, # "deepscores" or "muscima++" "objects": { "id": List[int], "bbox": List[List[float]], # [x, y, w, h], COCO format "category_id": List[int], "category": List[str], "area": List[float], }, } ``` ## Usage ```python from datasets import load_dataset ds = load_dataset("zzsi/deep-scores-v2") example = ds["train"][0] print(example["objects"]["category"][:5]) example["image"].show() ``` ## License [Creative Commons Attribution 4.0 International (CC BY 4.0)](https://creativecommons.org/licenses/by/4.0/) ## Attribution This dataset is a reformatted mirror of DeepScoresV2. Please cite the original work: ```bibtex @inproceedings{DeepScoresV2, title = {DeepScoresV2: A Dataset for Music Object Detection with a Challenging Test Set}, author = {Tuggener, Lukas and Satyawan, Yvan Putra and Pacha, Alexander and Schmidhuber, J{\"u}rgen and Stadelmann, Thilo}, booktitle = {British Machine Vision Conference (BMVC)}, year = {2021} } ``` Original dataset: <https://zenodo.org/records/4012193> Original authors: Lukas Tuggener, Yvan Putra Satyawan, Alexander Pacha, Jürgen Schmidhuber, Thilo Stadelmann (ZHAW / IDSIA)

license: CC BY 4.0 task_categories: - 目标检测(object-detection) language: - 英语 tags: - 音乐 - 光学音乐识别(optical-music-recognition) - OMR - 乐谱(sheet-music) - 符号检测(symbol-detection) size_categories: - 10万<样本数<100万 --- # DeepScoresV2 — 完整版 本数据集为音乐目标检测专用数据集DeepScoresV2完整版的HuggingFace格式镜像副本,原始数据集链接为<https://zenodo.org/records/4012193>。 ## 数据集说明 DeepScoresV2是一个大规模合成渲染乐谱页面数据集,针对音乐符号标注了边界框。其完整版包含255,385张图像,涵盖135个符号类别,总计1.51亿个标注实例。 所有图像均为基于MuseScore、使用5种音乐字体(beethoven、emmentaler、gonville、gutenberg1939、lilyjazz)渲染得到的完整乐谱页面。标注采用COCO格式:`bbox`为像素坐标下的`[x, y, 宽度, 高度]`。 ## 数据格式 python { "image_id": int, // 图像ID "file_name": str, // 文件名 "font": str, // 渲染所用音乐字体 "image": PIL.Image, // PIL图像对象 "width": int, // 图像宽度 "height": int, // 图像高度 "annotation_set": str, // 标注集,可选"deepscores"或"muscima++" "objects": { "id": List[int], // 实例ID列表 "bbox": List[List[float]], // COCO格式像素坐标框[x, y, 宽度, 高度] "category_id": List[int], // 类别ID列表 "category": List[str], // 类别名称列表 "area": List[float], // 实例面积列表 }, } ## 使用方法 python from datasets import load_dataset ds = load_dataset("zzsi/deep-scores-v2") example = ds["train"][0] print(example["objects"]["category"][:5]) example["image"].show() ## 许可证 [知识共享署名4.0国际许可协议(CC BY 4.0)](https://creativecommons.org/licenses/by/4.0/) ## 引用声明 本数据集为DeepScoresV2的格式重构镜像副本,请引用原始文献: bibtex @inproceedings{DeepScoresV2, title = {DeepScoresV2: A Dataset for Music Object Detection with a Challenging Test Set}, author = {Tuggener, Lukas and Satyawan, Yvan Putra and Pacha, Alexander and Schmidhuber, J{"u}rgen and Stadelmann, Thilo}, booktitle = {British Machine Vision Conference (BMVC)}, year = {2021} } 原始数据集链接:<https://zenodo.org/records/4012193> 原始作者:Lukas Tuggener、Yvan Putra Satyawan、Alexander Pacha、Jürgen Schmidhuber、Thilo Stadelmann(ZHAW / IDSIA)
提供机构:
zzsi
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作