zzsi/deep-scores-v2

Name: zzsi/deep-scores-v2
Creator: zzsi
Published: 2026-02-25 00:26:33
License: 暂无描述

Hugging Face2026-02-25 更新2026-03-29 收录

下载链接：

https://hf-mirror.com/datasets/zzsi/deep-scores-v2

下载链接

链接失效反馈

官方服务：

资源简介：

--- license: cc-by-4.0 task_categories: - object-detection language: - en tags: - music - optical-music-recognition - omr - sheet-music - symbol-detection size_categories: - 100K<n<1M --- # DeepScoresV2 — Complete A HuggingFace-formatted mirror of the **complete** version of the [DeepScoresV2](https://zenodo.org/records/4012193) dataset for music object detection. ## Dataset description DeepScoresV2 is a large-scale dataset of synthetically rendered music score pages annotated with bounding boxes for musical symbols. The **complete** version contains **255,385 images** with **151 million annotated instances** across 135 symbol classes. Each image is a full score page rendered from MuseScore across 5 music fonts (beethoven, emmentaler, gonville, gutenberg1939, lilyjazz). Annotations use COCO format: `bbox` is `[x, y, width, height]` in pixel coordinates. ## Format ```python { "image_id": int, "file_name": str, "font": str, # music font used to render "image": PIL.Image, "width": int, "height": int, "annotation_set": str, # "deepscores" or "muscima++" "objects": { "id": List[int], "bbox": List[List[float]], # [x, y, w, h], COCO format "category_id": List[int], "category": List[str], "area": List[float], }, } ``` ## Usage ```python from datasets import load_dataset ds = load_dataset("zzsi/deep-scores-v2") example = ds["train"][0] print(example["objects"]["category"][:5]) example["image"].show() ``` ## License [Creative Commons Attribution 4.0 International (CC BY 4.0)](https://creativecommons.org/licenses/by/4.0/) ## Attribution This dataset is a reformatted mirror of DeepScoresV2. Please cite the original work: ```bibtex @inproceedings{DeepScoresV2, title = {DeepScoresV2: A Dataset for Music Object Detection with a Challenging Test Set}, author = {Tuggener, Lukas and Satyawan, Yvan Putra and Pacha, Alexander and Schmidhuber, J{\"u}rgen and Stadelmann, Thilo}, booktitle = {British Machine Vision Conference (BMVC)}, year = {2021} } ``` Original dataset: <https://zenodo.org/records/4012193> Original authors: Lukas Tuggener, Yvan Putra Satyawan, Alexander Pacha, Jürgen Schmidhuber, Thilo Stadelmann (ZHAW / IDSIA)

license: CC BY 4.0 task_categories: - 目标检测（object-detection） language: - 英语 tags: - 音乐 - 光学音乐识别（optical-music-recognition） - OMR - 乐谱（sheet-music） - 符号检测（symbol-detection） size_categories: - 10万<样本数<100万 --- # DeepScoresV2 — 完整版本数据集为音乐目标检测专用数据集DeepScoresV2完整版的HuggingFace格式镜像副本，原始数据集链接为<https://zenodo.org/records/4012193>。 ## 数据集说明 DeepScoresV2是一个大规模合成渲染乐谱页面数据集，针对音乐符号标注了边界框。其完整版包含255,385张图像，涵盖135个符号类别，总计1.51亿个标注实例。所有图像均为基于MuseScore、使用5种音乐字体（beethoven、emmentaler、gonville、gutenberg1939、lilyjazz）渲染得到的完整乐谱页面。标注采用COCO格式：`bbox`为像素坐标下的`[x, y, 宽度, 高度]`。 ## 数据格式 python { "image_id": int, // 图像ID "file_name": str, // 文件名 "font": str, // 渲染所用音乐字体 "image": PIL.Image, // PIL图像对象 "width": int, // 图像宽度 "height": int, // 图像高度 "annotation_set": str, // 标注集，可选"deepscores"或"muscima++" "objects": { "id": List[int], // 实例ID列表 "bbox": List[List[float]], // COCO格式像素坐标框[x, y, 宽度, 高度] "category_id": List[int], // 类别ID列表 "category": List[str], // 类别名称列表 "area": List[float], // 实例面积列表 }, } ## 使用方法 python from datasets import load_dataset ds = load_dataset("zzsi/deep-scores-v2") example = ds["train"][0] print(example["objects"]["category"][:5]) example["image"].show() ## 许可证 [知识共享署名4.0国际许可协议（CC BY 4.0）](https://creativecommons.org/licenses/by/4.0/) ## 引用声明本数据集为DeepScoresV2的格式重构镜像副本，请引用原始文献： bibtex @inproceedings{DeepScoresV2, title = {DeepScoresV2: A Dataset for Music Object Detection with a Challenging Test Set}, author = {Tuggener, Lukas and Satyawan, Yvan Putra and Pacha, Alexander and Schmidhuber, J{"u}rgen and Stadelmann, Thilo}, booktitle = {British Machine Vision Conference (BMVC)}, year = {2021} } 原始数据集链接：<https://zenodo.org/records/4012193> 原始作者：Lukas Tuggener、Yvan Putra Satyawan、Alexander Pacha、Jürgen Schmidhuber、Thilo Stadelmann（ZHAW / IDSIA）

提供机构：

zzsi

5,000+

优质数据集

54 个

任务类型

进入经典数据集