Schenkerian Analysis Dataset

Name: Schenkerian Analysis Dataset
Creator: 杜克大学
Published: 2024-08-14 03:52:06
License: 暂无描述

arXiv2024-08-14 更新2024-08-16 收录

下载链接：

http://arxiv.org/abs/2408.07184v1

下载链接

链接失效反馈

官方服务：

资源简介：

Schenkerian Analysis Dataset是由杜克大学创建的一个大型数据集，专门用于计算Schenkerian音乐分析。该数据集包含145个分析，涵盖了从巴赫到肖斯塔科维奇等多个作曲家的作品。数据集的内容主要描述了音乐作品中的层次结构关系，特别是赋格主题的层次关系。数据集的创建过程涉及多位资深Schenkerian分析学者，他们通过专门的软件工具进行数据收集和可视化。该数据集的应用领域主要是在音乐信息检索和音乐生成任务中，旨在通过机器学习模型更好地理解和生成音乐结构。

Schenkerian Analysis Dataset is a large-scale dataset developed by Duke University, exclusively tailored for computational Schenkerian music analysis. It comprises 145 analytical entries covering musical works created by a diverse range of composers, spanning from Johann Sebastian Bach to Dmitri Shostakovich. The dataset primarily documents the hierarchical structural relationships inherent in musical compositions, with a particular focus on the hierarchical relationships of fugue subjects. The creation of this dataset involved several senior Schenkerian analysis scholars, who utilized specialized software tools for data collection and visualization. Its core application scenarios include music information retrieval and music generation tasks, with the goal of empowering machine learning models to gain a deeper understanding of musical structures and generate such structures more effectively.

提供机构：

杜克大学

创建时间：

2024-08-14

搜集汇总

数据集介绍

构建方式

Schenkerian Analysis Dataset（Schenkerian Analysis Dataset）的构建旨在促进计算机音乐社区对Schenkerian分析（SchA）的研究和应用。该数据集由Duke University的研究团队创建，目前包含超过140个音乐作品片段的Schenkerian分析，是目前为止最大的、同时以人类和计算机可读格式存在的Schenkerian数据集。数据集的构建过程涉及多位经验丰富的Schenkerian学者，他们使用新的计算机记谱软件进行数据的收集和可视化，从而确保了数据的质量和可用性。该软件能够对多达四个声部结构进行记谱，并允许用户调整音高、音深、和声/音阶度标签等参数。数据集以JavaScript Object Notation（JSON）格式存储，包含关于分析、作曲家、标题、副标题等元数据，以及每个理论声部的信息，如音名、音深、Ursatz索引、音阶度/罗马数字等。此外，数据集还存储了跨声部的符号，如声部交换线和表示较长跨度时间内相关音的线条。数据集的构建过程是一个持续的过程，旨在随着时间的推移不断扩大和更新。

特点

Schenkerian Analysis Dataset的特点在于其规模之大、数据质量之高以及易于使用的格式。数据集包含了从多位经验丰富的Schenkerian学者那里收集到的分析，涵盖了包括J.S. Bach、Mendelssohn、Brahms、Bartók、Shostakovich、Gentle Giant等作曲家的作品。数据集以人类和计算机可读的格式存在，使得研究人员可以轻松地使用这些数据来进行音乐信息检索（MIR）和音乐生成任务。数据集还包含了新的计算机记谱软件，该软件能够对Schenkerian分析进行可视化，并生成易于理解的图形表示。此外，数据集还提供了一种新颖的、灵活的Schenkerian分析表示方法，即将其作为异构边图数据结构，这为研究人员提供了更多的可能性来探索和分析音乐结构。数据集的构建过程是一个持续的过程，旨在随着时间的推移不断扩大和更新，从而为计算机音乐社区提供更多的研究资源。

使用方法

Schenkerian Analysis Dataset的使用方法包括以下步骤：1. 数据获取：研究人员可以通过联系Duke University的研究团队获取Schenkerian Analysis Dataset。2. 数据处理：研究人员可以使用提供的计算机记谱软件对数据进行可视化，并生成易于理解的图形表示。3. 数据分析：研究人员可以使用机器学习模型对数据进行训练，以学习音乐结构，并生成更“人性化”的音乐作品。4. 结果评估：研究人员可以使用评估指标来评估机器学习模型的效果，并根据结果进行进一步的改进。5. 数据更新：随着时间的推移，研究人员可以不断更新和扩展Schenkerian Analysis Dataset，以提供更多的研究资源。Schenkerian Analysis Dataset的使用方法简单易行，可以为研究人员提供丰富的音乐结构数据，帮助他们更好地理解音乐并生成更高质量的音乐作品。

背景与挑战

背景概述

Schenkerian Analysis (SchA) is a method of music analysis that combines melody, harmony, counterpoint, and form to describe the hierarchical structure of a musical work. This dataset, developed by researchers at Duke University, is the largest collection of SchA data available in both human- and computer-readable formats, containing over 140 excerpts. The dataset is intended to facilitate research in computational Schenkerian analysis, which could lead to machine learning models with a deeper understanding of musical structure and potentially more human-like results in music understanding and generation. The dataset also includes a novel software for visualization and collection of SchA data, and a flexible representation of SchA as a heterogeneous-edge graph data structure.

当前挑战

The main challenges associated with this dataset and computational Schenkerian analysis include the need for high-quality data and the complexity of the Schenkerian theory itself. The current dataset is a significant step forward but still requires further expansion and refinement to improve the quality and quantity of the data. Additionally, the Schenkerian theory is complex and requires a deep understanding of music theory, which poses a challenge for researchers and developers working in this field. Another challenge is the representation of SchA as a graph data structure, which allows for more flexibility but also introduces new complexities in the modeling and analysis of musical data. Furthermore, the dataset needs to address the limitations of the previous Schenker41 dataset, such as the questionable quality of some excerpts and the lack of representation for certain Schenkerian symbols and concepts.

常用场景

经典使用场景

Schenkerian Analysis Dataset主要用于计算机音乐研究中的音乐分析和生成任务。该数据集通过提供大量的人类可读和计算机可读的Schenkerian分析数据，帮助机器学习模型更深入地理解音乐结构，从而产生更具艺术性和理论性的结构特征。例如，该数据集可用于训练生成模型，使其能够生成具有更复杂和多层次结构的音乐作品，或者在音乐信息检索任务中，帮助模型更好地识别和分类音乐作品。

衍生相关工作

Schenkerian Analysis Dataset的提出衍生了许多相关的研究工作。例如，研究者们基于该数据集开发了一系列的音乐生成模型，这些模型能够生成具有更复杂和多层次结构的音乐作品；此外，该数据集还被用于音乐信息检索任务，帮助模型更好地识别和分类音乐作品；同时，该数据集还推动了Schenkerian分析在音乐教育和研究中的应用，为相关领域的研究提供了有力支持。

数据集最近研究