
Main Dataset for "Evolution of Popular Music: USA 1960–2010"

Source: NIAID Data Ecosystem (indexed 2026-03-08)
Download link:
https://figshare.com/articles/dataset/Main_Dataset_for_Evolution_of_Popular_Music_USA_1960_2010_/1309953
Description:
This is a large file (~20MB) called EvolutionPopUSA_MainData.csv, in comma-separated data format with column headers. Each row corresponds to a recording. The file is viewable in any text editor, and can also be opened in Excel or imported into other data processing programs. Below is a list of the column headers, with annotations.

public_id: unique ID of the recording
artist_name: name of the recording artist
artist_name_clean: artist name in all upper case, with no spaces and with secondary artists ("featuring") removed
track_name: name of the track, i.e. usually the name of the song
first_entry: date of the first entry into the Billboard Hot 100
quarter, year, fiveyear, decade: transformations of first_entry to coarser time periods
era: era the track belongs to (1,...,4), as determined by Foote segmentation on the PC data (see below)
cluster: cluster membership of the track, as derived by k-means clustering on the PC data (see below)
hTopic_01, ... , hTopic_08: harmonic Topic weights, see description in the paper
tTopic_01, ... , tTopic_08: timbral Topic weights, see description in the paper
PC1, ... , PC14: principal components of the harmonic and timbral Topics
harm_…: 193 columns of chord change counts; the chord change is indicated in the column label (e.g. harm_M.2.M means major chord followed by another major chord 2 semitones up)
timb_01, ... , timb_35: 35 columns of timbre class counts (see description in supplementary information)
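As a rough illustration of the column layout described above, the following Python sketch groups a header row into the column families named in the description and splits a chord-change label into its parts. It uses only the standard library and a tiny made-up header row rather than the real file. The helper names (group_columns, parse_chord_change, read_header) are hypothetical, and the assumption that every harm_ label follows the three-part pattern of the harm_M.2.M example is not confirmed by the description.

```python
import csv
import io
from collections import defaultdict

# Prefixes taken from the column description; treating every other
# column as "metadata" is an assumption made for this sketch.
PREFIXES = ("hTopic_", "tTopic_", "PC", "harm_", "timb_")


def group_columns(header):
    """Group column names by the prefix families listed in the description."""
    groups = defaultdict(list)
    for name in header:
        family = next((p for p in PREFIXES if name.startswith(p)), "metadata")
        groups[family].append(name)
    return dict(groups)


def parse_chord_change(label):
    """Split a label such as 'harm_M.2.M' into (first chord, semitone step, second chord).

    Assumes the three dot-separated parts seen in the documented example;
    chord codes other than 'M' are passed through without interpretation.
    """
    first, step, second = label[len("harm_"):].split(".")
    return first, int(step), second


def read_header(csv_file):
    """Return the first row of an open CSV file as a list of column names."""
    return next(csv.reader(csv_file))


# Made-up header row standing in for the real file (illustration only).
sample = io.StringIO(
    "public_id,artist_name,year,hTopic_01,tTopic_01,PC1,harm_M.2.M,timb_01\n"
)
groups = group_columns(read_header(sample))
print(groups["metadata"])
print(parse_chord_change("harm_M.2.M"))  # ('M', 2, 'M')
```

To try this on the downloaded file, the in-memory sample could be replaced with open("EvolutionPopUSA_MainData.csv", newline=""); the file's text encoding is not stated in the description, so it may need to be specified explicitly.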

Created:
2015-04-06