Frankieeee21/spotify-analysis-dataset
收藏Hugging Face2025-11-18 更新2025-12-20 收录
下载链接:
https://hf-mirror.com/datasets/Frankieeee21/spotify-analysis-dataset
下载链接
链接失效反馈官方服务:
资源简介:
---
license: mit
task_categories:
- feature-extraction
tags:
- music
- spotify
pretty_name: Spotify Analysis Dataset
size_categories:
- 100M<n<1B
---
# Spotify Analysis Dataset
This dataset contains the cleaned, processed, and intermediate data files used in the **Music Style and Popularity Analysis on Spotify** project. It includes raw audio feature tables, clustering outputs, classification-ready datasets, and the SQLite database used for K-Means clustering.
The dataset supports two main tasks:
1. **Music style clustering** based on audio attributes
2. **Popularity prediction** using engineered features and contextual metadata
Files include:
`Spotify_Dataset_V3.csv`, `spotify_cleaned_data.csv`, `kmeans_clustered_data.csv`, `spotify_database.db`, `spotify_data_V3.csv`, `spotify_dataset.csv`, `spotify_dataset_sample.csv`, `data_with_famous_artist.csv`.
This dataset is intended for academic analysis, machine learning experiments, and reproducibility of the associated project.
许可证:MIT 许可证
任务类别:
- 特征提取(feature-extraction)
标签:
- 音乐
- Spotify
美观名称:Spotify 分析数据集(Spotify Analysis Dataset)
规模类别:
- 1亿 < 数据规模 < 10亿
---
# Spotify 分析数据集(Spotify Analysis Dataset)
本数据集包含用于**Spotify 音乐风格与流行度分析(Music Style and Popularity Analysis on Spotify)**项目的已清洗、处理后的数据文件及中间态数据文件。其中涵盖原始音频特征表、聚类输出结果、可直接用于分类的数据集,以及用于K-Means聚类的SQLite数据库(SQLite)。
本数据集支持两项核心任务:
1. **基于音频属性的音乐风格聚类**
2. **利用工程化特征与上下文元数据开展流行度预测**
包含以下文件:
`Spotify_Dataset_V3.csv`、`spotify_cleaned_data.csv`、`kmeans_clustered_data.csv`、`spotify_database.db`、`spotify_data_V3.csv`、`spotify_dataset.csv`、`spotify_dataset_sample.csv`、`data_with_famous_artist.csv`。
本数据集旨在用于学术分析、机器学习实验以及关联项目的可复现性研究。
提供机构:
Frankieeee21



