Music Informatics for Radio Across the GlobE (MIRAGE) MetaCorpus (v0.2)

Indexed from NIAID Data Ecosystem on 2026-05-02
Download link:
https://zenodo.org/record/12786201
Resource description:
Overview

Welcome to the Music Informatics for Radio Across the GlobE (MIRAGE) MetaCorpus. The current (v0.2) development release consists of metadata (e.g., artist name, track title) and musicological features (e.g., instrument list, voice type, tempo) for 1 million events streaming on 10,000 internet radio stations across the globe, with 100 events from each station.

Users who wish to access, interact with, and/or export metadata from the MIRAGE-MetaCorpus may also visit the MIRAGE online dashboard at the following URL: https://pearl-laboratory.github.io/mirage-mc/

Attribution

The current MIRAGE-MetaCorpus is available under a CC4 license. Users may cite the dataset here:

Sears, David R.W. “Music Informatics for Radio Across the Globe (MIRAGE) Metacorpus -- 2024”. Zenodo, July 19, 2024. https://doi.org/10.5281/zenodo.12786202.

Users accessing the MIRAGE-MetaCorpus using the online dashboard should also cite the following ISMIR paper:

Ngan V.T. Nguyen, Elizabeth A.M. Acosta, Tommy Dang, and David R.W. Sears. "Exploring Internet Radio Across the Globe with the MIRAGE Online Dashboard," in Proceedings of the 25th International Society for Music Information Retrieval Conference (San Francisco, CA, 2024).

Data Sources

This repository of the MIRAGE-MetaCorpus contains 81 metadata variables from the following open-access sources:

Radio Garden (RG) -- https://radio.garden
Natural Earth map data set (NE) -- https://www.naturalearthdata.com/
Internet Radio Station Stream Encoder (SE)
Annotator Review (AR)
Monitoring/Matching Algorithm (MA)
WikiData (WD) -- https://www.wikidata.org
MusicBrainz (MB) -- https://musicbrainz.org/

Each event also includes attribution metadata from the following commercial sources:

Spotify (SP) -- https://open.spotify.com/
Musixmatch (MX) -- https://www.musixmatch.com/
YouTube (YT) -- https://www.youtube.com/
Genius (GE) -- https://genius.com/
AZlyrics (AZ) -- https://www.azlyrics.com/

Note that users may examine an additional 19 metadata variables on the MIRAGE online dashboard that were obtained from the Spotify API.

Data Sets

The metadata reflect information about each event's location (e.g., city, country), station (name, format, URL), event (id, local time at station, etc.), artist (name, voice type, etc.), and track (e.g., title, year of release). For that reason, the MIRAGE-MetaCorpus includes the following datasets:

MIRAGE.csv -- the complete metacorpus (1 million)
events.csv -- all event-level metadata (1 million)
tracks.csv -- all track-level metadata (414,886)
artists.csv -- all artist-level metadata (259,783)
stations.csv -- all station-level metadata (10,000)
locations.csv -- all location-level metadata (4,324)

A subset of the MIRAGE-MetaCorpus is also available for events with metadata from online music libraries that reliably matched the event's description in the radio station's stream encoder:

MIRAGE_reliable.csv (473,850)
events_reliable.csv (473,850)
tracks_reliable.csv (204,969)
artists_reliable.csv (80,005)
stations_reliable.csv (9,284)
locations_reliable.csv (4,142)

Contact

If you are a copyright owner for any of the metadata that appears in the MIRAGE-MetaCorpus and would like us to remove your metadata, please contact the developer team at the following email address: miragedashboard@gmail.com
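The row counts listed under Data Sets can serve as a quick sanity check on a downloaded copy. The following is a minimal sketch, not part of the MIRAGE distribution: the file names and counts are taken from the description above, while the helper names (`reliable_share`, `count_rows`, `check_download`), the UTF-8 encoding, and the presence of a single header row in each CSV file are assumptions that should be confirmed against the actual files.

```python
import csv
from pathlib import Path

# Row counts as stated in the dataset description: (full corpus, reliable subset).
EXPECTED_ROWS = {
    "MIRAGE": (1_000_000, 473_850),
    "events": (1_000_000, 473_850),
    "tracks": (414_886, 204_969),
    "artists": (259_783, 80_005),
    "stations": (10_000, 9_284),
    "locations": (4_324, 4_142),
}


def reliable_share(table):
    """Fraction of a table's rows that also appear in the reliable subset."""
    full, reliable = EXPECTED_ROWS[table]
    return reliable / full


def count_rows(path):
    """Count data rows in a CSV file.

    Assumption (unverified): the file is UTF-8 and starts with one header row.
    """
    with open(path, newline="", encoding="utf-8") as handle:
        reader = csv.reader(handle)
        next(reader, None)  # skip the assumed header row
        return sum(1 for _ in reader)


def check_download(directory):
    """Map each listed file found in `directory` to (actual, expected) rows."""
    results = {}
    for table, (full, reliable) in EXPECTED_ROWS.items():
        candidates = ((f"{table}.csv", full), (f"{table}_reliable.csv", reliable))
        for name, expected in candidates:
            path = Path(directory) / name
            if path.exists():
                results[name] = (count_rows(path), expected)
    return results


if __name__ == "__main__":
    for table in EXPECTED_ROWS:
        print(f"{table}: {reliable_share(table):.1%} of rows are in the reliable subset")
```

By the stated counts, the reliable subset retains 473,850 of the 1 million events (about 47%) while still covering 9,284 of the 10,000 stations. Calling `check_download` on the folder that holds the CSV files returns the actual and expected row count for each file it finds, so any mismatch points to an incomplete download or to a wrong assumption about the header row.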
Created:
2024-11-07