BanglaMusicStylo: A Stylometric Dataset of Bangla Music Lyrics
Source: ieee-dataport.org (indexed 2025-03-22)
Download link:
https://ieee-dataport.org/open-access/banglamusicstylo-stylometric-dataset-bangla-music-lyrics
Description:
With the rapid growth of the Bangla music industry, a huge volume of Bangla songs is produced every day. An immense number of producers, lyricists, singers, and artists are involved in producing songs in different genres. Among the many genres of Bangla music, classical, folk, baul, modern music, Rabindra Sangeet, Nazrul Geeti, film music, rock music, and fusion music have gained the highest popularity. Lyricists express their feelings and views on a situation or subject through their writing. As a result, each lyricist draws on a distinctive vocabulary of thoughts when writing lyrics. In this paper, we present "BanglaMusicStylo", the first stylometric dataset of Bangla music lyrics. We collected the lyrics of 2824 Bangla songs by 211 lyricists in digital form. All the lyrics are stored in text format for further use. The dataset could be used for stylometric tasks such as authorship attribution, linguistic forensics, gender identification from textual data, Bangla music genre classification, vandalism detection, and emotion classification. Recognizing the significant research opportunities in this area, we have formalized this dataset for stylometric analysis.
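As a rough illustration of the kind of stylometric analysis the description mentions, the sketch below computes two common features, character n-gram counts and type-token ratio, and compares two texts by cosine similarity. It is a minimal sketch, not the dataset authors' method: the function names are hypothetical, the sample strings are placeholders rather than lyrics from the dataset, and nothing is assumed about how the dataset's files are organized.

```python
from collections import Counter


def char_ngrams(text: str, n: int = 3) -> Counter:
    """Count overlapping character n-grams after collapsing whitespace."""
    normalized = " ".join(text.split())
    return Counter(normalized[i:i + n] for i in range(len(normalized) - n + 1))


def type_token_ratio(text: str) -> float:
    """Distinct whitespace-separated tokens divided by total tokens."""
    tokens = text.split()
    return len(set(tokens)) / len(tokens) if tokens else 0.0


def cosine_similarity(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse count vectors."""
    dot = sum(count * b[gram] for gram, count in a.items())
    norm_a = sum(c * c for c in a.values()) ** 0.5
    norm_b = sum(c * c for c in b.values()) ** 0.5
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


if __name__ == "__main__":
    # Placeholder strings standing in for two lyrics; NOT taken from the dataset.
    lyric_a = "river river carry me home"
    lyric_b = "river carry my song home"
    print(type_token_ratio(lyric_a))
    print(cosine_similarity(char_ngrams(lyric_a), char_ngrams(lyric_b)))
```

Python strings are indexed by Unicode code point, so for Bangla text an n-gram here may split a consonant from its dependent vowel sign. Segmenting by grapheme cluster before counting would be a reasonable refinement.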
Provider:
IEEE Dataport



