five

fahdmirzac/urdu_bollywood_songs_dataset

收藏
Hugging Face2024-03-16 更新2024-06-11 收录
下载链接:
https://hf-mirror.com/datasets/fahdmirzac/urdu_bollywood_songs_dataset
下载链接
链接失效反馈
官方服务:
资源简介:
# Bollywood-Inspired Dataset: Movies and Songs Created by Fahd Mirza = https://www.youtube.com/@fahdmirza ## Overview This dataset is a creative collection of fictional Bollywood movie titles paired with equally fictional song lyrics. Inspired by the rich tradition of Bollywood cinema, where music plays a pivotal role in storytelling, this dataset aims to provide a unique resource for exploring the interplay between movie themes and their musical expressions. ## Dataset Structure The dataset is structured in a JSON Lines format (`.jsonl`), where each line is a separate JSON object. Each object contains the following fields: - `instruction`: The English title of a fictional Bollywood movie. - `input`: An empty field, reserved for future use or extensions. - `output`: Original song lyrics inspired by the movie title, provided in English. - `urdu_instruction`: The title of the fictional Bollywood movie, translated into Urdu. - `urdu_input`: An empty field in Urdu, paralleling the `input` field. - `urdu_output`: Song lyrics corresponding to the `output` field, translated into Urdu. ### Example Entry ```json { "instruction": "Heart's Secret", "input": "", "output": "Unveiling the heart's secrets, immersed in love", "urdu_instruction": "راز دل", "urdu_input": "", "urdu_output": "دل کے راز کھولتے ہیں، پیار میں ڈھولتے ہیں" }
提供机构:
fahdmirzac
原始信息汇总

数据集概述

数据集名称

Bollywood-Inspired Dataset: Movies and Songs

数据集描述

该数据集包含虚构的宝莱坞电影标题与相应的虚构歌曲歌词。灵感来源于宝莱坞电影中音乐在叙事中的核心作用,旨在提供一个独特的资源,探索电影主题与其音乐表达之间的互动。

数据集结构

数据集采用JSON Lines格式(.jsonl),每行是一个独立的JSON对象,包含以下字段:

  • instruction: 虚构宝莱坞电影的英文标题。
  • input: 保留字段,目前为空,用于未来扩展。
  • output: 受电影标题启发的原创英文歌词。
  • urdu_instruction: 虚构宝莱坞电影的乌尔都语标题。
  • urdu_input: 乌尔都语的保留字段,与input字段平行。
  • urdu_output: 与output字段对应的乌尔都语歌词。

示例条目

json { "instruction": "Hearts Secret", "input": "", "output": "Unveiling the hearts secrets, immersed in love", "urdu_instruction": "راز دل", "urdu_input": "", "urdu_output": "دل کے راز کھولتے ہیں، پیار میں ڈھولتے ہیں" }

5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作