
AMBILE_Shah_Jo_Risalo_Labeled

Source: DataCite Commons. Updated: 2025-09-08. Indexed: 2026-04-25.
Download link:
https://figshare.com/articles/dataset/AMBILE_Shah_Jo_Risalo_Labeled/30073474
Description:
<b>AMBILE Shah Jo Risalo</b><br>
<b>Developed by:</b><br>Abdul Majid Bhurgri Institute of Language Engineering (AMBILE), Hyderabad<br>Under the administrative control of the <b>Culture, Tourism, Antiquities &amp; Archives Department, Government of Sindh</b><br>
<br>
<b>Dataset Overview</b><br>
The "Shah Jo Risalo" dataset is a linguistic and literary resource containing <b>4,767 Sindhi poetic verses</b> drawn from the <b>30 traditional Surs</b> (sections) of the magnum opus of <b>Shah Abdul Latif Bhittai</b>. Each verse, written in the Perso-Arabic Sindhi script (the <b>Sindhi Arabic Perso</b> field), is paired with its <b>Roman Script</b> and <b>Devanagri Script</b> renderings, a <b>Sindhi</b> explanation, and translations into <b>English</b> (by <b>Amar Fayaz Buriro</b>), <b>Urdu</b> (by <b>Agha Saleem</b>), and <b>Punjabi</b> (by <b>Kartar Singh Arsh</b>). The dataset gives access to the philosophical, spiritual, and cultural dimensions of the poetry and is aimed at researchers, linguists, educators, and developers working on Sindhi literature and AI/NLP applications.<br>
<br>
<b>Dataset Features</b><br>
<b>Total Verses</b>: 4,767 poetic lines from <b>30 classical Surs</b><br>
<b>Language</b>: Clean <b>Sindhi script</b> in <b>Unicode format</b><br>
<b>File Format</b>: CSV file titled <code>Bhittaipedia Risalo -(25-08-25).csv</code><br>
<br>
<b>CSV Structure</b><br>
The dataset is organized into the following fields:<br>
<b>Row_ID</b>: Unique identifier for each row<br>
<b>Melody Number</b>: Identifier for the melody associated with the verse<br>
<b>Melody (سر)</b>: Name of the Sur (chapter)<br>
<b>Chapter Number</b>: Verse number within the Sur<br>
<b>Chapter (داستان)</b>: Subsection within the Sur<br>
<b>Type</b>: Type or category of the verse<br>
<b>Bait / Vaayi Number</b>: Number associated with the poetic form<br>
<b>Sindhi Arabic Perso</b>: Original Sindhi poetic verse<br>
<b>Roman Script</b>: Sindhi verse in Roman script<br>
<b>Devanagri Script</b>: Sindhi verse in Devanagari script<br>
<b>Explanation</b>: Sindhi interpretation of the verse<br>
<b>English Translation</b>: Translated by <b>Amar Fayaz Buriro</b><br>
<b>Urdu Translation</b>: Translated by <b>Agha Saleem</b><br>
<b>Punjabi Translation</b>: Translated by <b>Kartar Singh Arsh</b><br>
<b>Keywords</b>: Search-optimized terms for easier data retrieval<br>
<br>
<b>Applications</b><br>
Possible applications include, but are not limited to:<br>
<b>Natural Language Processing (NLP)</b> research on the Sindhi language<br>
Development of <b>AI-powered Sindhi chatbots</b> and conversational agents<br>
Creation of <b>educational tools</b> for literature learning<br>
<b>Text-to-Speech (TTS)</b> system training<br>
<b>Verse classification</b> and <b>sentiment analysis</b> projects<br>
<b>Digital preservation</b> and promotion of Sindhi literary heritage<br>
<br>
<b>Data Source</b><br>
The dataset is sourced from the <b>AMBILE Bhittaipedia project</b>, which aims to digitize and preserve the cultural heritage of Sindhi literature.<br>
<br>
<b>How to Use</b><br>
<b>Clone the repository or download the CSV file</b>.<br>
Open the CSV file using <b>Python</b> or <b>Excel</b>:<br>
<pre># Load the CSV into a DataFrame (pandas reads it as UTF-8 by default)<br>import pandas as pd<br>df = pd.read_csv("Bhittaipedia Risalo -(25-08-25).csv")<br># Show the first five rows to check that the columns loaded correctly<br>print(df.head())</pre>
<b>License</b><br>
This dataset is released under the <b>Creative Commons Attribution-NonCommercial 4.0 License</b>. It is intended for educational and research purposes only.<br>
<br>
<b>Acknowledgments</b><br>
Special thanks to the <b>AMBILE team</b> for their involvement in data compilation and cleaning.<br>
<br>
<b>Contact</b><br>
For queries, collaboration opportunities, or contributions, please contact:<br>
<b>Email</b>: datasets@sindh.ai
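The field list in the description suggests a simple way to explore the file: group rows by Sur and pull out one translation column. The following is a minimal sketch of that idea, not the publisher's own tooling. It uses Python's standard-library csv module rather than pandas so that it runs without extra dependencies. It assumes the CSV header uses the field names exactly as listed in the description, and that the Type field holds values such as "Bait" and "Vaayi"; both are assumptions. The sample rows are placeholders, not actual verses.

```python
import csv
import io
from collections import Counter

# Field names as listed in the dataset description; the exact header
# spelling in the real CSV is an assumption.
SUR_COL = "Melody (سر)"
TYPE_COL = "Type"
ENGLISH_COL = "English Translation"

# Placeholder rows standing in for the real file (NOT actual verses).
# The Type values "Bait" and "Vaayi" are assumed, not confirmed.
SAMPLE_CSV = """Row_ID,Melody (سر),Type,English Translation
1,Sur A,Bait,placeholder translation one
2,Sur A,Vaayi,placeholder translation two
3,Sur B,Bait,placeholder translation three
"""


def load_rows(handle):
    """Read CSV rows into a list of dicts keyed by column name."""
    return list(csv.DictReader(handle))


def verses_per_sur(rows):
    """Count how many rows belong to each Sur."""
    return Counter(row[SUR_COL] for row in rows)


def translations_for(rows, sur_name, verse_type=None):
    """Return English translations for one Sur, optionally filtered by Type."""
    return [
        row[ENGLISH_COL]
        for row in rows
        if row[SUR_COL] == sur_name
        and (verse_type is None or row[TYPE_COL] == verse_type)
    ]


rows = load_rows(io.StringIO(SAMPLE_CSV))
counts = verses_per_sur(rows)
bait_lines = translations_for(rows, "Sur A", verse_type="Bait")
print(counts)
print(bait_lines)
```

To run this against the downloaded file, replace the in-memory sample with open("Bhittaipedia Risalo -(25-08-25).csv", encoding="utf-8", newline=""). If the first column name comes back with a stray leading character, the file probably starts with a byte-order mark, and encoding="utf-8-sig" may help.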
Provider:
figshare
Created:
2025-09-08