Shah Abdul Latif Bhittai Poetry Dataset
收藏IEEE2026-04-17 收录
下载链接:
https://ieee-dataport.org/documents/shah-abdul-latif-bhittai-poetry-dataset
下载链接
链接失效反馈官方服务:
资源简介:
Shah Abdul Latif Bhittai\u2019s Poetry Dataset \u201cShah Jo Risalo\u201dDeveloped by:Abdul Majid Bhurgri Institute of Language Engineering (AMBILE), HyderabadUnder the administrative control of the Culture, Tourism, Antiquities & Archives Department, Government of Sindh.The \u201cShah Jo Risalo\u201d Dataset is a rich linguistic and literary resource comprising 43,779 Sindhi poetic verses extracted from the 30 traditional Surs of Shah Abdul Latif Bhittai\u2019s magnum opus. Each verse is paired with a Sindhi-language explanation, primarily derived from the authoritative interpretations of Dr. Nabi Bakhsh Baloch.This dataset provides profound insights into the philosophical, spiritual, and cultural themes embedded in the poetry, making it an invaluable asset for Sindhi literature researchers, linguists, educators, and AI\/NLP developers.Dataset FeaturesTotal Verses: 43,779 poetic lines from 30 classical SursLanguage: Clean Sindhi script in Unicode formatFile Format: CSV file titled \u201cShah Jo Risalo labeled.csv\u201dCSV Structuremelody (\u0633\u0631): Name of the Surchapter (\u062f\u0627\u0633\u062a\u0627\u0646): Subsection within the Surchapter_verse_number: Verse number within the dastanpoetry_text (\u0628\u064a\u062a): Original Sindhi poetic verseexplanation (\u0648\u0636\u0627\u062d\u062a): Sindhi interpretation of the versekeywords: Search-optimized termscompiler_name: Name of the compiler of various versions of Shah Jo Risalo.Compilers:Allama I. I. QaziBanhoo Khan ShaikhDr. Nabi Bux 28 Surs ExplanationDr. Ernest TrumppGM ShahwaniGurbakh ShaniMirza Qaleech BaigTara Chand ShokeeramUsman Ali AnsariUsman DiplaiAdwani KaliyanDr. Nabi Bux Baloch (1165\u20131207 Hijri)Dr. Nabi Bux Baloch (1269 Hijri and 1270 Hijri)Dr. Nabi Bux Baloch (British Museum)ApplicationsNatural Language Processing (NLP) research in SindhiAI-based Sindhi chatbots and conversational agentsDevelopment of educational tools for literature learningText-to-Speech (TTS) system trainingVerse classification and sentiment analysis tasksDigital preservation and promotion of Sindhi literary heritageData SourceThe dataset is sourced from the AMBILE Bhittaipedia projectLicenseThis dataset is shared under the Creative Commons Attribution-Noncommercial 4.0 International License, allowing its use for educational and research purposes.AcknowledgmentsSpecial thanks to the AMBILE team for their involvement in data compilation and cleaning.ContactFor inquiries, collaborations, or contributions:technicalteam@ambile.pk
提供机构:
Abdul Majid Bhurgri Institute of Language Engineering Hyderabad



