five

TamilThagaval/pathinen_keezhkanakku-iynthinaiezhupadhu

收藏
Hugging Face2025-11-30 更新2026-04-05 收录
下载链接:
https://hf-mirror.com/datasets/TamilThagaval/pathinen_keezhkanakku-iynthinaiezhupadhu
下载链接
链接失效反馈
官方服务:
资源简介:
--- license: mit task_categories: - question-answering language: - ta tags: - art pretty_name: iynthinaiezhupadhu size_categories: - n<1K --- ## 📝 Dataset Description Ainthinai Ezhupathu (ஐந்திணை எழுபது) is a classical Tamil poetic work belonging to the Pathinen Keezhkanakku literature collection. It contains 70 poems (ஏழுபது பாடல்கள்) arranged across the five landscapes (ஐந்திணை) — each thinai having 14 poems. The thinais included are: குறிஞ்சி – Mountain region முல்லை – Forest region மருதம் – Farmland region நெய்தல் – Seashore region பாலை – Desert region The book also begins with கடவுள் வாழ்த்து (Invocation). This work was authored by மூவாதியார். This dataset contains the poems, their explanations, and thinai classification, structured for NLP research, Tamil literature studies, and LLM fine-tuning. ## 📂 Dataset Structure Each record includes: thinai – The landscape category number – Poem number poem – Original Tamil poem explanation – Tamil explanation of the poem karuthu (optional) – Moral / central idea (if available) ## 🎯 Intended Use This dataset can be used for: Thinai classification Tamil poem explanation generation Translation (TA ↔ EN) Semantic similarity Text clustering Literary analysis Digital preservation Training custom models on Tamil classical literature ## 📊 Data Fields thinai: string poems: list number: int poem: string explanation: string karuthu: string (optional) ## 🛠️ Dataset Creation Poems were collected and manually structured. Unicode normalization applied. Explanations included from freely available Tamil sources. Clean JSON format for NLP-friendly access. ## ** **📜 License Feel free to use, modify, and distribute with proper attribution. ## ** **🙏 Acknowledgements Curated by the TamilThagaval community on Hugging Face. Dedicated to preserving Tamil classical literature for future research.
提供机构:
TamilThagaval
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作