five

TamilThagaval/avvaiyar-nalvazhi

收藏
Hugging Face2026-04-13 更新2026-05-10 收录
下载链接:
https://hf-mirror.com/datasets/TamilThagaval/avvaiyar-nalvazhi
下载链接
链接失效反馈
官方服务:
资源简介:
--- license: mit task_categories: - translation - text-generation - text-classification language: - ta tags: - art pretty_name: Nalvazhi size_categories: - 10K<n<100K --- # Dataset Card for Nalvazhi (நல்வழி) ## Dataset Description - **Author:** Avvaiyar (ஔவையார்) - **Language:** Tamil (தமிழ்) - **Total Verses:** 41 (Including Kadavul Vazhthu) - **Format:** JSON / CSV ### Summary **Nalvazhi (நல்வழி)**, meaning "The Good Path," is a seminal work of Tamil ethical literature composed by the poetess **Avvaiyar**. The text consists of 41 Venpa verses providing guidance on virtue, karma, and the transient nature of life. This dataset pairs the original classical Tamil poetic stanzas with modern Tamil prose explanations. --- ## Dataset Structure The dataset is organized as a collection of literary pairings. Each entry represents a specific stanza and its corresponding interpretation. ### Data Fields * `verse_no`: (Integer) The index of the poem (0 for Preface, 1–40 for main text). * `verse_text`: (String) The original 4-line Venpa verse in Tamil. * `explanation`: (String) Modern Tamil prose explaining the moral and meaning of the verse. ### Data Example | field | value | | :--- | :--- | | **verse_no** | 1 | | **verse_text** | புண்ணியம் ஆம் பாவம் போம் போன நாள் செய்த அவை... | | **explanation** | மண்ணில் பிறந்தவர் வைத்திருக்கும் பொருள் போன பிறவியில் செய்த புண்ணியம் பாவம் என்னும் இரண்டே... | --- ## Dataset Creation ### Curation Rationale This dataset was curated to bridge the gap between **Senthamizh (Classical Tamil)** and **Koduntamil (Modern Tamil)**. It is designed for: 1. **Fine-tuning LLMs** for low-resource classical language understanding. 2. **Poetry-to-Prose Translation**: Training models to simplify complex poetic structures. 3. **Sentiment & Ethical Analysis**: Mapping ancient ethical frameworks to modern contexts. ### Source Data The text is sourced from public domain classical Tamil literary archives, specifically focused on the works of medieval Avvaiyar. --- ## About the Author: Avvaiyar Avvaiyar is one of the most celebrated figures in Tamil history. She is known for her ability to condense profound philosophical truths into simple, rhythmic verses. Her works, including *Aathichoodi* and *Konrai Venthan*, remain a core part of the Tamil school curriculum to this day. ## Citation If you use this dataset in your research or applications, please cite the literary origin: **Title:** Nalvazhi (நல்வழி) **Original Author:** Avvaiyar (ஔவையார்) **Dataset Format:** Tamil Poetry-Prose Parallel Corpus
提供机构:
TamilThagaval
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作