TamilThagaval/avvaiyar-nalvazhi
Source: Hugging Face · Updated 2026-04-13 · Indexed 2026-05-10
Download link:
https://hf-mirror.com/datasets/TamilThagaval/avvaiyar-nalvazhi
Resource description:
---
license: mit
task_categories:
- translation
- text-generation
- text-classification
language:
- ta
tags:
- art
pretty_name: Nalvazhi
size_categories:
- n<1K
---
# Dataset Card for Nalvazhi (நல்வழி)
## Dataset Description
- **Author:** Avvaiyar (ஔவையார்)
- **Language:** Tamil (தமிழ்)
- **Total Verses:** 41 (Including Kadavul Vazhthu)
- **Format:** JSON / CSV
### Summary
**Nalvazhi (நல்வழி)**, meaning "The Good Path," is a seminal work of Tamil ethical literature composed by the poetess **Avvaiyar**. The text consists of 41 Venpa verses providing guidance on virtue, karma, and the transient nature of life. This dataset pairs the original classical Tamil poetic stanzas with modern Tamil prose explanations.
---
## Dataset Structure
The dataset is a flat collection of 41 records. Each record pairs one stanza with its prose explanation.
### Data Fields
* `verse_no`: (Integer) The index of the poem (0 for the Kadavul Vazhthu invocation, 1–40 for the main text).
* `verse_text`: (String) The original 4-line Venpa verse in Tamil.
* `explanation`: (String) Modern Tamil prose explaining the moral and meaning of the verse.
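
The field list above can be checked mechanically before the data is used. The following is a minimal sketch, assuming the JSON export is a top-level array of objects keyed by the three field names; the card does not state the file layout, so that shape, the `Verse` class, and the `parse_verses` helper are all hypothetical. The sample record reuses the truncated strings from this card as-is.

```python
import json
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class Verse:
    """One record, mirroring the three documented fields."""
    verse_no: int
    verse_text: str
    explanation: str


def parse_verses(raw_json: str) -> List[Verse]:
    """Parse a JSON array of records and enforce the documented constraints.

    Assumption: the file is a JSON array of objects; adjust if the real
    export is JSON Lines or nested differently.
    """
    verses = []
    for rec in json.loads(raw_json):
        verse = Verse(
            verse_no=int(rec["verse_no"]),
            verse_text=str(rec["verse_text"]).strip(),
            explanation=str(rec["explanation"]).strip(),
        )
        # verse_no is documented as 0 (invocation) or 1-40 (main text).
        if not 0 <= verse.verse_no <= 40:
            raise ValueError(f"verse_no out of range: {verse.verse_no}")
        if not verse.verse_text or not verse.explanation:
            raise ValueError(f"empty text in verse {verse.verse_no}")
        verses.append(verse)

    numbers = [v.verse_no for v in verses]
    if len(set(numbers)) != len(numbers):
        raise ValueError("duplicate verse_no values")
    return sorted(verses, key=lambda v: v.verse_no)


# Sample input: the truncated example shown in this card, not a full verse.
SAMPLE_JSON = json.dumps(
    [
        {
            "verse_no": 1,
            "verse_text": "புண்ணியம் ஆம் பாவம் போம் போன நாள் செய்த அவை...",
            "explanation": "மண்ணில் பிறந்தவர் வைத்திருக்கும் பொருள் போன பிறவியில் செய்த புண்ணியம் பாவம் என்னும் இரண்டே...",
        }
    ],
    ensure_ascii=False,
)

if __name__ == "__main__":
    for verse in parse_verses(SAMPLE_JSON):
        print(verse.verse_no, verse.verse_text)
```

On the full file, a further check that exactly 41 records come back would confirm the verse count stated in the description.
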
### Data Example
| field | value |
| :--- | :--- |
| **verse_no** | 1 |
| **verse_text** | புண்ணியம் ஆம் பாவம் போம் போன நாள் செய்த அவை... |
| **explanation** | மண்ணில் பிறந்தவர் வைத்திருக்கும் பொருள் போன பிறவியில் செய்த புண்ணியம் பாவம் என்னும் இரண்டே... |
---
## Dataset Creation
### Curation Rationale
This dataset was curated to bridge the gap between **Senthamizh** (classical literary Tamil) and the modern Tamil prose that the card refers to as **Koduntamil**. It is designed for:
1. **Fine-tuning LLMs** for low-resource classical language understanding.
2. **Poetry-to-Prose Translation**: Training models to simplify complex poetic structures.
3. **Sentiment & Ethical Analysis**: Mapping ancient ethical frameworks to modern contexts.
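
For the poetry-to-prose use case, each record can be turned into a prompt/completion pair. The sketch below is one possible formatting, not a method prescribed by the dataset authors: the `PROMPT_TEMPLATE` wording, the `build_pairs` helper, and the choice to skip the invocation (`verse_no` 0) by default are all assumptions. The verse 0 entry in the sample is a placeholder, and the verse 1 entry reuses the truncated strings from this card.

```python
from typing import Dict, Iterable, List

# Hypothetical instruction wording; tune it for the model being fine-tuned.
PROMPT_TEMPLATE = "Explain the following Tamil verse in modern Tamil prose:\n{verse}"


def build_pairs(
    records: Iterable[Dict[str, object]],
    include_invocation: bool = False,
) -> List[Dict[str, str]]:
    """Turn verse records into prompt/completion pairs.

    Records with verse_no == 0 (the invocation) are skipped unless
    include_invocation is True.
    """
    pairs = []
    for rec in records:
        if rec["verse_no"] == 0 and not include_invocation:
            continue
        pairs.append(
            {
                "prompt": PROMPT_TEMPLATE.format(verse=rec["verse_text"]),
                "completion": str(rec["explanation"]),
            }
        )
    return pairs


# Illustrative records only; the verse 0 strings are placeholders.
SAMPLE_RECORDS = [
    {
        "verse_no": 0,
        "verse_text": "<invocation verse placeholder>",
        "explanation": "<invocation explanation placeholder>",
    },
    {
        "verse_no": 1,
        "verse_text": "புண்ணியம் ஆம் பாவம் போம் போன நாள் செய்த அவை...",
        "explanation": "மண்ணில் பிறந்தவர் வைத்திருக்கும் பொருள் போன பிறவியில் செய்த புண்ணியம் பாவம் என்னும் இரண்டே...",
    },
]

if __name__ == "__main__":
    for pair in build_pairs(SAMPLE_RECORDS):
        print(pair["prompt"])
        print(pair["completion"])
```

With only 41 records, pairs like these are more realistic as an evaluation set or a small supplement to a larger corpus than as a standalone training set.
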
### Source Data
The text is sourced from public-domain archives of classical Tamil literature, specifically the works attributed to the medieval Avvaiyar.
---
## About the Author: Avvaiyar
Avvaiyar is one of the most celebrated figures in Tamil history. She is known for her ability to condense profound philosophical truths into simple, rhythmic verses. Her works, including *Aathichoodi* and *Konrai Venthan*, remain a core part of the Tamil school curriculum to this day.
## Citation
If you use this dataset in your research or applications, please cite the literary origin:
- **Title:** Nalvazhi (நல்வழி)
- **Original Author:** Avvaiyar (ஔவையார்)
- **Dataset Format:** Tamil Poetry-Prose Parallel Corpus
Provider:
TamilThagaval



