iapp/rag_thai_laws
Source: Hugging Face. Updated 2024-11-05. Indexed 2025-04-12.
Download link:
https://hf-mirror.com/datasets/iapp/rag_thai_laws
Resource description:
---
license: mit
datasets:
- iapp/thai_laws
task_categories:
- text-generation
language:
- th
size_categories:
- 10K<n<100K
---
# Thai Laws Dataset
This dataset contains Thai law texts from the Office of the Council of State, Thailand.
The dataset has been cleaned and processed by the iApp Team to improve data quality and accessibility. The cleaning process included:
- Converting system IDs to integer format
- Removing leading/trailing whitespace from titles and text
- Normalizing newlines to maintain consistent formatting
- Removing excessive blank lines

The cleaned dataset is now available on Hugging Face for easy access and integration into NLP projects.
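
The cleaning steps listed above could be sketched roughly as follows. This is an illustrative reconstruction, not the iApp Team's actual script: the field names `sysid`, `title`, and `txt`, the helper name `clean_record`, and the choice to collapse runs of blank lines down to a single blank line are all assumptions, since this card does not document them.

```python
import re


def clean_record(record: dict) -> dict:
    """Apply the four described cleaning steps to one row.

    Field names (`sysid`, `title`, `txt`) are hypothetical; adjust them
    to match the real column names of the dataset.
    """
    # 1. Convert the system ID to an integer.
    sysid = int(str(record["sysid"]).strip())

    # 2. Remove leading/trailing whitespace from the title.
    title = record["title"].strip()

    # 3. Normalize Windows (\r\n) and bare \r newlines to \n.
    text = record["txt"].replace("\r\n", "\n").replace("\r", "\n")

    # 4. Collapse two or more consecutive blank lines (which may contain
    #    spaces or tabs) into one blank line. The threshold is an assumption.
    text = re.sub(r"\n(?:[ \t]*\n){2,}", "\n\n", text)

    # 2 (continued). Remove leading/trailing whitespace from the text.
    text = text.strip()

    return {"sysid": sysid, "title": title, "txt": text}


raw = {
    "sysid": " 301234 ",
    "title": "  พระราชบัญญัติ  ",
    "txt": "\r\nมาตรา ๑\r\n\r\n\r\n\r\nมาตรา ๒  \n",
}
cleaned = clean_record(raw)
print(repr(cleaned["txt"]))
```

A per-record function like this can be applied row by row with whatever data-loading tool is in use, which keeps the cleaning logic easy to test in isolation.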
## Original Dataset Details
- Original source: [PyThaiNLP/thai-law v0.2](https://github.com/PyThaiNLP/thai-law/releases/tag/v0.2)
- Data provider: [Office of the Council of State, Thailand](https://www.krisdika.go.th/)
- Dataset size: 42,755 rows
- License: Public Domain
- Language: Thai

Provider: iapp



