five

pythainlp/thai-local-language-translation-dataset

收藏
Hugging Face2024-08-09 更新2025-04-12 收录
下载链接:
https://hf-mirror.com/datasets/pythainlp/thai-local-language-translation-dataset
下载链接
链接失效反馈
官方服务:
资源简介:
--- license: cc-by-sa-4.0 task_categories: - translation language: - th size_categories: - 10K<n<100K --- # Thai Local Language Translation Dataset Thai Local Language Translation Dataset is a translation dataset for translate Thai Local Language to Thai Central Language. We create the dataset from [Thai Dialect Corpus (Thai dialects ASR corpus)](https://github.com/SLSCU/thai-dialect-corpus). We select train set only from Thai Dialect Corpus. The dataset support Khummuang, Korat, and Pattani. ## Reference Suwanbandit, A., Naowarat, B., Sangpetch, O., Chuangsuwanich, E. (2023) Thai Dialect Corpus and Transfer-based Curriculum Learning Investigation for Dialect Automatic Speech Recognition. Proc. INTERSPEECH 2023, 4069-4073, doi: [10.21437/Interspeech.2023-1828](https://doi.org/10.21437/Interspeech.2023-1828) ## Citation If you use `Thai Local Language Translation Dataset` in your project or publication, please cite the dataset as follows: > Phatthiyaphaibun, W. (2024). Thai Local Language Translation Dataset [Data set]. Zenodo. https://doi.org/10.5281/zenodo.13283454 or ```bib @dataset{phatthiyaphaibun_2024_13283454, author = {Phatthiyaphaibun, Wannaphong}, title = {Thai Local Language Translation Dataset}, month = aug, year = 2024, publisher = {Zenodo}, doi = {10.5281/zenodo.13283454}, url = {https://doi.org/10.5281/zenodo.13283454} } ```
提供机构:
pythainlp
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作