scb-mt-en-th-2020
收藏arXiv2020-07-07 更新2024-08-06 收录
下载链接:
http://arxiv.org/abs/2007.03541v1
下载链接
链接失效反馈官方服务:
资源简介:
构建了一个大规模的英语-泰语机器翻译数据集,包含超过100万对段落,数据来源于新闻、维基百科文章、短信、任务型对话、网络爬虫数据和政府文件。
A large-scale English-Thai machine translation dataset was constructed, containing over one million paragraph pairs. The dataset is sourced from news articles, Wikipedia articles, short message service (SMS) texts, task-oriented dialogues, web-crawled data, and government documents.
创建时间:
2020-07-07



