five

Extrinsic Evaluation Dataset for Automatic Sentence Alignment

收藏
科学数据银行2023-11-10 更新2026-04-23 收录
下载链接:
https://www.scidb.cn/detail?dataSetId=aec0de86890e4614afdba0b02510752e
下载链接
链接失效反馈
官方服务:
资源简介:
This dataset consists of Chinese subtitles and their Vietnamese translations downloaded from www.iq.com, which can be used for the extrinsic evaluation of automatic sentence alignment. There are three sub-corpora in the dataset: the training data SUB-Train and the validation and test data Sub-Dev and Sub-Test for the training and evaluation of machine translation systems.
提供机构:
Yanshuan University
创建时间:
2023-07-19
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作