five

Machine-Translated Texts with Human Annotations and Automatic Metric Evaluations

收藏
Mendeley Data2026-04-09 收录
下载链接:
https://data.mendeley.com/datasets/zx3vt8r26k/1
下载链接
链接失效反馈
官方服务:
资源简介:
This dataset consists of English journalistic texts translated into Slovak using both statistical and neural machine translation systems. Each translated segment was evaluated by human annotators, and errors were recorded in binary format across five error categories. Additionally, the dataset includes the scores of 68 different automatic evaluation metrics, commonly used to assess machine translation quality. The data is divided into training and testing subsets, allowing for the development of models to predict error categories based on the automatic metric scores.
提供机构:
Univerzita Konstantina Filozofa v Nitre; Univerzita Pardubice
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作