five

Machine-Translated Texts with Human Annotations and Automatic Metric Evaluations

收藏
NIAID Data Ecosystem2026-05-02 收录
下载链接:
https://data.mendeley.com/datasets/zx3vt8r26k
下载链接
链接失效反馈
官方服务:
资源简介:
This dataset consists of English journalistic texts translated into Slovak using both statistical and neural machine translation systems. Each translated segment was evaluated by human annotators, and errors were recorded in binary format across five error categories. Additionally, the dataset includes the scores of 68 different automatic evaluation metrics, commonly used to assess machine translation quality. The data is divided into training and testing subsets, allowing for the development of models to predict error categories based on the automatic metric scores.
创建时间:
2025-07-11
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作