Machine-Translated Texts with Human Annotations and Automatic Metric Evaluations
Source: Mendeley Data. Indexed: 2026-04-09
Download link:
https://data.mendeley.com/datasets/zx3vt8r26k/1
Description:
This dataset consists of English journalistic texts translated into Slovak using both statistical and neural machine translation systems. Each translated segment was evaluated by human annotators, and errors were recorded in binary format across five error categories. Additionally, the dataset includes the scores of 68 different automatic evaluation metrics, commonly used to assess machine translation quality. The data is divided into training and testing subsets, allowing for the development of models to predict error categories based on the automatic metric scores.
Provided by:
Univerzita Konstantina Filozofa v Nitre; Univerzita Pardubice
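
Example (illustrative only):
The description mentions predicting error categories from automatic metric scores. The sketch below shows one very simple way such a multi-label task could be set up, using only the Python standard library: one single-metric threshold rule (a decision stump) per error category. The column layout, the toy values, and all function names are assumptions made for illustration. They are not taken from the dataset, which has 68 metric scores and five error categories per segment, and this is not the method used by the dataset's authors.

```python
from typing import List, Tuple

# Hypothetical layout: one row per translated segment.
# X: automatic metric scores (3 toy metrics here; the real dataset lists 68).
# Y: binary error flags (2 toy categories here; the real dataset lists 5).
Stump = Tuple[int, float, int]  # (metric index, threshold, polarity)


def predict_one(stump: Stump, row: List[float]) -> int:
    """Apply a single-metric threshold rule to one segment."""
    j, t, polarity = stump
    if polarity == 1:
        return int(row[j] < t)   # flag an error when the score is below t
    return int(row[j] >= t)      # flag an error when the score is at or above t


def fit_stump(X: List[List[float]], y: List[int]) -> Stump:
    """Exhaustively pick the rule with the fewest training errors."""
    best: Stump = (0, 0.0, 1)
    best_err = len(y) + 1
    for j in range(len(X[0])):
        for t in sorted({row[j] for row in X}):
            for polarity in (1, -1):
                candidate = (j, t, polarity)
                err = sum(predict_one(candidate, row) != label
                          for row, label in zip(X, y))
                if err < best_err:
                    best, best_err = candidate, err
    return best


def fit_multilabel(X: List[List[float]], Y: List[List[int]]) -> List[Stump]:
    """Fit one independent stump per error category."""
    return [fit_stump(X, [labels[k] for labels in Y]) for k in range(len(Y[0]))]


def predict_multilabel(stumps: List[Stump], row: List[float]) -> List[int]:
    return [predict_one(s, row) for s in stumps]


# Toy training data (invented values, not from the dataset).
X_train = [
    [0.82, 0.75, 0.10],
    [0.35, 0.70, 0.40],
    [0.90, 0.30, 0.15],
    [0.20, 0.25, 0.55],
]
Y_train = [
    [0, 0],
    [1, 0],
    [0, 1],
    [1, 1],
]

stumps = fit_multilabel(X_train, Y_train)
prediction = predict_multilabel(stumps, [0.30, 0.80, 0.20])
print(prediction)  # [1, 0]
```

In practice the provided training subset would be used for fitting and the testing subset for evaluation, and a stronger multi-label classifier would likely replace the stumps. The sketch only shows the shape of the inputs and outputs.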



