Machine-Translated Texts with Human Annotations and Automatic Metric Evaluations
Source: NIAID Data Ecosystem. Indexed: 2026-05-02
Download link:
https://data.mendeley.com/datasets/zx3vt8r26k
Description:
This dataset consists of English journalistic texts translated into Slovak using both statistical and neural machine translation systems. Each translated segment was evaluated by human annotators, and errors were recorded in binary format across five error categories. Additionally, the dataset includes the scores of 68 different automatic evaluation metrics, commonly used to assess machine translation quality. The data is divided into training and testing subsets, allowing for the development of models to predict error categories based on the automatic metric scores.
Created:
2025-07-11
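The prediction task named in the description (error categories from automatic metric scores) could be approached along the following lines. This is a minimal sketch, not the dataset authors' method. The `Segment` class, the field names, and the nearest-centroid baseline are all assumptions made for illustration; the five error categories are indexed by position because the description does not name them, and the example rows are synthetic placeholders rather than values from the dataset.

```python
from dataclasses import dataclass
from typing import List, Optional, Sequence, Tuple

N_METRICS = 68     # number of automatic metrics, per the dataset description
N_CATEGORIES = 5   # number of error categories, per the dataset description


@dataclass
class Segment:
    """Hypothetical record layout for one translated segment."""
    metric_scores: List[float]  # one score per automatic metric
    error_flags: List[int]      # one 0/1 flag per error category


def _centroid(rows: Sequence[Sequence[float]]) -> List[float]:
    """Column-wise mean of a non-empty list of equal-length vectors."""
    n = len(rows)
    return [sum(col) / n for col in zip(*rows)]


def _sq_dist(a: Sequence[float], b: Sequence[float]) -> float:
    """Squared Euclidean distance between two vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


Centroids = Tuple[Optional[List[float]], Optional[List[float]]]


def fit(train: Sequence[Segment]) -> List[Centroids]:
    """Per category, store the mean metric vector of segments without and with the error."""
    model: List[Centroids] = []
    for c in range(N_CATEGORIES):
        neg = [s.metric_scores for s in train if s.error_flags[c] == 0]
        pos = [s.metric_scores for s in train if s.error_flags[c] == 1]
        model.append((_centroid(neg) if neg else None,
                      _centroid(pos) if pos else None))
    return model


def predict(model: Sequence[Centroids], metric_scores: Sequence[float]) -> List[int]:
    """Flag a category when the scores lie closer to its 'error present' centroid."""
    flags = []
    for neg_c, pos_c in model:
        if pos_c is None:      # category never observed in training
            flags.append(0)
        elif neg_c is None:    # category always observed in training
            flags.append(1)
        else:
            closer_to_pos = _sq_dist(metric_scores, pos_c) < _sq_dist(metric_scores, neg_c)
            flags.append(int(closer_to_pos))
    return flags


# Synthetic placeholder rows, NOT taken from the dataset.
train = [
    Segment([0.9] * N_METRICS, [0, 0, 0, 0, 0]),
    Segment([0.8] * N_METRICS, [0, 0, 0, 0, 0]),
    Segment([0.2] * N_METRICS, [1, 0, 1, 0, 0]),
    Segment([0.1] * N_METRICS, [1, 0, 1, 0, 0]),
]
model = fit(train)
low_quality = predict(model, [0.15] * N_METRICS)
high_quality = predict(model, [0.85] * N_METRICS)
```

With real data, the metric scores would likely need to be brought onto a common scale before any distance-based comparison, since different metrics use different ranges and some are error rates where lower is better. A per-category classifier from an established library would be the more usual choice once the actual file layout is known.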



