five

Machine Translation Evaluation Dataset for Amharic

收藏
NIAID Data Ecosystem2026-03-11 收录
下载链接:
https://zenodo.org/record/3669948
下载链接
链接失效反馈
官方服务:
资源简介:
# Machine Translation Evaluation Dataset for Amharic  The dataset contains sentences in Amharic and their corresponding translations in English that were collected using crowd sourcing. These ground-truth sentences are from across different domains such as news headlines, social media, Wikipedia and everyday conversation. ## Metadata of files in the dataset amen.tsv   - Domain: news | wiki | twitter | convo   - Source Sentence: Amharic sentence   - Reference Translation: English translation   - Google Translate: output of Google Translate   - Yandex Translate: output of Yandex Translate enam.tsv   - Domain: news | wiki | twitter | convo   - Source Sentence: English sentence   - Reference Translation: Amharic translation   - Google Translate: output of Google Translate   - Yandex Translate: output of Yandex Translate ## Reference translations across domains **News** - These are news headlines from Ethiopian news websites. **Wikipedia** - A random sample of sentences from the Amharic Wikipedia. **Twitter** - Amharic Twitter posts on consumer products. **Conversational** - Everyday conversational expressions from Amharic native speakers. ## Evaluation of two systems that provide Amharic translation The dataset also contains evaluation of two commercial systems: [Google Translate](https://translate.google.com/) and [Yandex Translate](https://translate.yandex.com/). Both systems provide free APIs that users can sign up and get access keys. The translations were generated on 14th February 2020.
创建时间:
2020-03-31
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作