
Machine Translation Evaluation Dataset for Amharic

Source: Zenodo. Updated: 2020-08-01. Indexed: 2026-05-28.
Download link:
https://zenodo.org/record/3734260
Description:
The dataset contains source sentences in Amharic and English and their corresponding reference translations, which were collected through crowdsourcing. These ground-truth sentences come from several domains: news headlines, social media, Wikipedia, and everyday conversation.

Metadata of files in the dataset

amen.tsv
- Domain: news | wiki | twitter | convo
- Source Sentence: Amharic sentence
- Reference Translation: English translation
- Google Translate: output of Google Translate
- Yandex Translate: output of Yandex Translate

enam.tsv
- Domain: news | wiki | twitter | convo
- Source Sentence: English sentence
- Reference Translation: Amharic translation
- Google Translate: output of Google Translate
- Yandex Translate: output of Yandex Translate

Amharic source and reference translations across domains:
- News: News headlines from Ethiopian news websites.
- Wikipedia: A random sample of sentences from the Amharic Wikipedia.
- Twitter: Amharic Twitter posts on consumer products.
- Conversational: Everyday conversational expressions from native Amharic speakers.

English source and reference translations across domains:
- News: News headlines from the Wikipedia current events portal.
- Wikipedia: A random sample of sentences from the English Wikipedia.
- Twitter: English Twitter posts on global events from the Wikipedia current events portal.
- Conversational: Everyday conversational expressions from native English speakers.

Evaluation of two systems that provide Amharic translation

The dataset also contains the output of two commercial systems: [Google Translate](https://translate.google.com/) and [Yandex Translate](https://translate.yandex.com/). Both systems provide free APIs, for which users can sign up and obtain access keys. The Amharic-to-English translations were generated on 14 February 2020. The English-to-Amharic translations were generated on 30 March 2020.
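The five-field layout described for amen.tsv and enam.tsv can be read with a few lines of Python. The following is a minimal sketch, not an official loader: it assumes each file is tab-separated with exactly the five fields listed in the description, in that order, and that the first line is a header row. Neither assumption is confirmed by the description, so `has_header` is exposed as a parameter. The function names `read_rows`, `group_by_domain`, and `load` are hypothetical helpers introduced here for illustration.

```python
import csv
from collections import defaultdict

# Field names as listed in the dataset description. Whether the files
# carry a header row with these exact names is an assumption.
COLUMNS = [
    "Domain",
    "Source Sentence",
    "Reference Translation",
    "Google Translate",
    "Yandex Translate",
]


def read_rows(lines, has_header=True):
    """Yield one dict per well-formed TSV row, keyed by COLUMNS.

    `lines` is any iterable of strings (an open file, a list of lines).
    """
    # QUOTE_NONE: treat quote characters in sentences as literal text.
    reader = csv.reader(lines, delimiter="\t", quoting=csv.QUOTE_NONE)
    for index, fields in enumerate(reader):
        if has_header and index == 0:
            continue  # skip the assumed header row
        if len(fields) != len(COLUMNS):
            continue  # skip blank or malformed rows rather than guess
        yield dict(zip(COLUMNS, fields))


def group_by_domain(rows):
    """Group row dicts by their Domain value (news, wiki, twitter, convo)."""
    grouped = defaultdict(list)
    for row in rows:
        grouped[row["Domain"]].append(row)
    return dict(grouped)


def load(path, has_header=True):
    """Read a TSV file from disk and return its rows grouped by domain."""
    with open(path, encoding="utf-8", newline="") as handle:
        return group_by_domain(read_rows(handle, has_header=has_header))
```

Calling `load("amen.tsv")` would then return a dictionary mapping each domain to its rows, so the reference translation and the two system outputs for a sentence can be compared side by side. If the files turn out to have no header row, pass `has_header=False`.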
Provider:
Zenodo
Created:
2020-03-31