Umbaji/NMTMD
收藏Hugging Face2025-07-25 更新2025-04-12 收录
下载链接:
https://hf-mirror.com/datasets/Umbaji/NMTMD
下载链接
链接失效反馈官方服务:
资源简介:
NMTMD(NMT-Melinda-Dataset)是一个开源文本数据集,用于西非本地语言的机器翻译(EWE语料库),并在此之后实现了Yodi模型。该数据集的目标是开发一个用于文本到文本翻译的机器翻译文本和语音数据集,并测量由此数据集构建的Yodi模型的准确性和性能。数据集包括两个已转换和分析的词典:KABDICT525和EWEDICT995,分别用于Kabyè语和Ewè语,现在可以作为Python模块用于项目中。
NMTMD (NMT-Melinda-Dataset) is an open-source text dataset for NMT for local languages in West Africa (EWE Corpus) and the implementation of the Yodi model afterward. The objective of the dataset is to develop a Machine Translation Text and Speech Dataset for text-to-text translation and to measure the accuracy or performance of the Yodi model built from this dataset. The dataset includes two transformed and analyzed dictionaries: KABDICT525 and EWEDICT995 for Kabyè and Ewè, which are now available as Python modules for easy integration into projects.
提供机构:
Umbaji



