IgboAPI Dataset
收藏arXiv2024-05-02 更新2024-08-06 收录
下载链接:
http://arxiv.org/abs/2405.00997v1
下载链接
链接失效反馈官方服务:
资源简介:
IgboAPI数据集是由Nko.wa okwu机构开发的多方言Igbo-English词典数据集,旨在增强Igbo方言的表示。该数据集包含5095个Igbo词汇,涵盖33种不同的Igbo方言,并附有27,816个平行例句。创建过程中,由专家词典编纂者负责收集和添加Igbo词汇及其方言变体。IgboAPI数据集的应用领域包括机器翻译和语义词典构建,旨在解决Igbo语言技术中的方言多样性问题,促进语言的沟通、学习和保存。
The IgboAPI dataset is a multi-dialect Igbo-English dictionary dataset developed by the Nko.wa okwu institution, aiming to enhance the representation of Igbo dialects. This dataset contains 5,095 Igbo lexical entries, covers 33 distinct Igbo dialects, and is paired with 27,816 parallel example sentences. During its development, professional lexicographers were responsible for collecting and adding Igbo vocabulary items and their dialectal variants. The application scenarios of the IgboAPI dataset include machine translation and semantic dictionary construction, with the goal of addressing the dialectal diversity issue in Igbo language technology and promoting language communication, learning, and preservation.
提供机构:
Nko.wa okwu
创建时间:
2024-05-02



