DigiGreen/TranslationDataset_AgriQueries
收藏Hugging Face2024-11-08 更新2025-04-12 收录
下载链接:
https://hf-mirror.com/datasets/DigiGreen/TranslationDataset_AgriQueries
下载链接
链接失效反馈官方服务:
资源简介:
---
license: apache-2.0
task_categories:
- translation
language:
- hi
- en
tags:
- Agriculture
- translation
- hindi
- hindi_latin
size_categories:
- 10K<n<100K
---
This Dataset consists of text pair in english and hindi_latin and some sheets which have hindi_devanagri as well.
The dataset is created by different human evaluators who have written hindi sentences in hindi latin (using english alphabets).
The dataset can be used for creating a translation model directly from english to hindi latin, which is how the users use ebcause of limitations of mobile keyboard in typing in hindi devanagri script.
---
许可证:Apache 2.0
任务类别:
- 翻译
语言:
- 印地语
- 英语
标签:
- 农业
- 翻译
- 印地语
- 拉丁化印地语(hindi_latin)
数据规模:
- 10K<n<100K
---
本数据集包含英语与拉丁化印地语(hindi_latin)的文本对,同时部分表格还包含天城文印地语(hindi_devanagri)内容。
本数据集由多名人类评估人员构建,他们使用英文字母编写拉丁化形式的印地语句子。
该数据集可直接用于构建英语到拉丁化印地语的翻译模型——鉴于移动键盘输入天城文印地语存在局限性,这正是用户的实际使用场景。
提供机构:
DigiGreen



