sk-community/romanized_hindi
收藏Hugging Face2025-10-04 更新2025-10-25 收录
下载链接:
https://hf-mirror.com/datasets/sk-community/romanized_hindi
下载链接
链接失效反馈官方服务:
资源简介:
Romanized Hindi数据集是一个包含印地语文本及其罗马化(拉丁脚本)表示的集合。该数据集通过结合多个来源创建,包括开放数据集、合成数据生成和基于规则的转写方法。数据集设计用于训练和评估印地语↔罗马语转写模型。
The Romanized Hindi Dataset is a collection of Hindi text paired with its Romanized (Latin script) representation. It has been created by combining multiple sources, including open datasets, synthetic generation, and rule-based transliteration methods. The dataset is designed for training and evaluating Hindi↔Roman transliteration models.
提供机构:
sk-community



