AI4Bharat-IndicNLP corpus
收藏arXiv2025-09-30 收录
下载链接:
https://github.com/anoopkunchukuttan/indic_nlp_library
下载链接
链接失效反馈官方服务:
资源简介:
该数据集为德拉维达语系的单语数据集,旨在用于评估统一书写系统对语言的影响。该数据集的任务是对无监督翻译进行单语数据分析。
This is a monolingual dataset of the Dravidian language family, purpose-built to evaluate the impact of unified writing systems on languages. The task of this dataset is to conduct monolingual data analysis related to unsupervised translation.
提供机构:
AI4Bharat



