five

CMiLBench: A Hierarchical Multitask Benchmark for Low-Resource Minority Languages in China

收藏
科学数据银行2025-09-30 更新2026-04-23 收录
下载链接:
https://www.scidb.cn/detail?dataSetId=6ca22a09e1cf483c8b71fd2eb13492a1
下载链接
链接失效反馈
官方服务:
资源简介:
As large language models (LLMs) continue to advance, they achieve strong performance on high-resource language tasks and show promising potential for low-resource language processing. However, existing benchmarks primarily focus on high-resource languages, with limited coverage of Chinese minority languages. To address this gap, we introduce CMiLBench (Chinese Minority Language Benchmark), a comprehensive evaluation framework targeting three key Chinese minority languages: Tibetan, Mongolian, and Uyghur. CMiLBench comprises 17 task categories and 24,663 samples, covering both language understanding and generation. Several tasks are derived from native corpora and culturally grounded content, offering a realistic assessment of model performance in authentic minority language scenarios. Tasks are stratified into five difficulty levels, and evaluation is conducted using both automatic metrics and LLM-as-a-Judge scoring. We evaluate 14 leading commercial and open-source LLMs, demonstrating that CMiLBench serves as a reliable benchmark and is broadly applicable for evaluating LLMs’ capabilities in Chinese minority languages, thereby calibrating current technological progress and advancing research and application development of multilingual models for low-resource languages.
提供机构:
Yijie Li; 中央民族大学
创建时间:
2025-09-30
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作