CMiLBench: A Hierarchical Multitask Benchmark for Low-Resource Minority Languages in China

Name: CMiLBench: A Hierarchical Multitask Benchmark for Low-Resource Minority Languages in China
Creator: Yijie Li; 中央民族大学
Published: 2025-09-30 00:00:00
License: 暂无描述

科学数据银行2025-09-30 更新2026-04-23 收录

下载链接：

https://www.scidb.cn/detail?dataSetId=6ca22a09e1cf483c8b71fd2eb13492a1

下载链接

链接失效反馈

官方服务：

资源简介：

As large language models (LLMs) continue to advance, they achieve strong performance on high-resource language tasks and show promising potential for low-resource language processing. However, existing benchmarks primarily focus on high-resource languages, with limited coverage of Chinese minority languages. To address this gap, we introduce CMiLBench (Chinese Minority Language Benchmark), a comprehensive evaluation framework targeting three key Chinese minority languages: Tibetan, Mongolian, and Uyghur. CMiLBench comprises 17 task categories and 24,663 samples, covering both language understanding and generation. Several tasks are derived from native corpora and culturally grounded content, offering a realistic assessment of model performance in authentic minority language scenarios. Tasks are stratified into five difficulty levels, and evaluation is conducted using both automatic metrics and LLM-as-a-Judge scoring. We evaluate 14 leading commercial and open-source LLMs, demonstrating that CMiLBench serves as a reliable benchmark and is broadly applicable for evaluating LLMs’ capabilities in Chinese minority languages, thereby calibrating current technological progress and advancing research and application development of multilingual models for low-resource languages.

提供机构：

Yijie Li; 中央民族大学

创建时间：

2025-09-30

5,000+

优质数据集

54 个

任务类型

进入经典数据集