five

TianYeZ1214/NuosuBench

收藏
Hugging Face2026-04-12 更新2026-03-29 收录
下载链接:
https://hf-mirror.com/datasets/TianYeZ1214/NuosuBench
下载链接
链接失效反馈
官方服务:
资源简介:
--- license: apache-2.0 task_categories: - question-answering - text-generation language: - zh - ii size_categories: - 100K<n<1M --- # Nuosu-Benchmark 总结说明 `Nuosu-Benchmark` 是面向极低资源语言(规范彝文)的标准化评估基准,旨在系统评测大语言模型在跨语言理解、语义对齐与推理生成方面的综合能力。 --- ## 数据集总体构成 `Nuosu-Benchmark` 由 5 个子数据集组成,共 **110,513** 个高质量样本,覆盖词汇、句法、语义、语用到综合推理等多层级能力。 | 数据集 | 样本数 | 语言层次 | 任务类型 | 核心测试能力 | |---|---:|---|---|---| | NuosuLex | 13,581 | 词汇层、语义层 | 词汇翻译、词义辨析 | 词义映射、词汇对齐 | | NuosuSen | 3,584 | 句法层、语义层 | 句子翻译、口语表达 | 语法结构、日常语用 | | NuosuEpic | 17,644 | 语用层 | 诗歌翻译、文化释意 | 跨语理解、文化语境 | | NuosuGov | 1,712 | 语用层 | 公文翻译、阅读理解 | 抽象语义、术语一致性 | | NuosuEdu | 73,992 | 综合层 | 选择、判断、问答 | 语言逻辑、推理逻辑 | ## 评测: [评测代码](https://github.com/Tian-ye1214/NuosuBench) # Nuosu-Benchmark Overview **Nuosu-Benchmark** is a standardized evaluation benchmark designed for extremely low-resource languages (specifically Standard Yi / Nuosu). It aims to systematically assess large language models in cross-lingual understanding, semantic alignment, and reasoning generation. --- ## Dataset Composition Nuosu-Benchmark consists of **5 sub-datasets**, totaling **110,513** high-quality samples. It covers multiple linguistic levels, including lexical, syntactic, semantic, pragmatic, and comprehensive reasoning capabilities. | Dataset | Samples | Linguistic Level | Task Type | Core Evaluation Focus | |-----------|--------:|------------------------|------------------------------------|----------------------------------------| | NuosuLex | 13,581 | Lexical, Semantic | Word translation, sense disambiguation | Lexical mapping, semantic alignment | | NuosuSen | 3,584 | Syntactic, Semantic | Sentence translation, colloquial expression | Grammar structure, everyday pragmatics | | NuosuEpic | 17,644 | Pragmatic | Poetry translation, cultural interpretation | Cross-lingual understanding, cultural context | | NuosuGov | 1,712 | Pragmatic | Official document translation, reading comprehension | Abstract semantics, terminology consistency | | NuosuEdu | 73,992 | Comprehensive | Multiple choice, classification, QA | Linguistic reasoning, logical inference | ## Metrics: [MetricsCode](https://github.com/Tian-ye1214/NuosuBench)
提供机构:
TianYeZ1214
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作