CFLUE: Chinese Financial Language Understanding Evaluation Dataset

Name: CFLUE: Chinese Financial Language Understanding Evaluation Dataset
Creator: maas
Published: 2026-05-21 18:11:33
License: No description available

ModelScope Community. Updated 2026-05-21. Indexed 2025-01-11.

Download link:

https://modelscope.cn/datasets/tongyi_dianjin/CFLUE


Resource overview:

CFLUE (Chinese Financial Language Understanding Evaluation) is a new, comprehensive evaluation benchmark introduced by the Alibaba Cloud Tongyi Dianjin team to assess how well large language models understand and process text in Chinese financial contexts. It measures model performance along two dimensions: knowledge assessment and application assessment. The knowledge assessment contains more than 38,000 multiple-choice questions drawn from 15 types of financial qualification mock exams, and it tests both answer prediction and reasoning. Each question comes with an explanation, which allows a closer evaluation of the model's reasoning process. The application assessment provides more than 16,000 instances covering five classic NLP tasks: text classification, machine translation, relation extraction, reading comprehension, and text generation. These instances come from existing shared tasks or from real data annotated by professionals.
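As a rough illustration of the answer-prediction part of the knowledge assessment, the sketch below scores predicted choices against gold answers by exact match. It is not the benchmark's official scoring code. The field names `question` and `answer`, the use of letter labels, and the possibility of multi-letter answers are all assumptions made for this example, and the records are invented toy data rather than CFLUE items.

```python
from typing import Mapping, Sequence


def normalize_choice(label: str) -> str:
    """Reduce an answer such as ' b, A ' to a canonical form like 'AB'."""
    letters = {ch for ch in label.upper() if "A" <= ch <= "Z"}
    return "".join(sorted(letters))


def multiple_choice_accuracy(
    items: Sequence[Mapping[str, str]],
    predictions: Sequence[str],
) -> float:
    """Fraction of items whose predicted choice set equals the gold answer."""
    if len(items) != len(predictions):
        raise ValueError("items and predictions must have the same length")
    if not items:
        return 0.0
    correct = sum(
        normalize_choice(item["answer"]) == normalize_choice(pred)
        for item, pred in zip(items, predictions)
    )
    return correct / len(items)


# Toy records invented for illustration; the schema is hypothetical.
toy_items = [
    {"question": "Q1", "answer": "A"},
    {"question": "Q2", "answer": "BC"},
    {"question": "Q3", "answer": "D"},
    {"question": "Q4", "answer": "B"},
]
toy_predictions = ["a", "C, B", "A", "B"]

accuracy = multiple_choice_accuracy(toy_items, toy_predictions)
print(f"accuracy = {accuracy:.2f}")  # prints "accuracy = 0.75"
```

Exact match on the normalized letter set treats a partially correct multi-letter answer as wrong. Scoring the accompanying explanations would need a separate text-similarity or human evaluation step, which this sketch does not cover.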

Provided by:

maas

Created:

2025-01-06

Collection summary

Dataset introduction