
CFLUE: Chinese Financial Language Understanding Evaluation Dataset

Source: ModelScope community. Updated: 2026-05-21. Indexed: 2025-01-11.
Download link:
https://modelscope.cn/datasets/tongyi_dianjin/CFLUE
Description:
The Alibaba Cloud Tongyi Dianjin team introduced CFLUE (Chinese Financial Language Understanding Evaluation), a comprehensive benchmark for assessing how well large language models understand and process Chinese financial text. CFLUE measures model performance along two dimensions: "Knowledge Assessment" and "Application Assessment". The Knowledge Assessment section contains over 38,000 multiple-choice questions drawn from mock exams for 15 different financial qualifications, and tests both answer prediction and reasoning. Each question comes with an explanation, which supports a closer evaluation of the model's reasoning process. The Application Assessment section provides over 16,000 instances covering five classic NLP tasks: text classification, machine translation, relation extraction, reading comprehension, and text generation. These instances come from existing shared tasks or from real data annotated by professionals.
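As an illustration of the answer-prediction part of the Knowledge Assessment, the following is a minimal sketch of how accuracy over multiple-choice questions might be computed. The record layout (`ChoiceItem` with `question`, `options`, `answer`) and the helper names are hypothetical and are not taken from the CFLUE release; the sketch also assumes that predictions have already been reduced to option letters such as "B" or "AC".

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class ChoiceItem:
    # Hypothetical record layout; the real CFLUE field names may differ.
    question: str
    options: Dict[str, str]
    answer: str  # gold option letter(s), e.g. "B" or "AC"


def normalize(label: str) -> str:
    """Uppercase, keep letters only, de-duplicate and sort them,
    so that "ca", "A, C" and "AC" all compare equal."""
    return "".join(sorted({ch for ch in label.upper() if ch.isalpha()}))


def accuracy(items: List[ChoiceItem], predictions: List[str]) -> float:
    """Fraction of items whose predicted letters exactly match the gold letters."""
    if len(items) != len(predictions):
        raise ValueError("items and predictions must have the same length")
    if not items:
        return 0.0
    correct = sum(
        normalize(pred) == normalize(item.answer)
        for item, pred in zip(items, predictions)
    )
    return correct / len(items)


# Toy data invented for this example; it is not drawn from the dataset.
items = [
    ChoiceItem("Q1", {"A": "a1", "B": "b1"}, "B"),
    ChoiceItem("Q2", {"A": "a2", "B": "b2", "C": "c2"}, "AC"),
    ChoiceItem("Q3", {"A": "a3", "B": "b3", "C": "c3", "D": "d3"}, "D"),
]
preds = ["b", "C, A", "A"]
score = accuracy(items, preds)
print(f"accuracy = {score:.3f}")
```

Exact-match scoring is only one possible choice. Evaluating the reasoning behind an answer against the per-question explanations would need a separate text-similarity or judgment-based metric, which this sketch does not cover.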
Provider:
maas
Created:
2025-01-06
Collected summary
Dataset introduction
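Background and challenges
Background overview
CFLUE is a Chinese financial language understanding evaluation dataset made up of two parts, a knowledge assessment and an application assessment, intended to evaluate large language models comprehensively in financial contexts. The dataset was jointly released by Alibaba Cloud Tongyi Dianjin and Soochow University. It covers multiple NLP tasks and supports the development of language models for the financial domain.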
The content above was collected and summarized by 遇见数据集.