TRACCERR/gdpval
Source: Hugging Face | Updated: 2025-12-12 | Indexed: 2025-12-20
Download link:
https://hf-mirror.com/datasets/TRACCERR/gdpval
Description:
The GDPval dataset evaluates AI model performance on real-world, economically valuable tasks. It contains 220 real-world knowledge tasks spanning 44 occupations. Each task consists of a text prompt and a set of supporting reference files. The dataset may contain sensitive material, such as NSFW or political content, which reflects the real subject matter handled in these occupations. It also includes limited references to third-party brands and trademarks, used solely for research and evaluation purposes.
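As a rough illustration of the task structure described above (a text prompt plus a set of reference files, grouped by occupation), here is a minimal Python sketch. The class name `Task`, its field names, and the sample records are all hypothetical; the dataset's actual column names are not given on this page and may differ.

```python
from collections import Counter
from dataclasses import dataclass, field


@dataclass
class Task:
    """One evaluation task. Field names are hypothetical, not the dataset schema."""
    occupation: str
    prompt: str
    reference_files: list[str] = field(default_factory=list)


def tasks_per_occupation(tasks: list[Task]) -> Counter:
    """Count how many tasks belong to each occupation."""
    return Counter(task.occupation for task in tasks)


# Made-up sample records for illustration only; not taken from the dataset.
sample = [
    Task("Accountant", "Reconcile the attached ledger.", ["ledger.xlsx"]),
    Task("Accountant", "Draft a variance memo.", ["budget.pdf", "actuals.csv"]),
    Task("Nurse", "Summarize the care plan."),
]
counts = tasks_per_occupation(sample)
```

With 220 tasks across 44 occupations, a count like this would average five tasks per occupation, though this page does not say whether the tasks are evenly distributed.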
Provider:
TRACCERR



