yifenglu/langfun_gdpval
收藏Hugging Face2025-12-13 更新2025-12-20 收录
下载链接:
https://hf-mirror.com/datasets/yifenglu/langfun_gdpval
下载链接
链接失效反馈官方服务:
资源简介:
GDPval是一个用于评估AI模型在现实世界中有经济价值任务上表现的数据集。它包含220个真实世界的知识任务,涵盖44种职业。每个任务由一个文本提示和一组支持参考文件组成。数据集涉及的主题包括NSFW内容(如性、酒精、粗俗语言和政治内容),这些内容反映了各种职业中实际处理的主题。数据集还包含对第三方品牌和商标的有限引用,仅用于研究和评估目的。
GDPval is a dataset designed to evaluate AI model performance on real-world economically valuable tasks. It includes 220 real-world knowledge tasks across 44 occupations. Each task consists of a text prompt and a set of supporting reference files. The dataset covers themes such as NSFW content (e.g., sex, alcohol, vulgar language, and political content), reflecting real themes addressed in various occupations. It also contains limited references to third-party brands and trademarks, solely for research and evaluation purposes.
提供机构:
yifenglu



