alibabagroup/SKYLENAGE-GameCodeGym
收藏Hugging Face2025-09-29 更新2025-11-01 收录
下载链接:
https://hf-mirror.com/datasets/alibabagroup/SKYLENAGE-GameCodeGym
下载链接
链接失效反馈官方服务:
资源简介:
V-GameGym是一个包含2219个游戏样本,分布在100个主题簇中的代码大型语言模型基准测试数据集。它旨在弥合算法问题解决和竞技编程中LLM的能力与现实游戏开发需求之间的差距,提供了一种新颖的基于聚类的筛选方法,并引入了多模态评估框架和自动化的LLM驱动管道。
V-GameGym is a benchmark dataset for code large language models (LLM) containing 2219 game samples across 100 thematic clusters. It aims to bridge the gap between LLM capabilities in algorithmic problem-solving and competitive programming versus the comprehensive requirements of practical game development, featuring a novel clustering-based curation methodology and a multimodal evaluation framework with an automated LLM-driven pipeline.
提供机构:
alibabagroup



