
LudicIndex/ludic-index-turn-logs

Source: Hugging Face | Updated: 2026-04-28 | Indexed: 2026-05-03
Download link:
https://hf-mirror.com/datasets/LudicIndex/ludic-index-turn-logs
Description:
This dataset, Ludic Index — Released Turn Logs, accompanies the paper "Ludic Index: A Decomposition Framework for Value Loss in Constrained LLM Agents". It contains the empirical records behind the paper: detailed per-turn logs, judge rescoring data, per-experiment aggregates, and manifests. The experiments cover Kuhn poker, Frozen Lake 6x6, and the RockSample 4x4 POMDP. Judge calls and comparisons are also included; in total the release holds 6,440 turns and 3,957 judge calls. The model panel consists of various GPT models accessed via the OpenAI Chat Completions API. The dataset is licensed under CC-BY-4.0 and is intended for reproducing the paper's results.
Provider:
LudicIndex
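
Since the dataset is intended for reproducing the paper's results, a first step is usually to fetch the repository and see which files it ships. The following is a minimal sketch, assuming the `huggingface_hub` package is installed and the repository is reachable under the name shown in this listing. The helper names `download_turn_logs` and `list_files` are hypothetical, and the listing does not document the file layout, so the sketch only enumerates whatever is present rather than assuming particular file names.

```python
from pathlib import Path

# Repository name as it appears in this listing.
REPO_ID = "LudicIndex/ludic-index-turn-logs"


def download_turn_logs(local_dir: str = "ludic-index-turn-logs") -> Path:
    """Download a snapshot of the dataset repository and return its local path."""
    # Imported lazily so this module loads even if huggingface_hub is absent.
    from huggingface_hub import snapshot_download

    path = snapshot_download(repo_id=REPO_ID, repo_type="dataset", local_dir=local_dir)
    return Path(path)


def list_files(root: Path) -> list[str]:
    """Return the relative POSIX paths of all files under root, sorted."""
    return sorted(p.relative_to(root).as_posix() for p in root.rglob("*") if p.is_file())


if __name__ == "__main__":
    # Hypothetical usage: fetch the snapshot, then print what it contains.
    root = download_turn_logs()
    for name in list_files(root):
        print(name)
```

Listing the files first avoids guessing at the layout of the turn logs, aggregates, and manifests before reading them.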