five

Human-AI coordination via policy generation from language-guided diffusion

收藏
中国科学数据2026-01-09 更新2026-04-25 收录
下载链接:
https://www.sciengine.com/AA/doi/10.1007/s11431-025-3138-x
下载链接
链接失效反馈
官方服务:
资源简介:
Developing intelligent agents that can effectively coordinate with diverse human partners is a fundamental goal of artificial general intelligence.Previous approaches typically generate a variety of partners to cover human policies, and then either train a single universal agent or maintain multiple best-response (BR) policies for different partners.However, the first direction struggles with the stochastic and multimodal nature of human behaviors, and the second relies on costly few-shot adaptations during policy deployment, which is unbearable in real-world applications such as healthcare and autonomous driving.Recognizing that human partners can easily articulate their preferences or behavioral styles through natural languages (NLs) and make conventions beforehand, we propose a framework for Human-AI Coordination via Policy Generation from Language-guided Diffusion (Haland). Haland first trains BR policies for various partners using reinforcement learning, and then compresses policy parameters into a single latent diffusion model, conditioned on task-relevant language derived from their behaviors.Finally, the alignment between task-relevant and NLs is achieved to facilitate efficient human-AI coordination.Empirical evaluations across diverse cooperative environments demonstrate that Haland generates agents with significantly enhanced zero-shot coordination performance, utilizing only NL instructions from various partners, and outperforms existing methods by approximately 89.64%.
创建时间:
2025-11-17
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作