Research data supporting “Dialogue manager domain adaptation using Gaussian process reinforcement learning”

Name: Research data supporting “Dialogue manager domain adaptation using Gaussian process reinforcement learning”
Creator: University of Cambridge
Published: 2024-12-17 10:39:41
License: CC BY 4.0 (as stated in the dataset description)

Source: DataCite Commons. Updated 2024-12-17; indexed 2024-08-25.

Download link:

https://www.repository.cam.ac.uk/handle/1810/259963

Description:

This dataset corresponds to the results presented in the Computer Speech and Language article “Dialogue manager domain adaptation using Gaussian process reinforcement learning” and relates to Figure 7. Two conditions are contrasted: Prior and NoPrior. NoPrior[1,2,3] contains the data obtained in interaction with Amazon MTurk while training three policies for the SFR domain. Prior[1,2,3] contains the data obtained while training a policy for the SFR domain that uses a generic policy as a prior. Within each directory, a call directory with a timestamp in its name contains a session.xml file with the dialogue log and a feedback.xml file with the user feedback. Figure 8 is obtained using data previously published at https://www.repository.cam.ac.uk/handle/1810/251169 and Figure 9 is obtained using data previously published at https://www.repository.cam.ac.uk/handle/1810/252636 . This data is released under a Creative Commons CC-BY licence (see https://creativecommons.org/licenses/by/4.0/).
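The directory layout described above can be walked with a few lines of Python. This is a minimal sketch, not part of the dataset: it assumes the archive has been extracted so that the condition directories (Prior1, NoPrior1, and so on) sit directly under one root folder, with each call directory one level below them. The function names `find_calls` and `root_tag` are hypothetical, and because the XML schema of session.xml and feedback.xml is not documented here, the sketch only locates the files and reads the root element name.

```python
from pathlib import Path
import xml.etree.ElementTree as ET


def find_calls(root):
    """List call directories under root (assumed layout: root/<condition>/<call>/session.xml)."""
    calls = []
    for session in sorted(Path(root).glob("*/*/session.xml")):
        call_dir = session.parent
        feedback = call_dir / "feedback.xml"
        calls.append({
            "condition": call_dir.parent.name,  # e.g. "Prior1" or "NoPrior2"
            "call": call_dir.name,              # timestamped call directory name
            "session": session,                 # dialogue log
            # feedback.xml is expected alongside session.xml; None if it is absent
            "feedback": feedback if feedback.exists() else None,
        })
    return calls


def root_tag(path):
    """Return the root element name of an XML file, without assuming any schema."""
    return ET.parse(path).getroot().tag
```

Calling `find_calls` on the extracted root folder returns one record per call, which can then be grouped by the `condition` field to compare the Prior and NoPrior runs.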

Provider:

University of Cambridge

Created:

2016-09-05
