five

Policy Optimization Using Semiparametric Models for Dynamic Pricing

收藏
Taylor & Francis Group2022-11-03 更新2026-04-16 收录
下载链接:
https://tandf.figshare.com/articles/dataset/Policy_Optimization_Using_Semiparametric_Models_for_Dynamic_Pricing/21215587/1
下载链接
链接失效反馈
官方服务:
资源简介:
In this paper, we study the contextual dynamic pricing problem where the market value of a product is linear in its observed features plus some market noise. Products are sold one at a time, and only a binary response indicating success or failure of a sale is observed. Our model setting is similar to Javanmard and Nazerzadeh (2019) except that we expand the demand curve to a semiparametric model and learn dynamically both parametric and nonparametric components. We propose a dynamic statistical learning and decision making policy that minimizes regret (maximizes revenue) by combining semiparametric estimation for a generalized linear model with unknown link and online decision making. Under mild conditions, for a market noise c.d.f. F(·) with m-th order derivative (m≥2), our policy achieves a regret upper bound of O˜d(T2m+14m−1), where T is the time horizon and O˜d is the order hiding logarithmic terms and the feature dimension d. The upper bound is further reduced to O˜d(T) if F is super smooth. These upper bounds are close to Ω(T), the lower bound where F belongs to a parametric class. We further generalize these results to the case with dynamic dependent product features under the strong mixing condition.
提供机构:
Fan, Jianqing; Guo, Yongyi; Yu, Mengxin
创建时间:
2022-09-27
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作