Policy Optimization Using Semiparametric Models for Dynamic Pricing

Name: Policy Optimization Using Semiparametric Models for Dynamic Pricing
Creator: Fan, Jianqing; Guo, Yongyi; Yu, Mengxin
Published: 2022-11-03 00:00:00
License: 暂无描述

Taylor & Francis Group2022-11-03 更新2026-04-16 收录

下载链接：

https://tandf.figshare.com/articles/dataset/Policy_Optimization_Using_Semiparametric_Models_for_Dynamic_Pricing/21215587/1

下载链接

链接失效反馈

官方服务：

资源简介：

In this paper, we study the contextual dynamic pricing problem where the market value of a product is linear in its observed features plus some market noise. Products are sold one at a time, and only a binary response indicating success or failure of a sale is observed. Our model setting is similar to Javanmard and Nazerzadeh (2019) except that we expand the demand curve to a semiparametric model and learn dynamically both parametric and nonparametric components. We propose a dynamic statistical learning and decision making policy that minimizes regret (maximizes revenue) by combining semiparametric estimation for a generalized linear model with unknown link and online decision making. Under mild conditions, for a market noise c.d.f. F(·) with m-th order derivative (m≥2), our policy achieves a regret upper bound of O˜d(T2m+14m−1), where T is the time horizon and O˜d is the order hiding logarithmic terms and the feature dimension d. The upper bound is further reduced to O˜d(T) if F is super smooth. These upper bounds are close to Ω(T), the lower bound where F belongs to a parametric class. We further generalize these results to the case with dynamic dependent product features under the strong mixing condition.

提供机构：

Fan, Jianqing; Guo, Yongyi; Yu, Mengxin

创建时间：

2022-09-27

5,000+

优质数据集

54 个

任务类型

进入经典数据集