621 / 2024-04-28 10:58:41
(s,S) Policy Optimization for Distribution Inventory Systems: A Gradient-based Approach
inventory management, gradient estimation, simulation optimization
摘要待审
WenYixing / Shanghai Jiao Tong University
LuoJun / Shanghai Jiao Tong University
WangTong / Shanghai Jiao Tong University
We study a periodic review distribution inventory system in which multiple regional distribution centers (RDCs) order from one central distribution center (CDC). Each order incurs a fixed cost. It is challenging to manage this system due to the high dimensionality of states and actions, as well as the non-continuous cost structure. In this work, we focus on the class of (s,S) policies and derive gradient estimates for the long-run average cost with respect to the policy parameters using conditional Monte Carlo approach. The response surface of the cost to the policies is discontinuous and bumpy. Based on the gradient estimators, we apply an adaptive learning rate optimization algorithm, which is shown performed well in solving complicated high dimensional problems, to optimize the policy. The numerical experiments illustrate that the algorithm results in near-optimal costs.
重要日期
  • 会议日期

    06月28日

    2024

    07月01日

    2024

  • 05月05日 2024

    摘要录用通知日期

  • 05月12日 2024

    摘要截稿日期

  • 07月01日 2024

    注册截止日期

主办单位
中国科学技术大学
协办单位
管理科学与工程学会
移动端
在手机上打开
小程序
打开微信小程序
客服
扫码或点此咨询