随机微分博弈模型中的库存管理问题:马尔可夫链近似和最优策略
Inventory Management Problems in Stochastic Differential Game Models: Markov Chain Approximation and Optimal Policies
摘要: 本文研究了在随机参考价格影响下,两个垄断厂商竞争下的生产和定价策略。设定的库存管理系统包括随机参考价格和随机需求。在随机微分博弈模型的框架下研究了库存管理问题,我们给出该背景下支付函数的定义。为了得到最优生产和定价,我们采用动态规划原理的方法,博弈的上下值满足一个耦合的非线性积分微分Hamilton-Jacobi-Isaacs (HJI)方程组。本文还证明了该对策问题鞍点的存在性,由于很难得到封闭形式的解,我们采用马尔可夫链近似来近似值函数和最优控制,并给出了收敛性分析。最后,我们进行了数值实验,并且根据实验结果,提出了相应的管理建议。
Abstract: This paper investigates the production and pricing strategies employed by two monopolies in a competitive environment, considering the influence of random reference prices. The inventory management system incorporates both random reference prices and random demand. The problem of inventory management is examined within the framework of a stochastic differential game model, with the payment function defined accordingly. To obtain optimal production and pricing decisions, dynamic programming principles are utilized, and coupled nonlinear integral differential Hamilton-Jacobi-Isaacs (HJI) equations that govern the upper and lower values of the game are established. Furthermore, the existence of saddle points for this game problem is proven. Due to challenges in obtaining closed-form solutions, Markov chain approximation is employed to approximate functions and optimal controls while providing convergence analysis. Finally, numerical experiments are conducted to validate our findings, leading to corresponding managerial recommendations.
文章引用:欧君恒, 卢相刚. 随机微分博弈模型中的库存管理问题:马尔可夫链近似和最优策略[J]. 应用数学进展, 2024, 13(4): 1827-1841. https://doi.org/10.12677/aam.2024.134172

参考文献

[1] Whitin, T.M. (1955) Inventory Control and Price Theory. Management Science, 2, 61-68. [Google Scholar] [CrossRef
[2] Bertsimas, D. and De Boer, S. (2005) Special Issue Papers: Dynamic Pricing and Inventory Control for Multiple Products. Journal of Revenue and Pricing Management, 3, 303-319. [Google Scholar] [CrossRef
[3] Chen, L., Chen, Y. and Pang, Z. (2010) Dynamic Pricing and Inventory Control in a Make-to-Stock Queue with Information on the Production Status. IEEE Transactions on Automation Science and Engineering, 8, 361-373. [Google Scholar] [CrossRef
[4] Greenleaf, E.A. (1995) The Impact of Reference Price Effects on the Profitability of Price Promotions. Marketing Science, 14, 82-104. [Google Scholar] [CrossRef
[5] Hu, Z., Chen, X. and Hu, P. (2016) Technical Note—Dynamic Pricing with Gain-Seeking Reference Price Effects. Operations Research, 64, 150-157. [Google Scholar] [CrossRef
[6] Chen, X., Hu, P., Shum, S., et al. (2016) Dynamic Stochastic Inventory Management with Reference Price Effects. Operations Research, 64, 1529-1536. [Google Scholar] [CrossRef
[7] Chen, X., Hu, Z.Y. and Zhang, Y.H. (2019) Dynamic Pricing with Stochastic Reference Price Effect. Journal of the Operations Research Society of China, 7, 107-125. [Google Scholar] [CrossRef
[8] Cao, Y. and Duan, Y. (2020) Joint Production and Pricing Inventory System under Stochastic Reference Price Effect. Computers & Industrial Engineering, 143, Article ID: 106411. [Google Scholar] [CrossRef
[9] Pang, J.S. and Fukushima, M. (2005) Quasi-Variational Inequalities, Generalized Nash Equilibria, and Multi-Leader-Follower Games. Computational Management Science, 2, 21-56. [Google Scholar] [CrossRef
[10] Elliott, R.J. and Siu, T.K. (2010) On Risk Minimizing Portfolios under a Markovian Regime-Switching Black-Scholes Economy. Annals of Operations Research, 176, 271-291. [Google Scholar] [CrossRef
[11] Adida, E. and Perakis, G. (2010) Dynamic Pricing and Inventory Control: Uncertainty and Competition. Operations Research, 58, 289-302. [Google Scholar] [CrossRef
[12] Berkovitz, L.D. and Dresher, M. (1959) A Game-Theory Analysis of Tactical Air War. Operations Research, 7, 599-620. [Google Scholar] [CrossRef
[13] Fleming, W.H. (1961) The Convergence Problem for Differential Games. Journal of Mathematical Analysis and Applications, 3, 102-116. [Google Scholar] [CrossRef
[14] Elliott, R.J. and Kalton, N.J. (1972) The Existence of Value in Differential Games of Pursuit and Evasion. Journal of Differential Equations, 12, 504-523. [Google Scholar] [CrossRef
[15] Jin, Z., Yin, G. and Wu, F. (2013) Optimal Reinsurance Strategies in Regime-Switching Jump Diffusion Models: Stochastic Differential Game Formulation and Numerical Methods. Insurance: Mathematics and Economics, 53, 733-746. [Google Scholar] [CrossRef
[16] Pan, Z. and Başar, T. (1995) H∞-Control of Markovian Jump Systems and Solutions to Associated Piecewise-Deterministic Differential Games. In: Jan Olsder, G., Ed., New Trends in Dynamic Games and Applications, Birkhäuser, Boston, 61-94. [Google Scholar] [CrossRef
[17] Yin, G., Zhang, Q. and Badowski, G. (2003) Discrete-Time Singularly Perturbed Markov Chains: Aggregation, Occupation Measures, and Switching Diffusion Limit. Advances in Applied Probability, 35, 449-476. [Google Scholar] [CrossRef
[18] Kushner, H.J. (2002) Numerical Approximations for Stochastic Differential Games. SIAM Journal on Control and Optimization, 41, 457-486. [Google Scholar] [CrossRef
[19] Herbon, A. and Kogan, K. (2014) Time-Dependent and Independent Control Rules for Coordinated Production and Pricing under Demand Uncertainty and Finite Planning Horizons. Annals of Operations Research, 223, 195-216. [Google Scholar] [CrossRef
[20] Maimon, O., Khmelnitsky, E. and Kogan, K. (1998) Optimal Flow Control in Manufacturing Systems: Production Planning and Scheduling. Springer Science & Business Media, Berlin. [Google Scholar] [CrossRef