Optimal Dividend Problem for the Risk Model with Surplus-Dependent Premiums

DOI: 10.12677/AAM.2019.84090

Author: Liu Xue (刘雪), School of Science, Hebei University of Technology, Tianjin
Keywords: Optimal Dividend Problem; PDMP; Measure-Valued DPE

Abstract: This paper studies the optimal dividend problem for the risk model with surplus-dependent premiums. The objective is to maximize the expected cumulative discounted dividend payments up to the time of ruin. We first establish basic properties of the value function, and then use the theory of measure-valued generators to derive the associated measure-valued dynamic programming equation (measure-valued DPE).

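Cite this article: Liu, X. Optimal Dividend Problem for the Risk Model with Surplus-Dependent Premiums [J]. Advances in Applied Mathematics (应用数学进展), 2019, 8(4): 798-804. https://doi.org/10.12677/AAM.2019.84090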

1. Introduction

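In 1930, Lundberg and Cramér proposed the classical risk model, which is widely used to study the dividends and ruin probability of insurance companies. To bring the risk model closer to practice, Davis proposed in 1984 a more general insurance risk model, known as the piecewise-deterministic compound Poisson risk model. This model is general enough to include many of the risk models in common use. In this paper we assume that the insurer's risk model is the surplus-dependent premium (piecewise-deterministic compound Poisson) model, that is, the surplus process is a piecewise-deterministic Markov process (PDMP). Since their introduction, PDMPs have attracted wide attention in finance, insurance, stochastic control and other fields, and a large literature has emerged. For example, [1] - [6] study continuous and impulse control of PDMPs, optimal stopping, and applications in risk theory. For PDMPs in insurance, see Schmidli [7]. PDMPs were subsequently applied to the optimal dividend problem as well.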

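The optimal dividend problem dates back to [8]: at the 15th International Congress of Actuaries (New York), De Finetti first proposed the expected cumulative discounted dividends paid before ruin as a criterion, studied the optimal dividend problem for a discrete-time risk model, and showed that the optimal dividend strategy is a barrier strategy. The usual approaches to the optimal dividend problem are the classical method of Schmidli [7] and the viscosity solution method of Azcue and Muler [9] (see [9] for more on optimal dividend problems). In 2017, Liu et al. [10] proposed a new theory, the theory of measure-valued generators. This paper uses that theory to derive a measure-valued dynamic programming equation. The method does not require the value function to be smooth, so we do not need to discuss viscosity solutions of the equation.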

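The paper is organized as follows. Section 2 sets up the insurance risk model with surplus-dependent premiums and formulates the corresponding optimal dividend problem. Section 3 gives basic properties of the value function. Section 4 derives the measure-valued dynamic programming equation (measure-valued DPE) via the theory of measure-valued generators.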

2. Model Description

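We start from a complete probability space $(Ω, F, P)$, where $Ω$ is the set of all right-continuous functions with left limits. On this space, the surplus $X$ of the insurance company is given by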

$X_{t} = x + \int_{0}^{t} g (X_{s}) d s - \sum_{i = 1}^{N_{t}} Y_{i}, t \geq 0$ (1.1)

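where $x \geq 0$ is the initial surplus, $g$ is the premium rate as a function of the current surplus, and $N_{t}$ is the number of claims up to time $t$. The claim sizes $\{Y_{i}\}$ are independent and identically distributed random variables with distribution function $Q (x)$, and $\{Y_{i}\}$ and $N_{t}$ are independent. The claim arrival rate $λ > 0$ is constant, so the conditional survival function of the waiting time until the next claim is $F (x, t) = e^{- λ t}$. Let $τ_{n}$ denote the time of the $n$-th claim. Between two consecutive claim times the surplus process can be written as

$X_{t} = φ_{X_{τ_{n}}} (t - τ_{n}), t \in [τ_{n}, τ_{n + 1}), n \in N,$

where $φ_{x} (t)$ denotes the deterministic flow started at $x$, that is, the solution of $φ_{x} (t) = x + \int_{0}^{t} g (φ_{x} (s)) d s$. Define the shift operator $θ_{s}$ by $X_{s + t} = X_{t} \circ θ_{s}$.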
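The piecewise-deterministic structure of (1.1) can be illustrated with a short simulation. The sketch below is only an illustration under assumptions that are not part of the paper: a linear premium rate $g (x) = c + r x$ (so that the flow has a closed form), exponentially distributed claims, and arbitrary parameter values. The names `flow` and `simulate_path` are hypothetical.

```python
import math
import random


def flow(x, t, c=1.0, r=0.05):
    """Deterministic flow phi_x(t) solving phi' = g(phi) with the ASSUMED g(x) = c + r*x."""
    return (x + c / r) * math.exp(r * t) - c / r


def simulate_path(x0, horizon, lam=1.0, c=1.0, r=0.05, mean_claim=0.8, seed=0):
    """Simulate the surplus (1.1) at claim times, up to `horizon` or ruin.

    Returns (path, ruined), where path is a list of (time, surplus) pairs
    recorded at time 0, right after each claim, and at the horizon.
    Exponential claim sizes are an illustrative assumption.
    """
    rng = random.Random(seed)
    t, x = 0.0, x0
    path = [(t, x)]
    while True:
        wait = rng.expovariate(lam)  # inter-claim time, Exp(lam)
        if t + wait > horizon:
            # No further claim before the horizon: follow the flow to the end.
            path.append((horizon, flow(x, horizon - t, c, r)))
            return path, False
        t += wait
        # Follow the flow until the claim, then subtract the claim size.
        x = flow(x, wait, c, r) - rng.expovariate(1.0 / mean_claim)
        path.append((t, x))
        if x < 0:
            return path, True


if __name__ == "__main__":
    path, ruined = simulate_path(5.0, horizon=10.0, seed=42)
    print("ruined:", ruined, "points recorded:", len(path))
```

Between recorded points the surplus follows `flow` exactly, so storing only the post-claim values is enough to reconstruct the whole path.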

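Let $L = \{L_{t}\}_{t \geq 0}$ denote the cumulative dividends paid from time 0 to time $t$. Given a dividend strategy $L$, the controlled surplus process $X_{t}^{L}$ can be written as

$X_{t}^{L} = x + \int_{0}^{t} g (X_{s}^{L}) d s - \sum_{i = 1}^{N_{t}} Y_{i} - L_{t},$ (1.2)

and the corresponding ruin time is defined as

$τ^{L} = \inf \{t \geq 0 : X_{t}^{L} < 0\} .$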

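At every time $t$ the claim arrival rate is $λ$, so the conditional distribution of the claim arrival times is unchanged under control.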

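A strategy $L$ is called admissible if it satisfies the following conditions.

1) $L_{t}$ is nondecreasing and adapted to the natural filtration $\{F_{t}\}_{t \geq 0}$;

2) the process $L_{t}$ satisfies

$L_{t} \leq x + \int_{0}^{t} g (X_{s}^{L}) d s - \sum_{i = 1}^{N_{t}} Y_{i};$

3) equation (1.2) has a unique strong solution $X^{L}$.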

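Let $Π_{x}$ denote the set of all admissible strategies with initial surplus $x \geq 0$.

For each dividend strategy $L \in Π_{x}$, the expected cumulative discounted dividends $V^{L} (x)$ are given by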

$V^{L} (x) = E [\int_{0}^{τ^{L}} e^{- δ s} d L_{s} | X_{0} = x] = E_{x} [\int_{0}^{τ^{L}} e^{- δ s} d L_{s}],$

where $δ > 0$ is the discount rate. Define the value function

$V (x) = \sup \{V^{L} (x) : L \in Π_{x}\}, x \geq 0 .$

By convention, $V (x) = 0$ for $x < 0$.
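To make the quantity $V^{L} (x)$ concrete, the following sketch estimates it by Monte Carlo for one particular strategy, a barrier strategy at level $b$: any surplus above $b$ is paid out immediately, and while the surplus sits at $b$ dividends are paid at rate $g (b)$. This is an illustration only. The barrier strategy, the linear premium rate $g (x) = c + r x$, the exponential claims and all parameter values are assumptions made here, and the function name `barrier_dividends` is hypothetical; nothing below is claimed to be optimal.

```python
import math
import random


def barrier_dividends(x0, b, lam=1.0, delta=0.05, c=1.0, r=0.02,
                      mean_claim=0.8, n_paths=2000, max_claims=10_000, seed=1):
    """Monte Carlo estimate of V^L(x0) for a barrier strategy at level b.

    ASSUMPTIONS (not from the paper): g(x) = c + r*x, Exp(mean_claim) claims.
    """
    rng = random.Random(seed)
    g_b = c + r * b  # dividend rate while the surplus is held at the barrier
    total = 0.0
    for _ in range(n_paths):
        t, x, paid = 0.0, x0, 0.0
        if x > b:  # lump-sum dividend at time 0
            paid += x - b
            x = b
        for _ in range(max_claims):
            wait = rng.expovariate(lam)
            # Time the flow phi' = c + r*phi needs to climb from x to b.
            t_hit = math.log((b + c / r) / (x + c / r)) / r
            if wait > t_hit:
                # Surplus reaches b, then dividends flow at rate g(b) until the claim.
                start, end = t + t_hit, t + wait
                paid += g_b * (math.exp(-delta * start) - math.exp(-delta * end)) / delta
                x_before = b
            else:
                x_before = (x + c / r) * math.exp(r * wait) - c / r
            t += wait
            x = x_before - rng.expovariate(1.0 / mean_claim)
            if x < 0:  # ruin: no further dividends
                break
        total += paid
    return total / n_paths


if __name__ == "__main__":
    print(barrier_dividends(1.0, 2.0))
```

Because the surplus never exceeds $b$ after time 0, the discounted dividends on each path are bounded by the initial lump sum plus $g (b) / δ$, which gives a quick sanity check on the estimate.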

3. Properties of the Value Function

Lemma 2.1: The value function $V (x)$ is nondecreasing and locally Lipschitz continuous, and satisfies

$y - x \leq V (y) - V (x) \leq \frac{δ + λ}{g (0)} V (y) (y - x)$

for $y > x \geq 0$.

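Proof: Fix $ε > 0$ and take a dividend strategy $L \in Π_{x}$ with $V^{L} (x) \geq V (x) - ε$. For any $y > x \geq 0$, define a new strategy $\bar{L} \in Π_{y}$ that first pays $y - x$ to the shareholders as a lump-sum dividend and then follows $L$. Then

$V (y) \geq V^{\bar{L}} (y) = V^{L} (x) + y - x \geq V (x) + y - x - ε .$

Since $ε$ is arbitrary, $y - x \leq V (y) - V (x)$, and in particular $V$ is nondecreasing.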

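Next we show $V (y) - V (x) \leq \frac{δ + λ}{g (0)} V (y) (y - x)$.

Let $t$ be such that $y = φ_{x} (t)$. Starting from $x$ and paying no dividends until the surplus reaches $y$, which happens at time $t$ if no claim arrives before then, gives $V (x) \geq e^{- (δ + λ) t} V (φ_{x} (t))$. Hence

$\begin{matrix} V (y) - V (x) \leq [1 - e^{- (δ + λ) t}] V (φ_{x} (t)) \\ = [1 - e^{- (δ + λ) t}] V (y) \\ \leq (δ + λ) t V (y) . \end{matrix}$

Since $g$ is monotonically increasing, $φ_{x}$ grows at rate at least $g (0)$, so $t \leq \frac{y - x}{g (0)}$. This gives the inequality and shows that $V$ is locally Lipschitz continuous.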
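The two elementary estimates used in the last step, $1 - e^{- a} \leq a$ and $t \leq (y - x) / g (0)$, can be checked numerically. The sketch below does this for an assumed linear premium rate $g (x) = c + r x$, for which the travel time from $x$ to $y$ has a closed form. The premium rate, the parameter values and the helper names are assumptions for illustration only.

```python
import math


def travel_time(x, y, c=1.0, r=0.05):
    """Time for the flow phi' = c + r*phi (ASSUMED g) to go from x up to y >= x."""
    return math.log((y + c / r) / (x + c / r)) / r


def lipschitz_factors(x, y, lam=1.0, delta=0.05, c=1.0, r=0.05):
    """Return the three successive factors multiplying V(y) in the proof."""
    t = travel_time(x, y, c, r)
    exact = 1.0 - math.exp(-(delta + lam) * t)  # 1 - e^{-(delta+lam) t}
    linear = (delta + lam) * t                  # after 1 - e^{-a} <= a
    bound = (delta + lam) * (y - x) / c         # after t <= (y - x)/g(0), with g(0) = c
    return exact, linear, bound


if __name__ == "__main__":
    for x, y in [(0.0, 0.5), (1.0, 3.0), (2.0, 2.1)]:
        print((x, y), lipschitz_factors(x, y))
```

Each triple should be increasing from left to right, mirroring the chain of inequalities in the proof.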

4. Dynamic Programming Principle (DPP) and Dynamic Programming Equation (DPE)

Let $U_{x}$ be the set of measurable functions $α (\cdot) : R_{+} \mapsto [0, l_{0}]$ satisfying the following conditions:

1) for $x \geq 0$, the equation $φ_{x}^{α} (t) = x + \int_{0}^{t} g (φ_{x}^{α} (s)) d s - α (t)$ has a unique solution $φ_{x}^{α}$;

2) for $t \geq 0$, $α (t) \leq x + \int_{0}^{t} g (φ_{x}^{α} (s)) d s$.

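We now define Markov strategies and the set of Markov strategies.

Definition 4.1: A strategy $L \in Π_{x}$ is called a Markov control if the controlled surplus process $X^{L}$ is a strong Markov process; if $X^{L}$ is a time-homogeneous strong Markov process, $L$ is called a stationary Markov control.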

Let $M$ be the set of measurable functions $l (\cdot, \cdot) : R_{+} \times R_{+} \mapsto [0, l_{0}]$ such that

1) for $x \geq 0$, $l (x, \cdot) \in U_{x}$;

2) for any $x \geq 0$ and $s, t \geq 0$,

$l (x, s) + l (φ_{x}^{l} (s), t) = l (x, s + t);$

3) for $t \in [τ_{n}, τ_{n + 1}]$,

$L_{t} = L_{τ_{n}} + l (X_{τ_{n}}^{L}, t - τ_{n});$

4) the equation

$φ_{x}^{l} (t) = x + \int_{0}^{t} g (φ_{x}^{l} (s)) d s - l (x, t)$ (4.1)

has a unique solution $φ_{x}^{l}$.

By the above definition and Theorem 2.3 of [10], the functions in $M$ are Markov strategy functions.

Lemma 4.2: Suppose there exist an optimal stationary Markov strategy $L^{*}$ and a corresponding function $l^{*} \in M$. Then the value function $V$ satisfies

$V (x) = \sup_{l \in M} E_{x} [\int_{0}^{t \land τ_{1}} e^{- δ s} l (x, d s) + e^{- δ (t \land τ_{1})} V (X_{t \land τ_{1}}^{L})], x \geq 0, t \geq 0$ (4.2)

where $X_{t \land τ_{1}}^{L} = φ_{x}^{l} (t \land τ_{1}) - Y_{1} I_{\{τ_{1} \leq t\}}$.

Proof: When the optimal strategy $L^{*}$ is a stationary Markov strategy, we have

$V (x) = E_{x} [\int_{0}^{t \land τ_{1}} e^{- δ s} l^{*} (x, d s)] + E_{x} [\int_{t \land τ_{1}}^{τ^{L^{*}}} e^{- δ s} d L_{s}^{*}], t \geq 0, x \geq 0$ (4.3)

Since $L^{*}$ is a Markov strategy, the second term on the right-hand side can be written as

$E_{x} [\int_{t \land τ_{1}}^{τ^{L^{*}}} e^{- δ s} d L_{s}^{*}] = E_{x} [e^{- δ (t \land τ_{1})} V (X_{t \land τ_{1}}^{L^{*}})] .$

Hence (4.3) can be written as

$V (x) = E_{x} [\int_{0}^{t \land τ_{1}} e^{- δ s} l^{*} (x, d s) + e^{- δ (t \land τ_{1})} V (X_{t \land τ_{1}}^{L^{*}})],$ $x \geq 0, t \geq 0$ (4.4)

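Now let $l \in M$ be arbitrary. We construct a new strategy $L$ as follows: before the first claim arrival the dividend strategy is the general strategy determined by $l$, and in every later inter-claim interval it is the optimal strategy $L^{*}$.

Under the new strategy, the value function satisfies

$V (x) \geq E_{x} [\int_{0}^{t \land τ_{1}} e^{- δ s} l (x, d s) + e^{- δ (t \land τ_{1})} V (X_{t \land τ_{1}}^{L})], x \geq 0, t \geq 0 .$ (4.6)

Combining (4.4) and (4.6) yields (4.2).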

Theorem 4.3: Suppose there exist an optimal stationary Markov strategy $L^{*} \in Π_{x}$ and a function $l^{*} \in M$. Then the value function $V (x)$ satisfies

$0 = \sup_{l \in M} \{l (x, t) - δ \int_{0}^{t} V (φ_{x}^{l} (s)) d s + A^{l} V (x, t)\},$ (4.7)

where

$\begin{matrix} A^{l} V (x, t) = V (φ_{x}^{l} (t)) - V (x) - λ \int_{0}^{t} V (φ_{x}^{l} (s)) d s \\ + λ \int_{0}^{t} \int_{0}^{φ_{x}^{l} (s)} V (φ_{x}^{l} (s) - y) d Q (y) d s . \end{matrix}$

Proof: By Lemma 4.2, for any $l \in M$ we have

$\begin{matrix} V (x) \geq E_{x} [\int_{0}^{t \land τ_{1}} e^{- δ s} l (x, d s) + e^{- δ (t \land τ_{1})} V (X_{t \land τ_{1}}^{L})] \\ = F (x, t) \int_{0}^{t} e^{- δ s} l (x, d s) - \int_{0}^{t} \int_{0}^{s} e^{- δ u} l (x, d u) d F (x, s) \\ + F (x, t) e^{- δ t} V (φ_{x}^{l} (t)) - \int_{0}^{t} e^{- δ s} \int_{0}^{φ_{x}^{l} (s)} V (φ_{x}^{l} (s) - y) d Q (y) d F (x, s), \end{matrix}$

where $- d F (x, s) = λ e^{- λ s} d s$ is the density of the first claim time $τ_{1}$, since $F (x, s) = e^{- λ s}$ is its survival function.

Integrating by parts, we obtain

$e^{- λ t} \int_{0}^{t} e^{- δ s} l (x, d s) = \int_{0}^{t} e^{- δ s} e^{- λ s} l (x, d s) + \int_{0}^{t} \int_{0}^{s} e^{- δ u} l (x, d u) d e^{- λ s}$

and

$\begin{array}{l} e^{- λ t} e^{- δ t} V (φ_{x}^{l} (t)) \\ = V (x) + \int_{0}^{t} e^{- δ s} V (φ_{x}^{l} (s)) d e^{- λ s} + \int_{0}^{t} e^{- λ s} e^{- δ s} d V (φ_{x}^{l} (s)) \\ - δ \int_{0}^{t} e^{- δ s} e^{- λ s} V (φ_{x}^{l} (s)) d s . \end{array}$

Substituting these into the inequality above and cancelling $V (x)$ gives

$\begin{array}{l} 0 \geq \int_{0}^{t} e^{- δ s} e^{- λ s} [- (λ + δ) V (φ_{x}^{l} (s)) + λ \int_{0}^{φ_{x}^{l} (s)} V (φ_{x}^{l} (s) - y) d Q (y)] d s \\ + \int_{0}^{t} e^{- δ s} e^{- λ s} l (x, d s) + \int_{0}^{t} e^{- δ s} e^{- λ s} d V (φ_{x}^{l} (s)) \end{array}$ (4.8)

Let

$H^{l} V (x, t) = \int_{0}^{t} [- (λ + δ) V (φ_{x}^{l} (s)) + λ \int_{0}^{φ_{x}^{l} (s)} V (φ_{x}^{l} (s) - y) d Q (y)] d s + l (x, t) + V (φ_{x}^{l} (t)) - V (x) .$

Then inequality (4.8) can be written, for any $u \in [0, t]$, as

$\begin{array}{l} 0 \geq \int_{0}^{t} e^{- δ s} e^{- λ s} H^{l} V (x, d s) \\ = \int_{0}^{u} e^{- δ s} e^{- λ s} H^{l} V (x, d s) + \int_{u}^{t} e^{- δ s} e^{- λ s} H^{l} V (x, d s) . \end{array}$

For $l \in M$, Theorem 2.3 of [10] gives $φ_{x}^{l} (s + t) = φ_{φ_{x}^{l} (s)}^{l} (t)$, and hence

$H^{l} V (x, 0) = 0$, $H^{l} V (x, s + t) = H^{l} V (x, s) + H^{l} V (φ_{x}^{l} (s), t)$.

Therefore

$\int_{u}^{t} e^{- δ s} e^{- λ s} H^{l} V (x, d s) = \int_{0}^{t - u} e^{- δ (u + s)} e^{- λ (u + s)} H^{l} V (φ_{x}^{l} (u), d s) \leq 0 ,$

where the inequality follows from (4.8) applied at the initial point $φ_{x}^{l} (u)$.
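The additivity $H^{l} V (x, s + t) = H^{l} V (x, s) + H^{l} V (φ_{x}^{l} (s), t)$ rests only on the semigroup property of the flow, so it can be checked numerically for any test function in place of $V$. The sketch below does this in the simplest case $l \equiv 0$ (no dividends). Everything concrete in it is an assumption for illustration: the linear premium rate $g (x) = c + r x$, exponential claims with rate `MU`, the test function `V_test` (which is not the true value function), and the quadrature rules.

```python
import math

# Illustrative parameters, all assumed (not taken from the paper).
LAM, DELTA, C, R, MU = 1.0, 0.05, 1.0, 0.05, 1.25


def flow(x, t):
    """Uncontrolled flow phi_x(t) for the ASSUMED premium rate g(x) = C + R*x."""
    return (x + C / R) * math.exp(R * t) - C / R


def V_test(z):
    """Arbitrary smooth test function standing in for V."""
    return 1.0 + z


def inner(z, n=200):
    """Midpoint rule for int_0^z V(z - y) dQ(y) with Q = Exp(MU) (assumed)."""
    if z <= 0:
        return 0.0
    h = z / n
    return h * sum(
        V_test(z - (i + 0.5) * h) * MU * math.exp(-MU * (i + 0.5) * h)
        for i in range(n)
    )


def k(z):
    """Integrand of H: -(lam + delta) V(z) + lam * int_0^z V(z - y) dQ(y)."""
    return -(LAM + DELTA) * V_test(z) + LAM * inner(z)


def H(x, t, n=400):
    """H^l V(x, t) for l = 0, using the trapezoid rule in s."""
    if t == 0:
        return 0.0
    h = t / n
    vals = [k(flow(x, i * h)) for i in range(n + 1)]
    integral = h * (sum(vals) - 0.5 * (vals[0] + vals[-1]))
    return integral + V_test(flow(x, t)) - V_test(x)


if __name__ == "__main__":
    x, s, t = 1.0, 0.5, 1.0
    print(H(x, s + t), H(x, s) + H(flow(x, s), t))
```

The two printed numbers should agree up to quadrature error, since the integral over $[0, s + t]$ splits at $s$ and the flow restarted from $φ_{x} (s)$ traces the same trajectory.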

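It follows that $\int_{0}^{t} e^{- δ s} e^{- λ s} H^{l} V (x, d s)$ is nonincreasing in $t$. Since the weight $e^{- (δ + λ) s}$ is strictly positive, $H^{l} V (x, \cdot)$ is itself nonincreasing with $H^{l} V (x, 0) = 0$, so for any $x \geq 0$ we obtain

$0 \geq l (x, t) - δ \int_{0}^{t} V (φ_{x}^{l} (s)) d s + A^{l} V (x, t),$ (4.9)

where

$\begin{matrix} A^{l} V (x, t) = V (φ_{x}^{l} (t)) - V (x) - λ \int_{0}^{t} V (φ_{x}^{l} (s)) d s \\ + λ \int_{0}^{t} \int_{0}^{φ_{x}^{l} (s)} V (φ_{x}^{l} (s) - y) d Q (y) d s . \end{matrix}$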

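On the other hand, suppose there exist an optimal stationary Markov strategy and a corresponding function $l^{*} \in M$. Applying the same derivation as for (4.9), (4.3) can be written as

$0 = l^{*} (x, t) - δ \int_{0}^{t} V (φ_{x}^{l^{*}} (s)) d s + A^{l^{*}} V (x, t) .$ (4.10)

Combining (4.9) and (4.10) gives

$0 = \sup_{l \in M} \{l (x, t) - δ \int_{0}^{t} V (φ_{x}^{l} (s)) d s + A^{l} V (x, t)\} .$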

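For fixed $x$, taking the supremum over $l \in M$ in the above equation is equivalent to taking it over $l (x, \cdot) \in U_{x}$. We can therefore rewrite (4.7) as

$0 = \sup_{α \in U_{x}} \{α (t) - δ \int_{0}^{t} V (φ_{x}^{α} (s)) d s + A^{α} V (x, t)\},$ (4.11)

where

$\begin{matrix} A^{α} V (x, t) = V (φ_{x}^{α} (t)) - V (x) - λ \int_{0}^{t} V (φ_{x}^{α} (s)) d s \\ + λ \int_{0}^{t} \int_{0}^{φ_{x}^{α} (s)} V (φ_{x}^{α} (s) - y) d Q (y) d s . \end{matrix}$

Equation (4.11) is a measure-valued equation, so we call it the measure-valued dynamic programming equation (measure-valued DPE).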

References

[1] Costa, O.L.V. and Raymundo, C.A.B. (2000) Impulse and Continuous Control of Piecewise Deterministic Markov Processes. Stochastics: An International Journal of Probability Stochastic Processes, 70, 75-107.
[2] Azcue, P. and Muler, N. (2005) Optimal Reinsurance and Dividend Distribution Policies in the Cramer-Lundberg Model. Mathematical Finance, 15, 261-308.
[3] Liu, G. and Liu, Z. (2015) Piecewise Deterministic Markov Processes and Additive Functional of Semi-Dynamic Systems. Scientia Sinica Mathematics, 45, 579-592. (In Chinese)
[4] Cai, J., Feng, R. and Willmot, G.E. (2009) On the Expectation of Total Discounted Operating Costs up to Default and Its Applications. Advances in Applied Probability, 41, 495-522.
[5] Feng, R., Zhang, S. and Zhu, C. (2013) Optimal Dividend Payment Problems in Piecewise-Deterministic Compound Poisson Risk Models. IEEE Decision and Control, 7309-7314.
[6] Marciniak, E. and Palmowski, Z. (2016) On the Optimal Dividend Problem for Insurance Risk Models with Surplus-Dependent Premiums. Journal of Optimization Theory Applications, 168, 723-742.
[7] Schmidli, H. (2008) Stochastic Control in Insurance. Springer, London.
[8] De Finetti, B. (1957) Su un’impostazione alternativa della teoria collettiva del rischio. Transactions of the XVth International Congress of Actuaries, 2, 433-443.
[9] Azcue, P. and Muler, N. (2014) Stochastic Optimization in Insurance: A Dynamic Programming Approach. In Springer Briefs in Quantitative Finance.
[10] Liu, Z., Jiao, Y. and Liu, G. (2017) Measure-Valued Generator of General Piecewise Deterministic Markov Processes. arXiv: 1704.00938
