Copulas相依结构下完全和非完全样本极值的联合渐近分布

doi:10.12677/AAM.2023.1211476

期刊菜单

Copulas相依结构下完全和非完全样本极值的联合渐近分布
Asymptotic Distributions of Extremes of Complete and Incomplete Samples under Copulas Dependent

DOI: 10.12677/AAM.2023.1211476, PDF, HTML, XML, 被引量科研立项经费支持
作者: 方圆：浙江师范大学数学科学学院，金华浙江；嘉兴学院数据科学学院，嘉兴浙江
关键词: 随机缺失；Copula；极值；Random Missing； Copula； Extremes

摘要: 在日常生活中，观察某一组数据时，各种偶然因素或不可避免的因素都会导致数据出现随机丢失。通过研究满足阿基米德Copula相依结构极值的联合分布，可以减少或避免极端事件发生时所带来的损失。本文研究了随机缺失情形下，随机序列满足阿基米德Copula相依结构时完全样本与非完全样本极值的联合渐近分布，并给出几种典型的示例来说明主要结论。这不仅在理论中具有重要意义，在实际生活中也具有一定的意义。

Abstract: In daily life, when observing a certain set of data, various accidental or unavoidable factors will lead to random missing. By studying the joint distribution of the extreme values of the Archimedean Copula dependence structures, the losses caused by extreme events can be reduced or avoided. This paper studies the asymptotic distributions of extremes of complete and incomplete samples under random missing. Some examples are given to illustrate the main results. This is not only of great significance in theory, but also has certain significance in real life.

文章引用：方圆. Copulas相依结构下完全和非完全样本极值的联合渐近分布[J]. 应用数学进展, 2023, 12(11): 4824-4833. https://doi.org/10.12677/AAM.2023.1211476

1. 引言

经典的极值理论是研究社会生活中极端现象的一门学科，其主要研究一列随机序列极大值的相关极限理论。设 $X = {X_{n}, n \geq 1}$ 是一列独立同分布的实值随机序列，具有边际分布函数 $F (x)$ 。若存在常数序列 $a_{n} > 0, b_{n} \in R$ 和非退化分布函数 $G (x)$ 使得对 $G (x)$ 的任意连续点处有

$\lim_{n \to \infty} F^{n} (a_{n} x + b_{n}) = G (x),$ (1)

则称分布函数F属于非退化函数 $G (x)$ 的最大吸引场，简记为 $F \in D (G)$ 。注意到 $G (x)$ 必为如下三大极值分布

$Gumbel : Λ (x) = \exp (- e^{- x}), - \infty < x < + \infty;$

$Frechet : Φ_{α} (x) = {\begin{array}{l} 0, & x \leq 0, \\ \exp (- x^{- α}), & x > 0, \end{array} α > 0;$

$Weibull : Ψ_{α} (x) = {\begin{array}{l} \exp (- {(- x)}^{- α}), & x \leq 0, \\ 1, & x > 0, \end{array} α > 0.$

上述的经典结果及其相关推广见专著 [1] 。

在实际应用中，有些数据可能因不同的原因以一种非常不规则的方式丢失。在许多领域(例如金融、水文、气象等)，不同的研究者可能对不同的观测频率的样本感兴趣。在这些情况下，研究完整样本极值和非完整样本的极值的渐近理论以及它们之间的渐近关系变得很重要。

假设 $X$ 中仅有部分随机变量能够被观测到。令 $ε = {ε_{n}, n \geq 1}$ 为一列伯努利序列，表示随机变量 $X_{n}$ 被观测到的事件的指标且与 $X = {X_{n}, n \geq 1}$ 独立。在 $X$ 的前n个样本中，记 $M_{n} = \max {X_{k}, k = 1, 2, \dots, n}$ 和随机缺失序列的最大值为 ${\tilde{M}}_{n} = \max {X_{k}, ε_{k} = 1, 1 \leq k \leq n}$ 。 $S_{n} = \sum_{k = 1}^{n} ε_{k}$ 并假设其满足，当 $n \to \infty$ 时

$\frac{S_{n}}{n} \overset{p}{\to} λ,$

其中 $λ$ 为随机或非随机变量。

当 $λ$ 为一个常数时，文 [2] 研究了独立同分布随机序列完全样本和非完全样本极值之间的渐近关系，获得了如下结论：对任意 $x < y \in ℝ$ 有

$\lim_{n \to \infty} P (a_{n} ({\tilde{M}}_{n} - b_{n}) \leq x, a_{n} (M_{n} - b_{n}) \leq y) = H (x, y, λ),$ (2)

其中

$H (x, y, λ) = G^{λ} (x) G^{1 - λ} (y) .$

同时文 [2] 也研究了一类平稳相依情形，即在极值理论领域一类非常经典的相依条件 $D (u_{n}, v_{n})$ 和 $D^{'} (u_{n})$ (其定义详见专著 [1] )下研究了完全样本和非完全样本极值之间的渐近关系，证明了(2)仍然成立。

从那时起，该问题逐渐成为极值领域的一个研究热点，并被推广到一些其他随机情形。该问题在平稳高斯情形下的推广见文 [3] [4] [5] [6] ，在自回归过程和线性过程下的推广见文 [7] [8] ，在随机场下的推广见文 [9] [10] ，在几乎处处极限定理方面的推广见文 [11] [12] [13] ，在其他相关情形下的推广见文 [14] [15] 及其参考文献。

上述的研究中考虑了一些相依情形，如 [2] [14] 在 $D (u_{n}, v_{n})$ 和 $D^{'} (u_{n})$ 条件下考虑该问题， [3] [4] [5] [6] 则在相依高斯背景下考虑了该问题， [7] [8] 在自回归过程和线性过程下考虑了该问题。除 [3] [4] [5] [6] 中的高斯情形之外，其他研究中考虑的相依性都很弱的，其不影响极限分布 $H (x, y, λ)$ 的形式。因此非常有必要在强相依背景下研究非高斯随机序列完全样本和非完全样本极值之间的渐近关系。

本文将探讨随机序列满足一类Copula相依结构时，完全样本和非完全样本极值之间的渐近关系。在过去的几年里，与Copula相关的主题引起了人们的极大兴趣。Copula被用来描述随机变量之间的尺度不变依赖关系。对这种随机依赖结构的理解在概率论的所有领域中都变得非常重要。特别是在精算领域和金融领域的现代风险管理和压力测试方面，Copula已经证明了它们在构建适当的多元模型方面的有效性。关于Copula的相关理论介绍，可参考专著 [16] 。

2. 多维Copula

在本节中，我们将介绍一些关于Copula的定义，性质及相关定理，它们均来自专著 [16] 。Copula是将多维分布函数与其边缘分布函数连接在一起的一种多元函数，自变量是边缘分布函数。它清晰反映两个边缘分布函数之间联系的结构，可以通过了解这些结构是如何影响二维分布函数的各种性质。

定义 2.1. (Copula) 令 $d \geq 2$ 。一个d维Copula函数是一个定义在 ${[0, 1]}^{d}$ 上的d维分布函数，其边际分布函数服从 $(0, 1)$ 上的均匀分布。

因此，对给定的Copula函数C和边际分布函数 $F_{1}, \dots, F_{d}$ 有

$F (x_{1}, \dots, x_{d}) = C (F_{1} (x_{1}), \dots, F_{d} (x_{d}))$ (3)

是一个分布函数。反之，对给定的具有边际分布函数 $F_{1}, \dots, F_{d}$ 的多维分布函数F，存在一个满足(3)的Copula函数C。这个Copula函数C是不唯一的，如果 $F_{1}, \dots, F_{d}$ 是连续函数，则有

$C (x_{1}, \dots, x_{d}) = F (F_{1}^{- 1} (x_{1}), \dots, F_{d}^{- 1} (x_{d})),$ (4)

其中 $F_{i}^{- 1}$ 表示分布函数 $F_{i}$ 的广义逆函数。(3)和(4)的证明见专著 [16] 。

常见的Copula有commonotonic Copula

$C (x_{1}, \dots, x_{d}) = \min {x_{1}, \dots, x_{d}}$

和独立Copula

$C (x_{1}, \dots, x_{d}) = x_{1} \cdot \dots \cdot x_{d}$ .

本文主要考虑Archimedean Copula。

定义 2.2.令 $d \geq 2$ 。设 $ψ : [0, 1] \to [0, \infty]$ 是严格递减的、凸的函数，并且使得 $ψ (0) = \infty, ψ (1) = 0$ 对于 $x_{i} \in [0, 1], i = 1, \dots, d$ 有

$C_{n}^{ψ} (x_{1}, \dots, x_{d}) = ψ^{- 1} (\sum_{i = 1}^{d} ψ (x_{i})),$

其中， $ψ$ 称为 $C_{n}^{ψ}$ 的生成元。

定义 2.3. 一个定义在I上函数g被称为在I上完全单调的，如果它是连续的并且具有交替符号的所有阶导数，即 ${(- 1)}^{k} \frac{d^{k}}{d x^{k}} g (x) \geq 0$ ，对所有 $k \geq 0$ 和所有 $x \in I$ 成立。

定理 2.4. 对所有 $d \geq 2$ ， $C_{d}^{ψ}$ 是一个Copula当且仅当生成元 $ψ$ 具有逆函数 $ψ^{- 1}$ 并且 $ψ^{- 1}$ 在 $[0, \infty]$ 上完全单调。

定义 2.5. (Archimedean Copula) 如果 $ψ^{- 1}$ 在 $[0, \infty]$ 上完全单调，则称 $C_{d}^{ψ}$ 是Archimedean Copula。

Archimedean Copula在实践中很有趣，通常它们只有一个参数，进而它们很容易构造。下面介绍几类常见的Archimedean Copula。

定义2.6. (Gumbel Copula) 若 $C_{d}^{G u, α} (x_{1}, x_{2}, \dots, x_{d})$ 具有如下形式

$C_{d}^{G u, α} (x_{1}, x_{2}, \dots, x_{d}) = \exp {- [\sum_{i = 1}^{d} {(- \log x_{i})}^{α}]}$

则称其为Gumbel Copula，其生成元 $ψ (t) = {(- \log (t))}^{α}, α \geq 1$ 。

定义 2.7. (Clayton copula) 若 $C_{d}^{C l, α} (x_{1}, x_{2}, \dots, x_{d})$ 具有如下形式

$C_{d}^{C l, α} (x_{1}, x_{2}, \dots, x_{d}) = {(x_{1}^{- α} + \dots + x_{d}^{- α} - d + 1)}^{- \frac{1}{α}}$

则称其为Clayton copula，其生成元 $ψ (t) = t^{- α} - 1, α \geq 0$ 。

文 [17] 研究了满足上述Archimedean Copula结构的随机序列极值的极限分布问题，文 [18] [19] 则在Archimedean Copula结构下获得了随机序列极值的几乎处处中心极限定理。

3. 主要结论及其证明

在本节中，设 $X = {X_{n}, n \geq 1}$ 是一列随机变量序列，并设 $ε = {ε_{n}, n \geq 1}$ 为一列伯努利随机序列，其中 $ε_{n}$ 表示随机变量 $X_{n}$ 被观测到的事件的指标且与 $X = {X_{n}, n \geq 1}$ 独立。令 $S_{n} = \sum_{k = 1}^{n} ε_{k}$ 。记 $M_{n} = \max {X_{k}, k = 1, 2, \dots, n}$ ， ${\tilde{M}}_{n} = \max {X_{k}, ε_{k} = 1, 1 \leq k \leq n}$ 。

现在，我们将阐述主要结论。

定理 3.1.设 $X = {X_{n}, n \geq 1}$ 是一列同分布的随机变量序列具有连续的边际分布函数 $F (x)$ ，满足以下条件：

(i) 对所有的 $n \geq 1$ ，随机向量 $(X_{1}, X_{2}, \dots, X_{n})$ 具有Archimedean Copula $C_{n}^{ψ}$ 结构；

(ii) $\exp (- ψ \circ F) \in D (G)$ ，即存在常数序列 $c_{n} > 0, d_{n} \in ℝ$ 使得对 $G (x)$ 的任意连续点处有

$\lim_{n \to \infty} \exp (- n ψ \circ F (c_{n} x + d_{n})) = G (x) .$ (5)

进一步假设 $S_{n} = \sum_{k = 1}^{n} ε_{k}$ 满足

$\frac{S_{n}}{n} \overset{p}{\to} λ,$ (6)

其中 $λ$ 为一个常数。则对 $x < y \in ℝ$ ，有

$\lim_{n \to \infty} P ({\tilde{M}}_{n} \leq c_{n} x + d_{n}, M_{n} \leq c_{n} y + d_{n}) = ψ^{- 1} (- \log (G^{λ} (x) G^{1 - λ} (y))) .$ (7)

注记 3.2. (i). 记 ${\hat{M}}_{n} = \max {X_{k}, ε_{k} = 0, 1 \leq k \leq n}$ ，由定理易得：对 $x, y \in ℝ$ ，有

$\lim_{n \to \infty} P ({\tilde{M}}_{n} \leq c_{n} x + d_{n}, {\hat{M}}_{n} \leq c_{n} y + d_{n}) = ψ^{- 1} (- \log (G^{λ} (x) G^{1 - λ} (y))) .$ (8)

(ii). 结果(2)表明在弱相依情形下，非完全样本极值与完全样本极值之间是渐近独立；而定理3.1表明非完全样本极值与完全样本极值之间的渐近关系取决于生成元 $ψ$ 。对于Gumbel copula $ψ (t) = {(- \log t)}^{α}$ ，如果选取 $α = 1$ ，则非完全样本极值与完全样本极值之间是渐近独立的，否者是渐近相依的。详细情况可见下节中的例子。

(iii). 在实际应用中，我们除了对极大值感兴趣外，我们也常常对极值次序统计量感兴趣，因此，在Copula结构下考虑完全样本极值次序统计量和非完全样本极值次序统计量的渐近关系也是有意义的工作。但是，我们的证明方法对极值次序统计量不成立。

(iv). 将(6)中的常数 $λ$ 推广到随机变量情形是非常有意义的工作，相关研究可见 [2] [4] [14] 。但是本文的方法对这种情况不成立。这个问题与(iii)中提到的问题将在另外一篇文章被解决。

(v). 文 [17] 给出了 $\exp (- ψ \circ F) \in D (G)$ 的充分必要条件，同时给出了选择常数列 $c_{n}$ 和 $d_{n}$ 的方法，详见文 [17] 中定理4.4和推论4.5。

定理3.1的证明：由全概率公式，(3)和(4)式可得

$\begin{array}{l} P ({\tilde{M}}_{n} \leq c_{n} x + d_{n}, M_{n} \leq c_{n} y + d_{n}) \\ = \sum_{k = 0}^{n} P (S_{n} = k) P ({\tilde{M}}_{n} \leq c_{n} x + d_{n}, M_{n} \leq c_{n} y + d_{n} | S_{n} = k) \\ = \sum_{k = 0}^{n} P (S_{n} = k) P (X_{1} \leq c_{n} x + d_{n}, \dots, X_{k} \leq c_{n} x + d_{n}, X_{k + 1} \leq c_{n} y + d_{n}, \dots, X_{n} \leq c_{n} y + d_{n}) \\ = \sum_{k = 0}^{n} P (S_{n} = k) P (F (X_{1}) \leq F (c_{n} x + d_{n}), \dots, F (X_{k}) \leq F (c_{n} x + d_{n}), \\ F (X_{k + 1}) \leq F (c_{n} y + d_{n}), \dots, F (X_{n}) \leq F (c_{n} y + d_{n})) \end{array}$

$\begin{array}{l} = \sum_{k = 0}^{n} P (S_{n} = k) C_{n}^{ψ} (F (c_{n} x + d_{n}), \dots, F (c_{n} x + d_{n}), F (c_{n} y + d_{n}), \dots, F (c_{n} y + d_{n})) \\ = \sum_{k = 0}^{n} P (S_{n} = k) ψ^{- 1} (k ψ (F (c_{n} x + d_{n})) + (n - k) ψ (F (c_{n} y + d_{n}))) \\ = \sum_{k = 0}^{n} P (S_{n} = k) ψ^{- 1} (- \log (H^{k} (c_{n} x + d_{n}) H^{n - k} (c_{n} y + d_{n}))), \end{array}$ (9)

其中， $H (x) = \exp (- ψ \circ F (x))$ 。设 $0 < ε < λ$ ，则可将(9)右端改写为

$\begin{array}{l} \sum_{k = 0}^{n} P (S_{n} = k) ψ^{- 1} (- \log (H^{k} (c_{n} x + d_{n}) H^{n - k} (c_{n} y + d_{n}))) \\ = \sum_{k : | \frac{k}{n} - λ | > ε} P (S_{n} = k) ψ^{- 1} (- \log (H^{k} (c_{n} x + d_{n}) H^{n - k} (c_{n} y + d_{n}))) \\ + \sum_{k : | \frac{k}{n} - λ | \leq ε} P (S_{n} = k) ψ^{- 1} (- \log (H^{k} (c_{n} x + d_{n}) H^{n - k} (c_{n} y + d_{n}))) \\ = Σ_{1} + Σ_{2} \end{array}$ (10)

对第一项 $Σ_{1}$ ，注意到对任意 $x \in [0, \infty)$ ， $| ψ^{- 1} (x) | \leq 1$ ，进而利用条件(6)可得，当 $n \to \infty$ 时

$Σ_{1} \leq \sum_{k : | \frac{k}{n} - λ | > ε} P (S_{n} = k) \to 0$ . (11)

对第二项 $Σ_{2}$ ，利用 $ψ^{- 1}$ 的单调性可得

$Σ_{2} \leq ψ^{- 1} (- \log (H^{n (λ - ε)} (c_{n} x + d_{n}) H^{n - n (λ + ε)} (c_{n} y + d_{n}))) \sum_{k : | \frac{k}{n} - λ | \leq ε} P (S_{n} = k)$ (12)

和

$Σ_{2} \leq ψ^{- 1} (- \log (H^{n (λ + ε)} (c_{n} x + d_{n}) H^{n - n (λ - ε)} (c_{n} y + d_{n}))) \sum_{k : | \frac{k}{n} - λ | \leq ε} P (S_{n} = k)$ . (13)

注意到由(5)可知， $H \in D (G)$ ，进而结合(6)及(9)-(13)式，对于任意 $ε \in (0, λ)$ ，有

$\lim_{n \to \infty} \sup P ({\tilde{M}}_{n} \leq c_{n} x + d_{n}, M_{n} \leq c_{n} y + d_{n}) \leq ψ^{- 1} (- \log (G^{λ - ε} (x) G^{1 - λ - ε} (y)))$ ,(14)

$\lim_{n \to \infty} \inf P ({\tilde{M}}_{n} \leq c_{n} x + d_{n}, M_{n} \leq c_{n} y + d_{n}) \geq ψ^{- 1} (- \log (G^{λ + ε} (x) G^{1 - λ + ε} (y)))$ . (15)

最后，令 $ε \to 0$ ，则可得

$\lim_{n \to \infty} \sup P ({\tilde{M}}_{n} \leq c_{n} x + d_{n}, M_{n} \leq c_{n} y + d_{n}) \leq ψ^{- 1} (- \log (G^{λ} (x) G^{1 - λ} (y)))$ , (16)

$\lim_{n \to \infty} \inf P ({\tilde{M}}_{n} \leq c_{n} x + d_{n}, M_{n} \leq c_{n} y + d_{n}) \geq ψ^{- 1} (- \log (G^{λ} (x) G^{1 - λ} (y)))$ . (17)

故(7)得证。

4. 例子

在本节中，我们将给出满足定理3.1条件的几种类型的例子。

4.1. Weibull吸引场情形

首先，我们给出 $\exp (- ψ \circ F)$ 被吸引到Weibull分布的情形。

例4.1假设随机变量 $X_{1}, X_{2}, \dots, X_{n}$ 满足定理3.1中的条件。进一步假设其服从区间 $(0, 1)$ 上的均匀分布。

(i). 在Gumbel copula条件下，其中 $ψ (t) = {(- \log t)}^{α}, α \geq 1$ ，则对 $0 < x < y < \infty$ 有

$\lim_{n \to \infty} P ({\tilde{M}}_{n} \leq 1 - n^{- 1 / α} x, M_{n} \leq 1 - n^{- 1 / α} y) = \exp {- {(λ x^{α} + (1 - λ) y^{α})}^{\frac{1}{α}}}$ .

(ii). 在Clayton copula条件下，其中 $ψ (t) = t^{- α} - 1, α > 0$ ，则对 $0 < x < y < \infty$ 有

$\lim_{n \to \infty} P ({\tilde{M}}_{n} \leq 1 - {(α n)}^{- 1} x, M_{n} \leq 1 - {(α n)}^{- 1} y) = {(1 + λ x + (1 - λ) y)}^{- \frac{1}{α}}$ .

证明: (i). 考虑 $t > 0$ ，注意到 $ψ (t) = {(- \log t)}^{α}, α \geq 1$ 以及 $(0, 1)$ 上均匀分布的上尾端点为1，则

$\lim_{x \to \infty} \frac{ψ \circ F (1 - {(x t)}^{- 1})}{ψ \circ F (1 - x^{- 1})} = \lim_{x \to \infty} \frac{ψ (1 - {(x t)}^{- 1})}{ψ (1 - x^{- 1})} = \lim_{x \to \infty} \frac{{(- \log (1 - {(x t)}^{- 1}))}^{α}}{{(- \log (1 - x^{- 1}))}^{α}} = \lim_{x \to \infty} \frac{{(x t)}^{- α}}{x^{- α}} = t^{- α}$ .

因此，由文 [17] 中定理4.4可知 $\exp (- ψ \circ F) \in D (Ψ_{α})$ ，进而由文 [17] 中推论4.5可知其正则化常数 $c_{n} = n^{- 1 / α}, d_{n} = 1$ 代入(7)可证(i)成立。

(ii). 考虑 $t > 0$ ，注意到 $ψ (t) = t^{- α} - 1, α > 0$ ，则

$\lim_{x \to \infty} \frac{ψ \circ F (1 - {(x t)}^{- 1})}{ψ \circ F (1 - x^{- 1})} = \lim_{x \to \infty} \frac{ψ (1 - {(x t)}^{- 1})}{ψ (1 - x^{- 1})} = \lim_{x \to \infty} \frac{{(1 - {(x t)}^{- 1})}^{- α} - 1}{{(1 - x^{- 1})}^{- α} - 1} = \lim_{x \to \infty} \frac{α {(x t)}^{- 1}}{α x^{- 1}} = t^{- 1}$ .

因此，由文 [17] 中定理4.4可知 $\exp (- ψ \circ F) \in D (Ψ_{1})$ ，进而由文 [17] 中推论4.5可知其正则化常数 $c_{n} = {(α n)}^{- 1}, d_{n} = 1$ 。代入(7)可证(ii)成立。

4.2. Fréchet吸引场情形

其次，我们给出 $\exp (- ψ \circ F)$ 被吸引到Fréchet分布的情形。

例4.2假设随机变量 $X_{1}, X_{2}, \dots, X_{n}$ 满足定理3.1中的条件。进一步假设其边际分布为Pareto分布，即对于 $K, β > 0$ ，当 $x \to \infty$ 时，有 $1 - F (x) ~ K x^{- β}$ 。

(i). 在Gumbel copula条件下，其中 $ψ (t) = {(- \log t)}^{α}, α \geq 1$ ，则对 $0 < x < y < \infty$ 有

$\lim_{n \to \infty} P ({\tilde{M}}_{n} \leq K^{\frac{1}{β}} n^{\frac{1}{α β}} x, M_{n} \leq K^{\frac{1}{β}} n^{\frac{1}{α β}} y) = \exp {- {(λ x^{- α β} + (1 - λ) y^{- α β})}^{\frac{1}{α}}}$ .

(ii). 在Clayton copula条件下，其中 $ψ (t) = t^{- α} - 1, α > 0$ ，则对 $0 < x < y < \infty$ 有

$\lim_{n \to \infty} P ({\tilde{M}}_{n} \leq {(α K n)}^{\frac{1}{β}} x, M_{n} \leq {(α K n)}^{\frac{1}{β}} y) = {(1 + λ x^{- β} + (1 - λ) y^{- β})}^{- \frac{1}{α}}$ .

证明: (i). 由文 [17] 中(5.11)可知 $\exp (- ψ \circ F) \in D (Φ_{α β})$ 且其正则化常数 $c_{n} = K^{\frac{1}{β}} n^{\frac{1}{α β}}, d_{n} = 0$ 。代入(7)可证(i)成立。

(ii). 考虑 $t > 0$ ，注意到 $ψ (t) = t^{- α} - 1, α > 0$ ，则

$\lim_{x \to \infty} \frac{ψ \circ F (x t)}{ψ \circ F (x)} = \lim_{x \to \infty} \frac{ψ (1 - K {(x t)}^{- β})}{ψ (1 - K {(x)}^{- β})} = \lim_{x \to \infty} \frac{{(1 - K {(x t)}^{- β})}^{- α} - 1}{{(1 - K {(x)}^{- β})}^{- α} - 1} = \lim_{x \to \infty} \frac{1 + α K {(x t)}^{- β} - 1}{1 + α K {(x)}^{- β} - 1} = t^{- β}$ .

因此 $\exp (- ψ \circ F) \in D (Φ_{β})$ 。由文 [12] 中推论4.5可知其正则化常数 $c_{n} = {(α K n)}^{\frac{1}{β}}$ ， $d_{n} = 0$ 。代入(7)可证(ii)成立。

4.3. Gumbel吸引场情形

最后，我们给出 $\exp (- ψ \circ F)$ 被吸引到Gumbel分布的情形。

例4.3在定理3.1的条件下，设随机变量 $X_{1}, X_{2}, \dots, X_{n}$ 的边际分布为参数为 $θ$ 的指数分布，其中 $θ > 0$ 。

(i). 在Gumbel copula条件下，其中 $ψ (t) = {(- \log t)}^{α}, α \geq 1$ ，则对 $x < y \in ℝ$ 有

$\lim_{n \to \infty} P ({\tilde{M}}_{n} \leq c_{n} x + d_{n}, M_{n} \leq c_{n} y + d_{n}) = \exp {- {(λ e^{- x} + (1 - λ) e^{- y})}^{\frac{1}{α}}}$ ,

其中， $c_{n} = \frac{1}{θ α}, d_{n} = \frac{1}{θ α} \log n$ 。

(ii). 在Clayton copula条件下，其中 $ψ (t) = t^{- α} - 1, α > 0$ ，则对 $x < y \in ℝ$ 有

$\lim_{n \to \infty} P ({\tilde{M}}_{n} \leq c_{n} x + d_{n}, M_{n} \leq c_{n} y + d_{n}) = {(1 + λ e^{- x} + (1 - λ) e^{- y})}^{- \frac{1}{α}}$ ,

其中， $c_{n} = \frac{1}{θ}, d_{n} = \frac{1}{θ} \log (n α)$ 。

证明：(i). 由文 [17] 中(5.17)可知 $\exp (- ψ \circ F) \in D (Λ)$ 且其正则化常数 $c_{n} = \frac{1}{θ α}, d_{n} = \frac{1}{θ α} \log n$ 。代入(7)可证(i)成立。

(ii). 考虑 $t > 0$ ，注意到 $ψ (t) = t^{- α} - 1, α > 0$ ，则

$\begin{matrix} \lim_{x \to \infty} \frac{ψ \circ F (x t)}{ψ \circ F (x)} = \lim_{x \to \infty} \frac{ψ (1 - \exp (- θ x t))}{ψ (1 - \exp (- θ x))} = \lim_{x \to \infty} \frac{{(1 - \exp (- θ x t))}^{- α} - 1}{{(1 - \exp (- θ x))}^{- α} - 1} \\ = \lim_{x \to \infty} \frac{1 + α \exp (- θ x t) - 1}{1 + α \exp (- θ x) - 1} = \exp (- θ x (t - 1)) \in {0, 1, \infty} . \end{matrix}$

因此 $\exp (- ψ \circ F) \in D (Λ)$ 。由文 [17] 中推论4.5可知其正则化常数 $c_{n} = \frac{1}{θ}, d_{n} = \frac{1}{θ} \log (n α)$ 。

例4.4在定理3.1的条件下，设随机变量 $X_{1}, X_{2}, \dots, X_{n}$ 的边际分布为标准正态序列。

(i). 在Gumbel copula条件下，其中 $ψ (t) = {(- \log t)}^{α}, α \geq 1$ ，则有

$\lim_{n \to \infty} P ({\tilde{M}}_{n} \leq c_{n} x + d_{n}, M_{n} \leq c_{n} y + d_{n}) = \exp {- {(λ e^{- x} + (1 - λ) e^{- y})}^{\frac{1}{α}}}$ , (18)

其中 $c_{n} = {(2 α \log n)}^{- \frac{1}{2}}$ ， $d_{n} = {(2 α^{- 1} \log n)}^{\frac{1}{2}} - \frac{α}{2} {(2 α \log n)}^{- \frac{1}{2}} (\log 4 π + \log \log n - \log α)$ 。

(ii). 在Clayton copula条件下，其中 $ψ (t) = t^{- α} - 1, α > 0$ ，则有

$\lim_{n \to \infty} P ({\tilde{M}}_{n} \leq c_{n} x + d_{n}, M_{n} \leq c_{n} y + d_{n}) = {(1 + λ e^{- x} + (1 - λ) e^{- y})}^{- \frac{1}{α}}$ , (19)

其中， $c_{n} = {(2 \log n)}^{- \frac{1}{2}}$ ， $d_{n} = {(2 \log n)}^{\frac{1}{2}} - \frac{1}{2} {(2 \log n)}^{- \frac{1}{2}} (\log (4 π α^{- 2}) + \log \log n)$ 。

证明：(i). 考虑 $t > 0$ ，注意到 $ψ (t) = {(- \log t)}^{α}$ ，则

$\begin{matrix} \lim_{x \to \infty} \frac{ψ \circ Φ (x t)}{ψ \circ Φ (x)} = \lim_{x \to \infty} \frac{ψ (1 - (1 - Φ (x t)))}{ψ (1 - (1 - Φ (x)))} = \lim_{x \to \infty} \frac{{(1 - Φ (x t))}^{α}}{{(1 - Φ (x))}^{α}} \\ = \lim_{x \to \infty} \frac{{(x t)}^{- α} {(2 π)}^{- α / 2} e^{- α {(t x)}^{2} / 2}}{{(x)}^{- α} {(2 π)}^{- α / 2} e^{- α x^{2} / 2}} = \lim_{x \to \infty} t^{- α} e^{- α x^{2} (t^{2} - 1) / 2} \in {0, 1, \infty} . \end{matrix}$

因此，由文 [17] 中定理4.4可得 $\exp (- ψ \circ Φ) \in D (Λ)$ 。设 $u_{n} = u_{n} (x) = c_{n} x + d_{n}$ ，进而有 $n ψ \circ Φ (u_{n}) ~ e^{- x}$ ，即 $n {(1 - Φ (u_{n}))}^{α} ~ e^{- x}$ 。根据文 [1] 中(1.5.4)式可知，当 $n \to \infty$ 时

$\frac{e^{- x} u_{n}^{α}}{n {(ϕ (u_{n}))}^{α}} \to 1$ . (20)

对上式两边取对数，有

$- \log n - x + α \log u_{n} + \frac{α}{2} \log 2 π + \frac{α u_{n}^{2}}{2} \to 0$ . (21)

进而有 $\frac{α u_{n}^{2}}{2 \log n} \to 1$ ，故

$\log u_{n} = \frac{1}{2} (\log 2 + \log \log n - \log α) + o (1)$ . (22)

将(22)代入(21)可得

$\frac{u_{n}^{2}}{2} = \frac{1}{α} x + \frac{1}{α} \log n - \frac{1}{2} \log 4 π - \frac{1}{2} \log \log n + \frac{1}{2} \log α + o (1)$ . (23)

则

$u_{n} = {(2 α^{- 1} \log n)}^{\frac{1}{2}} {1 + \frac{x - \frac{α}{2} \log 4 π - \frac{α}{2} \log \log n + \frac{α}{2} \log α}{2 \log n} + o (\frac{α}{\log n})}$ . (24)

而 $u_{n} = c_{n} x + d_{n}$ ，故(i)得证。

(ii). 考虑 $t > 0$ ，注意到 $ψ (t) = t^{- α} - 1, α > 0$ ，则

$\begin{matrix} \lim_{x \to \infty} \frac{ψ \circ Φ (x t)}{ψ \circ Φ (x)} = \lim_{x \to \infty} \frac{ψ (1 - (1 - Φ (x t)))}{ψ (1 - (1 - Φ (x)))} = \lim_{x \to \infty} \frac{(1 - Φ (x t))}{(1 - Φ (x))} \\ = \lim_{x \to \infty} \frac{{(x t)}^{- 1} {(2 π)}^{- 1 / 2} e^{- {(t x)}^{2} / 2}}{x^{- 1} {(2 π)}^{- 1 / 2} e^{- x^{2} / 2}} = \lim_{x \to \infty} t^{- 1} e^{- x^{2} (t^{2} - 1) / 2} \in {0, 1, \infty} . \end{matrix}$

因此，由文 [17] 中定理4.4可得 $\exp (- ψ \circ Φ) \in D (Λ)$ 。设 $u_{n} = u_{n} (x) = c_{n} x + d_{n}$ ，进而有 $n ψ \circ Φ (u_{n}) ~ e^{- x}$ ，即 $n α (1 - Φ (u_{n})) ~ e^{- x}$ 。余下的证明与(i)完全一样，故略去。

由于只有少数的统计工具可以用来测试依赖性结构，因此很难将满足Copula结构的数据匹配到真实数据中，此时则需通过考虑它们的渐近行为。本文研究了随机缺失情形下，随机序列满足阿基米德Copula相依结构完全样本与非完全样本极值的联合渐近分布，将满足阿基米德Copula相依结构极值的分布推广到了随机缺失的情形下。结果(2)表明在弱相依情形下，非完全样本极值与完全样本极值之间是渐近独立；而本文定理3.1则是在另一种相依情形下表明了非完全样本极值与完全样本极值之间的渐近关系取决于生成元 $ψ$ 。

在Copula结构下，运用本文中的方法考虑完全样本极值次序统计量和非完全样本极值次序统计量的渐近关系以及将常数 $λ$ 推广到随机变量情形是不成立的。今后我将会继续研究满足阿基米德Copula相依结构下的相关问题，解决上述所提到的问题，且会更加全面地考虑问题，更好地将理论与实践结合在一起。

基金项目

本文受浙江省自然科学基金(编号LY18A010020)。

参考文献

参考文献

[1]	Leadbetter, M.R., Lindgren, G. and Rootze ́n, H. (1983) Extremes and Related Properties of Random Sequences and Processes. Springer-Verlag, NewYork. [Google Scholar] [CrossRef]
[2]	Mladenovic ́, P. and Piterbarg, V. (2006) On Asymptotic Distribution of Maxima of Complete and Incomplete Samples from Stationary Sequences. Stochastic Processes and Their Applications, 116, 1977-1991. [Google Scholar] [CrossRef]
[3]	Cao, L. and Peng, Z. (2011) Asymptotic Distributions of Maxima of Complete and Incomplete Samples from Strongly Dependent Stationary Gaussian Sequences. Applied Mathematics Letters, 24, 243-247. [Google Scholar] [CrossRef]
[4]	Hashorva, E., Peng, Z. and Weng, Z. (2013) On Piterbarg Theorem for Maxima of Stationary Gaussian Sequences. Lithuanian Mathematical Journal, 53, 280-292. [Google Scholar] [CrossRef]
[5]	Peng, Z., Cao, L. and Nadarajah, S. (2010) Asymptotic Distributions of Maxima of Complete and Incomplete Samples from Multivariate Stationary Gaussian Sequences. Journal of Multivariate Analysis, 101, 2641-2647. [Google Scholar] [CrossRef]
[6]	Peng, Z., Tong, J. and Weng, Z. (2019) Exceedances Point Processes in the Plane of Stationary Gaussian Sequences with Data Missing. Statistics and Probability Letters, 149, 73-79. [Google Scholar] [CrossRef]
[7]	Glavaš, L., Mladenović, P. and Samorodnitsky, G. (2017) Extreme Values of the Uniform Order 1 Autoregressive Processes and Missing Observations. Extremes, 20, 671-690. [Google Scholar] [CrossRef]
[8]	Glavaš, L. and Mladenović, P. (2020) Extreme Values of Linear Processes with Heavy-Tailed Innovations and Missing Observations. Extremes, 23, 547-567. [Google Scholar] [CrossRef]
[9]	Panga, Z. and Pereira, L. (2018) On the Maxima and Minima of Complete and Incomplete Samples from Nonstationary Random Fields. Statistics and Probability Letters, 137, 124-134. [Google Scholar] [CrossRef]
[10]	Zheng, S. and Tan, Z. (2023) On the Maxima of Nonstationary Random Fields Subject to Missing Observations. Communications in Statistics-Theory and Methods. [Google Scholar] [CrossRef]
[11]	Peng, Z., Wang, P. and Nadarajah, S. (2009) Limiting Distributions and Almost Sure Limit Theorems for the Normalized Maxima of Complete and Incomplete Samples from Gaussian Sequence. Electronic Journal of Statistics, 3, 851-864. [Google Scholar] [CrossRef]
[12]	Dudzinski, M. (2017) Some Applications of the Archimedean Copulas in the Proof of the Almost Sure Central Limit Theorem for Ordinary Maxima. Open Mathematics, 15, 1024-1034. [Google Scholar] [CrossRef]
[13]	Tong, B. and Peng, Z. (2011) On Almost Sure Max-Limit Theorems of Complete and Incomplete Samples from Stationary Sequences. Acta Mathematica Sinica (English Series), 27, 1323-1332. [Google Scholar] [CrossRef]
[14]	Krajka, T. (2011) The Asymptotic Behaviour of Maxima of Complete and Incomplete Samples from Statioanry Sequences. Stochastic Processes and Their Applications, 121, 1705-1719. [Google Scholar] [CrossRef]
[15]	谭中权. 连续与离散时间Gauss次序统计过程的极值[J]. 中国科学(数学), 2018, 48(5): 623-642.
[16]	Nelsen, R.B. (1999) An Introduction to Copulas. Springer, New York. [Google Scholar] [CrossRef]
[17]	Wu ̈thrich, M.V. (2004) Extreme Value Theory and Archimedean Copulas. Scandinavian Actuarial Journal, 104, 211-228. [Google Scholar] [CrossRef]
[18]	Dudzinski, M. and Furmanczyk, K. (2017) On Some Applications of the Archimedean Copulas in the Proofs of the Almost Sure Central Limit Theorems for Certain Order Statistics. Bulletin of the Korean Mathematical Society, 54, 839-874. [Google Scholar] [CrossRef]
[19]	Dudzinski, M. (2017) Some Applications of the Archimedean Copulas in the Proof of the Almost Sure Central Limit Theorem for Ordinary Maxima. Open Mathematics, 15, 1024-1034. [Google Scholar] [CrossRef]

为你推荐

友情链接