带有函数型变量的不等概抽样及其应用
Unequal Probability Sampling Based on Functional Variables and Its Application
DOI: 10.12677/sa.2024.135187, PDF,    科研立项经费支持
作者: 刘高生*:天津商业大学,理学院,天津;魏永怡:华中师范大学,数学与统计学学院,湖北 武汉
关键词: 函数型变量切片逆回归方法不等概抽样共享单车应用Functional Variable Unequal Probability Sampling Slice Inverse Regression Method Shared Bicycle Application
摘要: 随着计算机的发展,函数型变量的数据收集变得越来越容易,传统的不等概抽样方法仅考虑了标量型的辅助变量,同时在不等概抽样方法中确定总体单元入样概率并不容易,为此提出了一种带有函数型变量的不等概抽样方法,这种不等概率抽样方法从样本数据中提取信息构造总体单元入样概率,克服了确定每个总体单元入样概率的困难。模拟结果表明带有函数型变量的不等概抽样估计结果优于简单随机抽样估计结果。最后将其应用到共享单车数据中得到带有函数型变量的不等概抽样的良好表现。
Abstract: With the development of computers, data collection for functional variables has become increasingly easy. Traditional inequality sampling methods only consider scalar auxiliary variables, and it is not easy to determine the sampling probability of population units in inequality sampling methods. Therefore, unequal Probability sampling method with functional variables is proposed. This unequal probability sampling method extracts information from sample data to construct the sampling probability of population units, overcoming the difficulty of determining the sampling probability of each population unit. The simulation results show that the estimation results of unequal probability sampling with functional variables are better than those of simple random sampling estimation. Finally, the method is applied to shared bicycle data and achieves good performance in unequal probability sampling with functional variables.
文章引用:刘高生, 魏永怡. 带有函数型变量的不等概抽样及其应用[J]. 统计学与应用, 2024, 13(5): 1930-1937. https://doi.org/10.12677/sa.2024.135187

参考文献

[1] 金勇进, 刘晓宇. 大数据背景下的抽样调查[J]. 系统科学与数学, 2022, 42(1): 2-16.
[2] 邓明, 张荷观. 利用比率估计方法估计季节指数[J]. 统计与决策, 2007, 23(3): 16-17.
[3] 贺建风, 石立. 大数据情境下基于切片逆回归的抽样方法研究[J]. 广西师范大学学报(自然科学版), 2022, 40(1): 91-99.
[4] Ramsay, J.O. (1982) When the Data Are Functions. Psychometrika, 47, 379-396. [Google Scholar] [CrossRef
[5] Ramsay, J.O. and Dalzell, C.J. (1991) Some Tools for Functional Data Analysis. Journal of the Royal Statistical Society Series B: Statistical Methodology, 53, 539-561. [Google Scholar] [CrossRef
[6] Karuppusami, R., Antonisamy, B. and Premkumar, P.S. (2022) Functional Principal Component Analysis for Identifying the Child Growth Pattern Using Longitudinal Birth Cohort Data. BMC Medical Research Methodology, 22, Article No. 76. [Google Scholar] [CrossRef] [PubMed]
[7] Shi, H., Jiang, S. and Cao, J. (2022) Dynamic Prediction with Time‐Dependent Marker in Survival Analysis Using Supervised Functional Principal Component Analysis. Statistics in Medicine, 41, 3547-3560. [Google Scholar] [CrossRef] [PubMed]
[8] 宁贺. 函数型主成分分析的研究及其应用[D]: [硕士学位论文]. 长春: 吉林大学, 2021.
[9] Li, K. (1991) Sliced Inverse Regression for Dimension Reduction. Journal of the American Statistical Association, 86, 316-327. [Google Scholar] [CrossRef
[10] 李岩岩. SIR降维方法与半参数可加回归的应用研究[D]: [硕士学位论文]. 重庆: 重庆工商大学, 2016.
[11] 荀鹏程, 钱国华, 赵杨, 于浩, 陈峰. 高维生物学数据两阶段组合降维策略研究[J]. 中国卫生统计, 2012, 29(5): 626-629.
[12] Liquet, B. and Saracco, J. (2016) BIG-SIR: A Sliced Inverse Regression Approach for Massive Data. Statistics and Its Interface, 9, 509-520. [Google Scholar] [CrossRef
[13] 刘高生, 柏杨, 余平. 部分函数型线性空间自回归模型的假设检验[J]. 数学学报(中文版), 2023, 66(2): 239-252.