Research on Multidimensional Online Calibration Design Based on D-Optimal and A-Optimal Designs
Abstract: Multidimensional computerized adaptive testing (MCAT) has received increasing attention in educational measurement in recent years. As with all other CATs, item replenishment is an important component of MCAT item bank maintenance and management. Bank managers need to regularly retire overexposed or outdated items and replace them with new ones. In unidimensional CAT (UCAT), online calibration techniques have been used to calibrate new items efficiently. However, the literature contains little discussion of online calibration for MCAT. Building on existing UCAT work, this study therefore extends the D-optimal and A-optimal online calibration designs from UCAT to the multidimensional setting. A computer simulation experiment was conducted to examine how sample size and the correlation between ability dimensions affect the D-optimal and A-optimal designs. The results showed that the averaged total information and the information on the item parameters decreased as the correlation between ability dimensions increased. When the correlation was 0.2 or 0.5, both quantities first increased and then decreased as the number of new items answered increased; when the correlation was 0.8, they remained almost unchanged. The D-optimal and A-optimal designs had little effect on these results.
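The abstract names the two design criteria without stating them. As a rough illustration only, the sketch below uses the standard textbook definitions: a D-optimal design maximizes the determinant of the Fisher information matrix of the item parameters, and an A-optimal design minimizes the trace of its inverse. It assumes a two-dimensional compensatory M2PL item with parameters (a1, a2, d); the model choice, the function names, and the numeric values are all assumptions made for this example and are not taken from the article.

```python
import math


def m2pl_prob(theta, a, d):
    """P(correct) under an assumed compensatory M2PL model: logistic(a . theta + d)."""
    z = sum(ak * tk for ak, tk in zip(a, theta)) + d
    return 1.0 / (1.0 + math.exp(-z))


def item_information(thetas, a, d):
    """3x3 Fisher information for (a1, a2, d), summed over the examinees in `thetas`."""
    info = [[0.0] * 3 for _ in range(3)]
    for theta in thetas:
        p = m2pl_prob(theta, a, d)
        weight = p * (1.0 - p)
        x = (theta[0], theta[1], 1.0)  # gradient of the logit w.r.t. (a1, a2, d)
        for i in range(3):
            for j in range(3):
                info[i][j] += weight * x[i] * x[j]
    return info


def d_criterion(m):
    """Determinant of a 3x3 matrix; larger is better under D-optimality."""
    return (m[0][0] * (m[1][1] * m[2][2] - m[1][2] * m[2][1])
            - m[0][1] * (m[1][0] * m[2][2] - m[1][2] * m[2][0])
            + m[0][2] * (m[1][0] * m[2][1] - m[1][1] * m[2][0]))


def a_criterion(m):
    """Trace of the inverse of a 3x3 matrix; smaller is better under A-optimality."""
    # The diagonal of the inverse is the principal cofactors divided by the determinant.
    c00 = m[1][1] * m[2][2] - m[1][2] * m[2][1]
    c11 = m[0][0] * m[2][2] - m[0][2] * m[2][0]
    c22 = m[0][0] * m[1][1] - m[0][1] * m[1][0]
    return (c00 + c11 + c22) / d_criterion(m)


if __name__ == "__main__":
    # Hypothetical ability vectors of examinees who answered one new item.
    sample = [(-1.0, 0.5), (0.0, 0.0), (1.0, -0.5), (0.5, 1.5)]
    info = item_information(sample, a=(1.2, 0.8), d=-0.3)
    print("D criterion (determinant):", d_criterion(info))
    print("A criterion (trace of inverse):", a_criterion(info))
```

Both criteria are computed from the same information matrix, so a design that compares candidate examinee samples only needs to recompute `item_information` for each candidate and rank the results by the chosen criterion.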
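Citation: Yang Sen, He Yinhong, Qi Yuanyuan. Research on Multidimensional Online Calibration Design Based on D-Optimal and A-Optimal Designs[J]. Advances in Applied Mathematics, 2023, 12(1): 81-95. https://doi.org/10.12677/AAM.2023.121011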
