多水平混合IRT模型及其发展与应用
Multilevel Mixture IRT Model and Its Development and Applications
DOI: 10.12677/AP.2021.116160, PDF,  被引量    科研立项经费支持
作者: 齐媛媛, 陈德枝*:浙江师范大学杭州幼儿师范学院,浙江 杭州
关键词: 多水平混合IRT模型IRT潜在类别分析多水平分析Multilevel Mixture IRT Model IRT Latent Category Analysis Multi-Level Analysis
摘要: 多水平混合IRT模型(MMIRTM)将IRT与潜在类别分析和阶层线性模型相结合,能够同时对嵌套在多水平下的被试分类并量化其潜在特质。它是近年来教育心理测量学的研究热点与重点之一。在梳理多水平混合IRT模型的发展由来、基本概念和原理的基础上,对多水平混合IRT模型在项目功能差异检测等领域的应用做了相关阐述,并对当前多水平混合IRT模型的应用与拓展进行了述评与展望。
Abstract: The Multilevel Mixture IRT Model (MMIRTM) combines IRT with latent category analysis and hie-rarchical linear model, which can simultaneously classify and quantify the potential characteris-tics of subjects nested in multiple levels. It is one of the research hotspots and key points of educational psychometrics in recent years. On the basis of sorting out the development origin, basic concepts and principles of the multilevel mixture IRT model, this paper elaborates the application of the multilevel mixture IRT model in the field of Differential Item Functioning (DIF) detection and other fields, and makes an review and outlook on the application and expansion of the current multilevel mixture IRT model.
文章引用:齐媛媛, 陈德枝 (2021). 多水平混合IRT模型及其发展与应用. 心理学进展, 11(6), 1428-1437. https://doi.org/10.12677/AP.2021.116160

参考文献

[1] 曾秀芹, 孟庆茂(1999). 项目功能差异及其检测方法. 心理学动态, (2), 41-47+57.
[2] 付志慧(2010). 多维项目反应模型的参数估计. 博士学位论文, 长春: 吉林大学.
[3] 黄明明, 王立君(2015). 混合IRT模型的研究进展与应用. 考试研究, (4), 61-68.
[4] 李美娟, 刘玥, 刘红云(2020). 计算机动态测验中问题解决过程策略的分析: 多水平混合IRT模型的拓展与应用. 心理学报, 52(4), 528-540.
[5] 刘红云, 骆方(2008). 多水平项目反应理论模型在测验发展中的应用. 心理学报, 40(1), 92-100.
[6] 刘慧, 简小珠, 张敏强, 等(2012). 多水平IRT的发展与应用述评. 心理科学进展, 20(4), 627-632.
[7] 罗照盛(2012). 项目反应理论基础(p. 4). 北京: 北京师范大学出版社.
[8] 骆方, 张厚粲(2006). 检验项目功能差异的两类方法——CFA和IRT的比较. 心理学探新, 26(1), 74-78.
[9] 马文超, 边玉芳, 骆方(2012). 网络成瘾的潜在结构: 连续的还是分类的? 心理发展与教育, 28(5), 554-560.
[10] 邱皓政(2008). 潜在类别模型的原理与技术(p. 29). 北京: 教育科学出版社.
[11] 王霞, 谭国华, 王旭, 张敏强, 骆聪(2014). 混合IRT潜在模型及其应用轨迹. 心理科学进展, 22(3), 540-548.
[12] 温福星(2009). 阶层线性模型的原理与应用. 北京: 中国轻工业出版社.
[13] 张洁婷, 焦璨, 张敏强(2010). 潜在类别分析技术在心理学研究中的应用. 心理科学进展, 18(12), 1991-1998.
[14] 周韵, 译(2013). 项目功能差异(第2版) (pp. 93-99). 上海: 格致出版社, 上海人民出版社. (Steven, J. O., & Howard, T. E., 2013).
[15] Ackerman, T. A. (1994). Using Multidimensional Item Response Theory to Understand What Items and Tests Are Measuring. Applied Measurement in Education, 7, 255-278.[CrossRef
[16] Adams, R. J., Wilson, M., & Wu, M. (1997). Multilevel Item Response Models: An Approach to Errors in Variables Regression. Journal of Educational and Behavioral Statistics, 22, 47-76.[CrossRef
[17] Anthony, S. B., & Stephen, W. R. (1992). Hierarchical Linear Models: Applications and Data Analysis Methods (p. 32). London: Sage Publication.
[18] Austin, E. J., Deary, I. J., & Egan, V. (2006). Individual Differences in Response Scale Use: Mixed Rasch Modelling of Responses to NEO-FFI Items. Personality and Individual Differences, 40, 1235-1245.[CrossRef
[19] Bacci, S., & Gnaldi, M. (2015). A Classification of University Courses Based on Students’ Satisfaction: An Application of a Two-Level Mixture Item Response Model. Quality & Quantity: International Journal of Methodology, 49, 927-940.[CrossRef
[20] Bennink, M., Croon, M. A., Keuning, J., & Vermunt, J. K. (2014). Measuring Student Ability, Classifying Schools, and Detecting Item Bias at School Level, Based on Student-Level Di-chotomous Items. Journal of Educational and Behavioral Statistics, 39, 180-202.[CrossRef
[21] Cheong, Y. F. (2006). Analysis of School Context Effects on Differential Item Functioning Using Hierarchical Generalized Linear Models. International Journal of Testing, 6, 57-79.[CrossRef
[22] Cheong, Y. F., & Raudenbush, S. W. (2000). Measurement and Structural Models for Children’s Problem Behaviors. Psychological Methods, 5, 477-495.[CrossRef
[23] Cho, S.-J., & Cohen, A. S. (2010). A Multilevel Mixture IRT Model with an Application to DIF. Journal of Educational and Behavioral Statistics, 35, 336-370.[CrossRef
[24] Cho, S.-J., Cohen, A. S., & Bottge, B. (2013). Detecting Intervention Effects Using a Multilevel Latent Transition Analysis with a Mixture IRT Model. Psychometrika, 78, 576-600.[CrossRef] [PubMed]
[25] Chu, K. L., & Kamata, A. (2000). Nonequivalent Group Equating via 1-P HGLLM. The Annual Meeting of the American Educational Research Association, New Orleans, 24-28 April 2000.
[26] Chu, K. L., & Kamata, A. (2005). Test Equating in the Presence of DIF Items. Journal of Applied Measurement, 6, 342-354.
[27] Chung, S., & Houts, C. (2020). flexMIRT: A Flexible Modeling Package for Multidimensional Item Response Models. Measurement: Interdisciplinary Research and Perspectives, 18, 40-54.[CrossRef
[28] Cohen, A. S., & Bolt, D. M. (2005). A Mixture Model Analysis of Differential Item Functioning. Journal of Educational Measurement, 42, 133-148.[CrossRef
[29] de Jong, M. G., & Steenkamp, J.-B. E. M. (2010). Finite Mixture Multilevel Multidimensional Ordinal IRT Models for Large Scale Cross-Cultural Research. Psychometrika, 75, 3-32.[CrossRef
[30] Finch, W. H., & Finch, M. E. H. (2013). Investigation of Specific Learning Disability and Testing Accommodations Based Differential Item Functioning Using a Multilevel Multidimensional Mixture Item Response Theory Model. Educational and Psychological Measurement, 73, 973-993.[CrossRef
[31] Fox, J.-P. (2004). Applications of Multilevel IRT Modeling. School Effectiveness and School Improvement, 15, 261-280.[CrossRef
[32] Fox, J.-P. (2005). Multilevel IRT Using Dichotomous and Polytomous Response Data. The British Journal of Mathematical and Statistical Psychology, 58, 145-172.[CrossRef
[33] Fox, J.-P., & Glas, C. A. W. (2003). Bayesian Modeling of Measurement Error in Predictor Variables Using Item Response Theory. Psychometrika, 68, 169-191.[CrossRef
[34] Jang, Y., Kim, S. H., & Cohen, A. S. (2018). The Impact of Multidimensionality on Extraction of Latent Classes in Mixture Rasch Models. Journal of Educational Measurement, 55, 403-420.[CrossRef
[35] Jilke, S., Meuleman, B., & Walle, S. V. D. (2015). We Need to Compare, but How? Measurement Equivalence in Comparative Public Administration. Public Administration Review, 75, 36-48.[CrossRef
[36] Kamata, A. (2001). Item Analysis by the Hierarchical Generalized Linear Model. Journal of Educational Measurement, 38, 79-93.[CrossRef
[37] Lazarsfeld, P. F., & Henry, N. W. (1968). Latent Structure Analysis (pp. 10-11). Boston, MA: Houghton Mifflin.
[38] Lee, W. Y., Cho, S. J., & Sterba, S. K. (2018). Ignoring a Multilevel Structure in Mixture Item Response Models: Impact on Parameter Recovery and Model Selection. Applied Psychological Measurement, 42, 136-154.[CrossRef] [PubMed]
[39] Liu, H., Liu, Y., & Li, M. (2018). Analysis of Process Data of PISA 2012 Computer-Based Problem Solving: Application of the Modified Multilevel Mixture IRT Model. Frontiers in Psychology, 9, 1372.[CrossRef] [PubMed]
[40] Lord, F. M. (1980). Applications of Item Response Theory to Practical Testing Problems (pp. 16-20). Mahwah, NJ: Lawrence Erlbaum.
[41] Lu, J., Zhang, J., & Tao, J. (2018). Slice-Gibbs Sampling Algorithm for Estimating the Parameters of a Multilevel Item Response Model. Journal of Mathematical Psychology, 82, 12-25.[CrossRef
[42] Raudenbush, S. W., Johnson, C., & Sampson, R. J. (2003). A Multivariate, Multilevel Rasch Model with Application to Self-Reported Criminal Behavior. Sociological Methodology, 33, 169-211.[CrossRef
[43] Reckase, M. D. (1985). The Difficulty of Test Items That Measure More than One Ability. Applied Psychological Measurement, 9, 401-412.[CrossRef
[44] Reckase, M. D. (2007). Multidimensional Item Response Theory (pp. 60-63). New York: Springer.
[45] Rost, J. (1990). Rasch Models in Latent Classes: An Integration of Two Approaches to Item Analysis. Applied Psychological Measurement, 14, 271-282.[CrossRef
[46] Smit, A., Kelderman, H., & Flier, H. V. D. (2003). Latent Trait Latent Class Analysis of an Eysenck Personality Questionnaire. MPR-Online, 8, 23-50.
[47] Spiegelhalter, D., Thomas, A., & Best, N. (2003). WinBUGS (Version 1.4) [Computer Program] (pp. 31-42). Cambridge: MRC Biostatistics Unit, Institute of Public Health.
[48] Tay, L., Diener, E., Drasgow, F., & Vermunt, J. K. (2011). Multilevel Mixed-Measurement IRT Analysis: An Explication and Application to Self-Reported Emotions across the World. Organizational Research Methods, 14, 177-207.[CrossRef
[49] Varriale, R., & Vermunt, J. K. (2012). Multilevel Mixture Factor Models. Multivariate Behavioral Research, 47, 247-275.[CrossRef] [PubMed]
[50] Vermunt, J. K. (2003). Multilevel Latent Class Models. Sociological Methodology, 33, 213-239.[CrossRef
[51] Vermunt, J. K. (2007). Multilevel Mixture Item Response Theory Models: An Application in Education Testing. The Bulletin of the International Statistical Institute 56th Session, Lisbon, 25 September 2007.
[52] Vermunt, J. K. (2008). Multilevel Latent Variable Modeling: An Application in Education Testing. The Australian Journal of Statistics, 37, 285-299.
[53] Zwinderman, A. H. (1991). A Generalized Rasch Model for Manifest Predictors. Psychometrika, 56, 589-600.[CrossRef