TY - JOUR
T1 - Detecting Differential Item Functioning Using the Logistic Regression Procedure in Small Samples
AU - Lee, Sunbok
N1 - Publisher Copyright: © 2016, © The Author(s) 2016.
PY - 2017/1/1
Y1 - 2017/1/1
AB - The logistic regression (LR) procedure for testing differential item functioning (DIF) typically depends on asymptotic sampling distributions. The likelihood ratio test (LRT) usually relies on the asymptotic chi-square distribution. Also, the Wald test is typically based on the asymptotic normality of the maximum likelihood (ML) estimator, and the Wald statistic is tested using the asymptotic chi-square distribution. However, in small samples, the asymptotic assumptions may not hold well. Penalized maximum likelihood (PML) estimation removes the first-order finite-sample bias from the ML estimate, and the bootstrap method constructs an empirical sampling distribution. This study compares the performance of the LR procedures based on the LRT, Wald test, penalized likelihood ratio test (PLRT), and bootstrap likelihood ratio test (BLRT) in terms of statistical power and Type I error for testing uniform and non-uniform DIF. The results of the simulation study show that the LRT with the asymptotic chi-square distribution works well even in small samples.
KW - bootstrap
KW - differential item functioning
KW - logistic regression
KW - penalized maximum likelihood
KW - small samples
UR - http://www.scopus.com/inward/record.url?scp=85002427136&partnerID=8YFLogxK
U2 - 10.1177/0146621616668015
DO - 10.1177/0146621616668015
M3 - Article
AN - SCOPUS:85002427136
SN - 0146-6216
VL - 41
SP - 30
EP - 43
JO - Applied Psychological Measurement
JF - Applied Psychological Measurement
IS - 1
ER -