Application of the DIF-free-then-DIF Strategy to the Logistic Regression Method in Assessment of Differential Item Functioning (Thesis of Mr. Chan)
Traditional method on assessing DIF is not based on pure another items and it’s actually depend on unclean items. Hence, DIF detection may be confound by considering these dirty items into total score. Main contribution of the article is to apply DIF-free-then-DIF strategy to detect DIF by using the logistic regression method. The strategy is to find a (several) pure item(s) to be an anchor item(s), then assess DIF through pure items which is found from the first stage. The new method can improve the performance of DIF detection better than the conventional one. A series of simulations were conducted and the results illustrated that type I error was well controlled by adoption of the strategy, particularly for larger DIF difference. Moreover, the power was also reasonable good compared to other methods.
Main idea of the new method
1) DIF-free -then-DIF (first round)
DIF-free is a purification procedure which has been proposed to find pure items in the first round. Set the first item to be an anchor item, then to compute difference of -2 log likelihood of two logistic regression models for each item. At the same way, to set second, third, …, and the very last item to be anchor item, then compute statistics. Finally, summing of difference of -2 log likelihood for each item, and set items with smaller sum to be DIF-free items (anchor items).
2) DIF-free-then-DIF (second round)
Only using total score by summing of DIF-free items (also is denoted as anchor items here), then compute -2 log likelihood of two logistic regression models: One is full model that it considers total score and group membership into the model, and the other is reduce model which only takes total score into account. We can assess whether DIF exists or not by hypothesis testing. When testing is significant, it means that null hypothesis is not true (the coefficient of group membership is not equal to 0), that is DIF exists.
Comments, Questions and Future Study
1) Conventionally, we focused on detecting DIF items using methods which don’t ignore dirty items in the model. However, the performance of DIF detection may be affected by these unclean items. The present study proposed a much creative and positive idea to find pure items first. Thus, it can resolve the problem mentioned above.
2) The fundamental thought of the paper has given me a light on adjusting purification procedure that using on my on-going study. In my study, we tried to adopt purification to find more accurate ability estimates and item parameter. However, the original method could not prove that it can improve the performance of detecting examinees with pre-knowledge by purification procedure. The original method we used is also based on assessing bad items or bad persons, then discard these bad items and persons to obtain much accurate estimates. Maybe we can try another opposite approach to find pure items and persons first. I think it would be a good way to recovery item parameter for sure.
Reference
Wang, W.-C. (2008). Assessment of differential item functioning. Journal of Applied Measurement, 9, 387-408.