﻿ Bias interaction DIF DPF DRF estimation

# Bias interaction DIF DPF DRF estimation

After estimating the measures, Facets checks to see if any Model= specifications include Bias specifiers, "B". If so, for each such model, the specified Bias interaction is estimated for all the data (not just the data matching that particular model). Bias can be due to any type of interaction including Differential Item Functioning DIF, Differential Person Functioning DPF, Differential Rater Functioning DRF.

This is done by iterating through the data again and, after convergence, doing one further iteration to calculate statistics.

Computation of interactions is a two-stage process.

1. The measures for the elements, and the structure of the rating scale, are estimated. Then those values are anchored (fixed, held constant).

2. The expected values of the observations are subtracted from the observed values of the observations, producing the residuals.

3. The residuals corresponding to each interaction term (e.g., examinees rated by judge 4) are summed. If this sum is not zero, then there is an interaction.

4. The size of the interaction is estimated. A first approximation is:

Interaction (logits) = (sum of residuals) / (sum of the statistical-information in the observations).

Algebraically, first the Bn, Di, Cj, Fk are estimated using a Rasch model such as:

log ( Pnijk / Pnij(k-1)) = Bn - Di - Cj - Fk

Then the Bn, Di, Cj, Fk are anchored, and the bias/interaction terms, e.g., Cij, are estimated:

log ( Pnijk / Pnij(k-1)) = ( Bn - Di - Cj - Fk ) - Cij

Thus the Cij are estimated from the residuals left over from the main analysis. The conversion from residual score to bias interaction size is non-linear. Bias sizes may not sum to zero.

Bias, (also called interaction, differential item function, differential person function, etc.,) estimation serves several purposes:

1) in diagnosing misfit:

The response residuals are partitioned by element, e.g., by judge-item pairs, and converted into a logit measure. Estimates of unexpected size and statistical significance flag systematic misfit, focusing the misfit investigation.

2) in investigating validity:

A systematic, but small, bias in an item or a judge, for or against any group of persons, may be overwhelmed by the general stochastic component in the responses. Consequently it may not be detected by the usual summary fit statistics. Specifying a bias analysis between elements of facets of particular importance provides a powerful means of investigating and verifying the fairness and functioning of a test.

3) in assessing the effect of bias:

Since bias terms have a measure and a standard error (precision), their size and significance (t-statistic) are reported. This permits the effect of bias to be expressed in the same frame of reference as the element measures. Thus each element measure can be adjusted for any bias which has affected its estimation, e.g., by adding the estimate of bias, which has adversely affected an element, to that element's logit measure. Then the practical implications of removing bias can be determined. Does adjustment for bias alter the pass-fail decision? Does adjustment for bias affect the relative performance of two groups in a meaningful way?

4) in partitioning unexplained "error" variance:

The bias logit sample standard deviation corrected for its measurement error, can be an estimate of the amount of systematic error in the error variance (RMSE).

e.g., for a bias analysis of judges,

Bias logit S.D. = 0.47, mean bias S.E. = 0.32 (Table 13),

so "true" bias S.D. = (0.47² - 0.32²) = 0.35 logits,

but, this exceeds the RMSE for judges = 0.12 (Table 7).

Here, locally extreme judge-person scores cause an overestimation of systematic bias.

Adjusting for bias:

A straight-forward approach is to define the biased element as two elements: one element for one subset of judges (examinees, etc.) and a different items for the other subset of judges (examinees, etc.). This can done by defining an extra item element, and then adjusting item references in the data file accordingly.

Example:

Facets = 4 ; Items, candidates, examiners, bias adjustment

Non-center = 2 ; candidates float

Models =

?, 28, 17, 1, myscale ; allow for bias adjustment between candidate 28 and examiner 17

?,  ?, ?, 2, myscale

*

Rating scale = myscale, R9

Labels=

1, Items

...

2, Candidates

...

3, Examiners

....

4, Bias adjustment, A

1, 28-17 adjustment ; the bias will be absorbed by this element, relative to element 2.

2, Everyone else, 0

*

Data=

1-5, 28, 17, 1, 1,2,3,4,5

1-5, 29, 23, 2, 5,4,3,2,1

.....

Help for Facets Rasch Measurement Software: www.winsteps.com Author: John Michael Linacre.

The Languages of Love: draw a map of yours!

For more information, contact info@winsteps.com or use the Contact Form

 Facets Rasch measurement software. Buy for \$149. & site licenses. Freeware student/evaluation download Winsteps Rasch measurement software. Buy for \$149. & site licenses. Freeware student/evaluation download

 Forum Rasch Measurement Forum to discuss any Rasch-related topic

Click here to add your email address to the Winsteps and Facets email list for notifications.

Click here to ask a question or make a suggestion about Winsteps and Facets software.

Rasch Publications
Rasch Measurement Transactions (free, online) Rasch Measurement research papers (free, online) Probabilistic Models for Some Intelligence and Attainment Tests, Georg Rasch Applying the Rasch Model 3rd. Ed., Bond & Fox Best Test Design, Wright & Stone
Rating Scale Analysis, Wright & Masters Introduction to Rasch Measurement, E. Smith & R. Smith Introduction to Many-Facet Rasch Measurement, Thomas Eckes Invariant Measurement with Raters and Rating Scales: Rasch Models for Rater-Mediated Assessments, George Engelhard, Jr. & Stefanie Wind Statistical Analyses for Language Testers, Rita Green
Rasch Models: Foundations, Recent Developments, and Applications, Fischer & Molenaar Journal of Applied Measurement Rasch models for measurement, David Andrich Constructing Measures, Mark Wilson Rasch Analysis in the Human Sciences, Boone, Stave, Yale
in Spanish: Análisis de Rasch para todos, Agustín Tristán Mediciones, Posicionamientos y Diagnósticos Competitivos, Juan Ramón Oreja Rodríguez
Winsteps Tutorials Facets Tutorials Rasch Discussion Groups

Coming Winsteps & Facets Events
May 22 - 24, 2018, Tues.-Thur. EALTA 2018 pre-conference workshop (Introduction to Rasch measurement using WINSTEPS and FACETS, Thomas Eckes & Frank Weiss-Motz), https://ealta2018.testdaf.de
May 25 - June 22, 2018, Fri.-Fri. On-line workshop: Practical Rasch Measurement - Core Topics (E. Smith, Winsteps), www.statistics.com
June 27 - 29, 2018, Wed.-Fri. Measurement at the Crossroads: History, philosophy and sociology of measurement, Paris, France., https://measurement2018.sciencesconf.org
June 29 - July 27, 2018, Fri.-Fri. On-line workshop: Practical Rasch Measurement - Further Topics (E. Smith, Winsteps), www.statistics.com
July 25 - July 27, 2018, Wed.-Fri. Pacific-Rim Objective Measurement Symposium (PROMS), (Preconference workshops July 23-24, 2018) Fudan University, Shanghai, China "Applying Rasch Measurement in Language Assessment and across the Human Sciences" www.promsociety.org
Aug. 10 - Sept. 7, 2018, Fri.-Fri. On-line workshop: Many-Facet Rasch Measurement (E. Smith, Facets), www.statistics.com
Oct. 12 - Nov. 9, 2018, Fri.-Fri. On-line workshop: Practical Rasch Measurement - Core Topics (E. Smith, Winsteps), www.statistics.com

Our current URL is www.winsteps.com

Winsteps® is a registered trademark

 John "Mike" L.'s Wellness Report: I'm 72, take no medications and, March 2018, my doctor is annoyed with me - I'm too healthy! According to Wikipedia, the human body requires about 30 minerals, maybe more. There are 60 naturally-occurring minerals in the liquid Mineral Supplement which I take daily.