﻿ Iterations - PROX & JMLE

# Iterations - PROX & JMLE

The Rasch model formulates a non-linear relationship between non-linear raw scores and additive measures. So, estimating measures from scores requires a non-linear process. This is performed by means of iteration. Two estimation methods are used, PROX and JMLE.

The fundamental transformation in linearizing raw scores is:

log ( (observed raw score - minimum possible raw score) / (maximum possible raw score - observed raw score) )

In Winsteps, initially every person is estimated to have the same ability measure at the origin of the measurement scale. Each item is estimated to have the same difficulty measure, also at the origin of the measurement scale. Each rating scale structure parameter, Rasch-Andrich threshold, is also estimated to be 0.

In Winsteps, the first phase of estimation uses the PROX (normal approximation) estimation algorithm. This takes the initial set of estimates and produces revised estimates:

where Bn is the revised ability estimate for person n, mn is the mean difficulty of the items encountered by person n, and sn is the standard deviation of those item difficulties. Rn is the observed raw score for person n and Nn is a perfect, maximum possible score on those same items. Similarly, for the items,

where Di is the revised difficulty estimate for item i, mi is the mean ability of the persons encountering by item i, and si is the standard deviation of those person abilities. Ri is the observed raw score on item i and Ni is a perfect score by those same persons.

To update these PROX estimates, Winsteps traverses the data computing the values of all the terms on the right-side of the estimation equations. This traversal is called an "iteration". When the increase in the range of the person or item measures is smaller than 0.5 logits, or when MPROX= is reached, iteration ceases.

Initial estimates of the Rasch-Andrich threshold between category k and category k-1 are obtained from log (observed frequency of category k-1 / observed frequency of category k) normalized to sum to zero across the thresholds of a rating scale.

The PROX estimates become the starting values for JMLE (Joint Maximum Likelihood Estimation). Using these person, item and rating scale structure estimates, Winsteps computes the expected value, according to the Rasch model, corresponding to each observation in term. After iterating through the entire data set, the marginal sums of these expected values, the person expected raw scores and the item expected raw scores, are compared with their observed (empirical) values. If a person's expected raw score is less than that person's observed raw score, then the ability estimate raised. If the person's expected raw score is greater than the observed score, then the ability estimate is lowered. For items, if the expected raw score is less than the observed score, then the difficulty estimate is lowered. If the item's expected raw score is greater than the observed score, then the difficulty estimate is raised.

The estimation equations for JMLE are derived in RSA, where Newton-Raphson iteration is employed.

y' = y + (observed score - Rasch expected score based on current estimates)/(modeled variance)

where y = a current estimated person measure and y' is the improved estimate.

To see this process in an Excel spreadsheet, go to www.rasch.org/moulton.htm

Newton-Raphson estimation has proved unstable with sparse data sets and also with rating scales which have alternating very high and very low frequency categories. Accordingly, Winsteps implements a more robust proportional-curve-fitting algorithm to produce JMLE estimates. The relationship between raw scores and measures is always monotonic, so the characteristic curve for each person or item parameter is modeled to have the local form of a logistic ogive:

y = a * log( (x-l)/(h-x) ) + c

where y = an estimated measure, a = slope of the ogive, x = a raw score, l = the known minimum possible raw score for the parameter, h = the known maximum possible raw score for the parameter, c = location of ogive relative to local origin.

Values of x are obtained form the current estimated measure y and a nearby measure (y + d). From these, a and c are estimated. The revised measure y' is obtained by evaluating the equation using the observed raw score as the value of x. In the plot below for Example 1, the current estimate, y, is -3 logits, a nearby estimate, y+d, is -2 logits. These both estimate raw scores on the currently-estimated test characteristic curve (TCC, the remainder of which is not yet known). The violet line is the logistic ogive going through these two known points. It is close to the putative TCC. The observed score of "5" is then found on the logistic ogive and an improved estimate is obtained. After all the person and item estimates are improved, the estimated TCC changes and this estimation process is repeated by performing another iteration through the data. Winsteps sets d equal to the biggest logit change in any item or person estimate in the previous iteration. This rapidly becomes much less than 1.0.

For the rating scale structure, the estimate, yk, for Rasch-Andrich threshold k is improved by

yk' = yk - log ( observed count category k / observed count category k-1)

+ log (estimated count category k / estimated count category k-1)

When the various convergence criteria are satisfied, iteration ceases and the final estimates are obtained. These are used in computing fit statistics.

Example: Here is the iteration Table for example0.txt:

CONVERGENCE TABLE

+----------------------------------------------------------------------------+

|    PROX          ACTIVE COUNT       EXTREME 5 RANGE      MAX LOGIT CHANGE  |

| ITERATION   KIDS    ACTS    CATS     KIDS    ACTS       MEASURES  STRUCTURE|

|----------------------------------------------------------------------------|

|        1       75      25     3       3.78    3.20        3.8918     .0740 |

|        2       74      25     3       4.59    3.71         .8258    -.6158 |

|        3       74      25     3       4.83    3.92         .2511    -.1074 |

|----------------------------------------------------------------------------|

|    JMLE     MAX SCORE   MAX LOGIT    LEAST CONVERGED     CATEGORY STRUCTURE|

| ITERATION   RESIDUAL*    CHANGE    KID    ACT      CAT   RESIDUAL   CHANGE |

|----------------------------------------------------------------------------|

|       1         2.84      -.1955      60     22*      2      21.44    .0076|

|       2          .71      -.0335      53     15*      0      -5.89    .0144|

|       3         -.43       .0293      53      5*      1       3.48    .0101|

|       4          .32       .0235      18     11*      1       2.71    .0079|

|       5          .24       .0184      18     11*      0      -2.09    .0060|

|       6          .19       .0141      18     11*      0      -1.63    .0045|

|       7          .14       .0108      18     11*      0      -1.25    .0035|

|       8          .11       .0082      18     11*      0       -.96    .0026|

|       9          .08       .0062      18     11*      0       -.73    .0020|

|      10          .06       .0048      18     11*      0       -.56    .0015|

+----------------------------------------------------------------------------+

In the top section of the Convergence Table are reported the number of active persons, items and categories. The range of item and person measures at the end of each PROX iteration is shown, also the biggest change in any person or item, and in any Rasch-Andrich threshold. PROX iteration ceases with iteration 3 because the "KIDS" (persons) and "ACTS" (items) range has increased by less than 0.5 logits.

In the lower section, for each JMLE iteration, the maximum score residual, the biggest difference between any observed and expected marginal score is shown. Also the biggest change in any measure. Iteration ceases when the values, in iteration 10, are less than the convergence criteria.

Help for Winsteps Rasch Measurement Software: www.winsteps.com. Author: John Michael Linacre

 Forum Rasch Measurement Forum to discuss any Rasch-related topic

Rasch Publications
Rasch Measurement Transactions (free, online) Rasch Measurement research papers (free, online) Probabilistic Models for Some Intelligence and Attainment Tests, Georg Rasch Applying the Rasch Model 3rd. Ed., Bond & Fox Best Test Design, Wright & Stone
Rating Scale Analysis, Wright & Masters Introduction to Rasch Measurement, E. Smith & R. Smith Introduction to Many-Facet Rasch Measurement, Thomas Eckes Invariant Measurement with Raters and Rating Scales: Rasch Models for Rater-Mediated Assessments, George Engelhard, Jr. & Stefanie Wind Statistical Analyses for Language Testers, Rita Green
Rasch Models: Foundations, Recent Developments, and Applications, Fischer & Molenaar Journal of Applied Measurement Rasch models for measurement, David Andrich Constructing Measures, Mark Wilson Rasch Analysis in the Human Sciences, Boone, Stave, Yale
in Spanish: Análisis de Rasch para todos, Agustín Tristán Mediciones, Posicionamientos y Diagnósticos Competitivos, Juan Ramón Oreja Rodríguez
Winsteps Tutorials Facets Tutorials Rasch Discussion Groups

Coming Rasch-related Events
April 10-12, 2018, Tues.-Thurs. Rasch Conference: IOMW, New York, NY, www.iomw.org
April 13-17, 2018, Fri.-Tues. AERA, New York, NY, www.aera.net
May 22 - 24, 2018, Tues.-Thur. EALTA 2018 pre-conference workshop (Introduction to Rasch measurement using WINSTEPS and FACETS, Thomas Eckes & Frank Weiss-Motz), https://ealta2018.testdaf.de
May 25 - June 22, 2018, Fri.-Fri. On-line workshop: Practical Rasch Measurement - Core Topics (E. Smith, Winsteps), www.statistics.com
June 27 - 29, 2018, Wed.-Fri. Measurement at the Crossroads: History, philosophy and sociology of measurement, Paris, France., https://measurement2018.sciencesconf.org
June 29 - July 27, 2018, Fri.-Fri. On-line workshop: Practical Rasch Measurement - Further Topics (E. Smith, Winsteps), www.statistics.com
July 25 - July 27, 2018, Wed.-Fri. Pacific-Rim Objective Measurement Symposium (PROMS), (Preconference workshops July 23-24, 2018) Fudan University, Shanghai, China "Applying Rasch Measurement in Language Assessment and across the Human Sciences" www.promsociety.org
Aug. 10 - Sept. 7, 2018, Fri.-Fri. On-line workshop: Many-Facet Rasch Measurement (E. Smith, Facets), www.statistics.com
Sept. 3 - 6, 2018, Mon.-Thurs. IMEKO World Congress, Belfast, Northern Ireland www.imeko2018.org
Oct. 12 - Nov. 9, 2018, Fri.-Fri. On-line workshop: Practical Rasch Measurement - Core Topics (E. Smith, Winsteps), www.statistics.com

Our current URL is www.winsteps.com