Exercise C1, linear model


Johnson and Albert (1999) analysed data on the grading of essays by several experts. Essays were graded on a scale of 1 to 10 with 10 being excellent. In this exercise we use the subset of the data limited to the grades from graders 1 and 4 on 198 essays (grader1.dat). The same data were used by Rabe-Hesketh and Skrondal (2005, exercise 1.5).



Original data description (grader1.dat)


Number of observations (rows): 198

Number of variables (columns): 3



grade1= grade awarded by grader 1 {1,2,,10)

grade4=grade awarded by grader 4 {1,2,.,10}

essay=essay identifier


The first few lines of the grader1.dat look like:



To use the data in Sabre we need to stack the data, with grade1 and grade4 as a single column grade. We have done this for you and generated an identifier to distinguish grade1 and grade4, i.e. dg4=1, if grade4 =1 and 0 otherwise.


Data description (grader2.dat)

Number of observations (rows): 396

Number of variables (columns): 6



ij = essay identifier (1,2,,198)

r = response (1,2)

grade= grade awarded

essay= essay identifier (this is a copy of ij)

dg1 = 1 if this is the grade from grader 1, 0 otherwise

dg4 = 1 if this is the grade from grader 4, 0 otherwise



The first few lines of the stacked data (grader2.dat) look like:



Start Sabre and specify transcript file:


out grader2.log


data ij r grade essay dg1 dg4

read grader2.dat



Suggested exercise:


(1) Estimate the linear model using Sabre on grade, with just a constant and no other effects;

(2) Estimate the linear model, allowing for the essay random effect, use mass 64, this is a linear model and a lot of quadrature points are need to get a good approximation to the integrated likelihood. Are the essay effects significant? What impact do they have on the model?

(3) Re-estimate the linear model allowing for both the essay random effect and dg4.

(4) How do the results change as compared to a model with just a constant? Interpret your results.





Johnson, V. E., and Albert, J., H., (1999), Ordinal Data Modelling, Springer, New York.


Rabe-Hesketh, S., and Skrondal, A., (2005), Multilevel and Longitudinal Modelling using Stata, Stata Press, Stata Corp, College Station, Texas.