Exercise 3LC2. Binary response model: Tower of London tests (226 individuals in 118 families)



This data set (tower1.dat) is from Rabe-Hesketh and Skrondal (2005). Rabe-Hesketh, Touloupolou and Murray (2001) estimated a multilevel cognitive performance model on 3 groups: (1) subjects with schizophrenia; (2) subject’s relatives and (3) control subjects. The Tower of London test was used to assess cognitive performance. The responses have a 3-level structure, i.e. occasion i for subject j in family k. The test was repeated at 3 different levels of difficulty.  The binary response dtlm takes the value 1 if each test was completed in the minimum number of moves and 0 otherwise. The same data were used by Rabe-Hesketh and Skrondal (2005, exercise 7.2).



Rabe-Hesketh, S., Toulopoulou, T. and Murray, R. (2001). Multilevel modeling of cognitive function in schizophrenic patients and their first degree relatives. Multivariate Behavioral Research 36, 279-298.

Rabe-Hesketh, S., and Skrondal, A., (2005), Multilevel and Longitudinal Modelling using Stata, Stata Press, Stata Corp, College Station, Texas.


Data description


Number of observations = 677

Number of level-2 cases (‘id’ = subject identifier) = 226

Number of level-3 cases (‘famnum’ = family identifier) = 118


The tower1.dat includes the variables:


id= subject identifier                          

level=  level of difficulty of the Tower of London test                           

famnum= family identifier  

group=  group (1:controls, 2:relatives, 3:schizophrenics) 

age= subject’s age (years)

dtlm= indicator for completing the task in the minimum number of moves  



The first few lines of tower1.dat look like:




Suggested exercise:


1.    Estimate a logit model (without random effects) for the binary response dtlm with the covariate level, and dummy variables for group=2 and group=3.

2.    Allow for the level-2 subject random effect (id), use mass 12. Is this random effect significant?

3.    Allow for both the level-2 subject random effect (id), and for the level-3 family random effects (famnum), level 3 use mass 12 for both levels. Are both these random effects significant? Is this model a significant improvement over the model estimated in part 2 of this exercise?

4.    How did your results on group2 and group3 change when you allowed for subject (level 2) and then family (level 3) effects?