Testing the Assumptions of Multilevel Models

Karla Arias

Join Date: Oct 2020

Posts: 6
#1

Testing the Assumptions of Multilevel Models

04 Oct 2020, 16:20

Hello!
I want to know if Stata has a command to calculate the level 2 residuals. I want to test the assumptions for level 2 of a multilevel model.

Thanks for the help
Tags: assumptions, mutilevel models, residuals, residuals level 2, syntax
Clyde Schechter

Join Date: Apr 2014

Posts: 30191
#2

04 Oct 2020, 17:20

After at least -mixed-, -melogit-, and -mepoisson-, -predict- has an -reffects- option that will do this. I believe this is true of the other -me- commands as well, but I haven't checked or used it personally in those other cases.

For details, type -help mixed postestimation- and click the blue -predict- link.
1 like
Comment
Bruce Weaver

Join Date: May 2014

Posts: 1143
#3

05 Oct 2020, 09:25

Hello Karla. Clyde has suggested how you can obtain the desired residuals. I'll just add that if you mean that you intend to carry out statistical tests of assumptions, that is generally not a good idea. Take tests of normality as an example.
H₀ states that the observations (e.g., residuals) are a random sample from a normally distributed population.

Like other tests, tests of normality have low power when n is low, and higher power as n increases.

Normality of the errors is most important when n is low, and becomes less and less important as n increases.

Conclusion: Tests of normality fail to detect important departures from normality when n is low, but throw up the red flag far too easily when n is large (and normality of the errors is far less important). Therefore, there is not much point in carrying out such tests. Graphical examinations of the residuals are far more useful, IMO.

HTH.

--
Bruce Weaver
Email: [email protected]
Version: Stata/MP 19.5 (Windows)
2 likes
Comment
Karla Arias

Join Date: Oct 2020

Posts: 6
#4

05 Oct 2020, 15:26

Originally posted by Clyde Schechter View Post

After at least -mixed-, -melogit-, and -mepoisson-, -predict- has an -reffects- option that will do this. I believe this is true of the other -me- commands as well, but I haven't checked or used it personally in those other cases.

For details, type -help mixed postestimation- and click the blue -predict- link.

Thank you very much, Clyde. I did it!
Comment
Karla Arias

Join Date: Oct 2020

Posts: 6
#5

05 Oct 2020, 16:16

Originally posted by Bruce Weaver View Post

Hello Karla. Clyde has suggested how you can obtain the desired residuals. I'll just add that if you mean that you intend to carry out statistical tests of assumptions, that is generally not a good idea. Take tests of normality as an example.
H₀ states that the observations (e.g., residuals) are a random sample from a normally distributed population.

Like other tests, tests of normality have low power when n is low, and higher power as n increases.

Normality of the errors is most important when n is low, and becomes less and less important as n increases.

Conclusion: Tests of normality fail to detect important departures from normality when n is low, but throw up the red flag far too easily when n is large (and normality of the errors is far less important). Therefore, there is not much point in carrying out such tests. Graphical examinations of the residuals are far more useful, IMO.

HTH.

Hello Bruce, I understand, I agree. I also prefer graphical examinations.
What would be the graphical analysis for the residuals of level 2? Can you help me please.
Comment

Joseph Coveney

Join Date: Apr 2014
Posts: 4457

05 Oct 2020, 19:42

Originally posted by Karla Arias View Post

What would be the graphical analysis for the residuals of level 2?

Run the do-file below and take a look at the resulting Q-Q and P-P plots of the BLUPs. The user manual entries for qnorm and pnorm give you an idea of what to look for in the respective plots.

Code:

version 16.1

clear *

set seed `=strreverse("1575692")'

quietly set obs 200
generate int pid = _n
generate double pid_u = rnormal()

quietly expand 10

generate double out = pid_u + rnormal()

mixed out || pid: , nolrtest nolog

*
* Begin here
*
predict double pre, reffects
label variable pre BLUPs

qnorm pre, msize(vsmall) mcolor(black) ///
    ylabel( , angle(horizontal) nogrid) ///
    rlopts(lcolor(black) lpattern(dash))
quietly graph export QQ.png, replace

pnorm pre, msize(vsmall) mcolor(black) ///
    ylabel( , angle(horizontal) nogrid) ///
    rlopts(lcolor(black) lpattern(dash))
quietly graph export PP.png, replace

exit

Comment

Karla Arias

Join Date: Oct 2020
Posts: 6

05 Oct 2020, 20:23

Originally posted by Joseph Coveney View Post

Run the do-file below and take a look at the resulting Q-Q and P-P plots of the BLUPs. The user manual entries for qnorm and pnorm give you an idea of what to look for in the respective plots.

Code:

version 16.1

clear *

set seed `=strreverse("1575692")'

quietly set obs 200
generate int pid = _n
generate double pid_u = rnormal()

quietly expand 10

generate double out = pid_u + rnormal()

mixed out || pid: , nolrtest nolog

*
* Begin here
*
predict double pre, reffects
label variable pre BLUPs

qnorm pre, msize(vsmall) mcolor(black) ///
ylabel( , angle(horizontal) nogrid) ///
rlopts(lcolor(black) lpattern(dash))
quietly graph export QQ.png, replace

pnorm pre, msize(vsmall) mcolor(black) ///
ylabel( , angle(horizontal) nogrid) ///
rlopts(lcolor(black) lpattern(dash))
quietly graph export PP.png, replace

exit

Thank you Joseph!!
I have a question, would it be different if I have a variable as a random effect within the multilevel model?
In such a model, how would you make a residuals vs predicted scatter plot?

Comment

Joseph Coveney

Join Date: Apr 2014

Posts: 4457
#8

05 Oct 2020, 21:17

Originally posted by Karla Arias View Post

. . . would it be different if I have a variable as a random effect within the multilevel model?

I'm sorry, I don't understand. Do you mean a random slope? According to the help file for predict after mixed, "By default, BLUPs for all random effects in the model are calculated." So, I believe that it would be exactly the same as I show above, but you will need to specify separate variables for each BLUP prediction, or else specify a variable-name stub. See the help file for more information about how to do that.

. . . how would you make a residuals vs predicted scatter plot?

Here's an example.

Code:

version 16.1 clear * set seed `=strreverse("1575715")' quietly sysuse auto summarize rep78, meanonly quietly replace rep78 = runiformint(r(min), r(max)) if mi(rep78) xtreg gear_ratio c.trunk, i(rep78) re * * Begin here * predict double ue, ue // Residuals predict double xb, xb // Predicted // Residuals-versus-predicted plot graph twoway scatter ue xb, mcolor(black) msize(small) /// ytitle(Combined Residual) ylabel( , angle(horizontal) format(%03.1f) nogrid) /// xtitle(Predicted (Linear Prediction)) /// yline(0, lcolor(black) lpattern(dash)) exit
Comment
Karla Arias

Join Date: Oct 2020

Posts: 6
#9

05 Oct 2020, 22:39

Originally posted by Joseph Coveney View Post

I'm sorry, I don't understand. Do you mean a random slope?]

Yes that's what I mean. I checked it, thanks.

I have a last question, How can I get the predicted values for level 2 in a multilevel model?
Which command can I use?
Comment
Joseph Coveney

Join Date: Apr 2014

Posts: 4457
#10

05 Oct 2020, 23:10

Originally posted by Karla Arias View Post

How can I get the predicted values for level 2 in a multilevel model? Which command can I use?

I showed you already. See #6 above. It's the command shown just following the "Begin here" comment in the code block there.
Comment
Karla Arias

Join Date: Oct 2020

Posts: 6
#11

28 Oct 2020, 21:58

Hello Joseph,

Thank you.
Yes, I get it, my question is how can I get and plot the residuals and predictions for level 2 of a multilevel model. It's because I was reading chapter 2 of Joop Hox's book "Multilevel Analysis (Quantitative methodology series)", he talks about this but I don't know how to do it with Stata.
Let me show you with the following image what I mean:
(The sample size for the example is 2000 students from 100 classes)
Comment

Announcement