Announcement

Collapse
No announcement yet.
X
  • Filter
  • Time
  • Show
Clear All
new posts

  • Options when linear mixed model residuals are non-normally distributed

    Hey everyone,

    In a project I've been working on I've run into non-normally distributed residuals in a linear mixed model, with some fixed effects and just random intercepts (no random slopes). Non-normality of the residuals, of course, violates the assumption that the error term is normally distributed. I have uploaded a qqplot and histogram of the level 1 residuals.

    The model has dependent variable time (0, 1, 3, 4, 5, 9, 14, ..., 432) and several binary and categorical variables, as well as one continuous variable.

    Code:
    predict lev1, rstandard
    qnorm lev1
    histogram lev1, rstandard
    Click image for larger version

Name:	lev1_residuals_histogram_linear.png
Views:	1
Size:	30.1 KB
ID:	1461464

    Click image for larger version

Name:	lev1_residuals_linear.png
Views:	1
Size:	21.1 KB
ID:	1461465






    Ultimately I want to visualize the random intercept, and I am afraid that a log-transformation may eliminate that option. Moreover, the fact that interpretation of linear mixed models is relatively simple is something I like.

    What approaches that allow visualizing the random intercepts would you recommend for me? Are perhaps GLMMS a good choice? And with what family and link function?
    Last edited by Dougie Jones; 10 Sep 2018, 07:56.

  • #2
    If you use the -vce(robust)- option, then the assumption of normality of residuals no longer applies.

    Comment


    • #3
      Thank you, Clyde.

      However, using vce(robust) makes it impossible to predict the standard errors of the random effects predictions.
      Code:
      mixed x y || id:, vce(robust)
      predict random_effects, reffects reses(newvar)
      Is there any way around this issue? Or an alternative to show the uncertainty surround the random effects predictions?
      Last edited by Dougie Jones; 11 Sep 2018, 02:28.

      Comment


      • #4
        You mentioned generalized linear multilevel / mixed models. Have you considered something like the following?
        Code:
        meglm . . . || . . . : , family(gaussian) link(log)

        Comment


        • #5
          Originally posted by Clyde Schechter View Post
          If you use the -vce(robust)- option, then the assumption of normality of residuals no longer applies.
          Hi Clyde, and others. There are lots of books and articles one can cite to support use of Huber-White robust standard errors to deal with heteroscedasticity. I wonder if anyone can point to any reputable textbooks or articles to support Clyde's point about non-normality. Thus far, all I've come up with is the following excerpt from this UCLA web-book on regression (emphasis added):

          The Stata regress command includes a robust option for estimating the standard errors using the Huber-White sandwich estimators. Such robust standard errors can deal with a collection of minor concerns about failure to meet assumptions, such as minor problems about normality, heteroscedasticity, or some observations that exhibit large residuals, leverage or influence. For such minor problems, the robust option may effectively deal with these concerns.

          With the robust option, the point estimates of the coefficients are exactly the same as in ordinary OLS, but the standard errors take into account issues concerning heterogeneity and lack of normality.
          It's not that I doubt what you and the web-book author are saying, Clyde. I'm just imagining that my boss or other collaborators may need something in print in a reputable book or journal to be convinced.

          Cheers,
          Bruce
          --
          Bruce Weaver
          Email: [email protected]
          Version: Stata/MP 18.5 (Windows)

          Comment

          Working...
          X