Wanting to use time-invariant variables - despite xtoverid suggesting fe

em lowthian

Join Date: Jun 2017
Posts: 51

Wanting to use time-invariant variables - despite xtoverid suggesting fe

26 Jul 2017, 04:53

Hi all,

Sorry for the frequent posts - but i'm not entirely sure after reading Longhi & Nandi's chapter on Xtreg, re, fe how I would solve this problem. I also have searched a few terms but I can't see a similar post.

I have ran a re model, which is clustered -

Code:

xtreg selfesteem dvage ypsex divorcemo emotabusemo physpunishtmo, re vce(cluster clusterhh)

Random-effects GLS regression                   Number of obs     =        834
Group variable: youth_pidp                      Number of groups  =        795

R-sq:                                           Obs per group:
     within  = 0.0005                                         min =          1
     between = 0.0331                                         avg =        1.0
     overall = 0.0300                                         max =          2

                                                Wald chi2(5)      =      24.32
corr(u_i, X)   = 0 (assumed)                    Prob > chi2       =     0.0002

                             (Std. Err. adjusted for 626 clusters in clusterhh)
-------------------------------------------------------------------------------
              |               Robust
   selfesteem |      Coef.   Std. Err.      z    P>|z|     [95% Conf. Interval]
--------------+----------------------------------------------------------------
        dvage |  -.1344853   .0794991    -1.69   0.091    -.2903008    .0213301
        ypsex |  -.1276481   .2528637    -0.50   0.614    -.6232518    .3679557
    divorcemo |   -.978547   .4455818    -2.20   0.028    -1.851871   -.1052227
  emotabusemo |   .2393315    .064738     3.70   0.000     .1124474    .3662155
physpunishtmo |  -.0460084   .0814713    -0.56   0.572    -.2056893    .1136724
        _cons |   23.01266   1.562677    14.73   0.000     19.94987    26.07545
--------------+----------------------------------------------------------------
      sigma_u |  .82554565
      sigma_e |  3.4511155
          rho |  .05412488   (fraction of variance due to u_i)
-------------------------------------------------------------------------------

Checking for consistency, the xtoverid test suggests I should use a fe regression

Code:

Test of overidentifying restrictions: fixed vs random effects
Cross-section time-series model: xtreg re  robust cluster(clusterhh)
Sargan-Hansen statistic   9.897  Chi-sq(4)    P-value = 0.0422

However, I do not want to omit the sex variable as it is of interest to me for controlling/being a confounder.

Is there anyway I can use the random effects? Is there something I can do to incorporate sex, whilst still having a reliable, powerful model? I am not a statistician so any responses which can lead in me in the right direction in a way that I can understand would be really helpful.

Many thanks in advance.

Tags: None

Carlo Lazzaro

Join Date: Apr 2014

Posts: 17630
#2

26 Jul 2017, 04:59

Em:
you may want to take a look at the following link: net describe mundlak, from(http://fmwww.bc.edu/RePEc/bocode/m).

Kind regards,
Carlo
(StataNow 18.5)
Comment
em lowthian

Join Date: Jun 2017

Posts: 51
#3

26 Jul 2017, 06:51

Many thanks Carlo - this looks useful. Do you know with Mundlak, whether you report the values of the mean of the variables, or the normal variables?
Comment
Sebastian Kripfganz

Join Date: May 2014

Posts: 2568
#4

26 Jul 2017, 07:16

If you just want to "control" for it, you do not have to worry at all. All time-invariant variables are automatically controlled for when you use the fixed-effects approach. That is the whole purpose of this estimator.

https://www.kripfganz.de/stata/
Comment
em lowthian

Join Date: Jun 2017

Posts: 51
#5

26 Jul 2017, 07:38

Ok, that is useful thank you sebastian - although a random effects model is more powerful?
Comment
Jeff Wooldridge

Join Date: Apr 2014

Posts: 2091
#6

26 Jul 2017, 09:21

Your example is a bit puzzling. You practically have a single cross section because there are only 39 units where you observe two periods. FE will use only those 39 observations; the others will be effectively discarded. You can still use the Mundlak approach and include the time-constant variable, especially if you're interested in that variable. I suspect the estimates on the other variables are much less significant, as they are based on a much smaller data set.
Comment
em lowthian

Join Date: Jun 2017

Posts: 51
#7

27 Jul 2017, 03:04

Hi Jeff - thanks for your post. Sorry about my example - If I can elaborate anywhere with explanation or code do please let me know.

Oh I see. I thought it was the other way around whereby if you have 834 observations and 795 groups I had 795 cases where I observed over two periods. That is not ideal at all, completely disappointing for a longitudinal analysis. It also suggests why I simply cannot use the FE model.
What would you say about the reliability of the model? I understand that this is not a great example.

I agree, with only 39 observations I guess I am still only looking at association rather than causality! Disappointing but unfortunately this is the way it goes with data sometimes I suppose. Whats interesting is that I'm still seeing similar findings to other studies so that will aid the discussion.

Em
Comment

Announcement

Wanting to use time-invariant variables - despite xtoverid suggesting fe

Comment

Comment

Comment

Comment

Comment

Comment