Dear Stata Forum,
I have a panel (N = 3584 and T = 14) after omitting observations with any missing values. What I’m trying to analyze if the impact of business practices on firm performance.
With my decent knowledge on statistics, I have been reading various posts and come up with the following steps to perform the analysis for my paper.
However, I'm unsure with the following points.
Question 1) if my steps are econometrically sound
Question 2) what to do after I perform the last step
I appreciate your advice.
The following part describes the steps I took.
1) create the initial regression model
2) check multicollinearity of 1) by -vif- and omit some dummy variables whose vif > 5
3) declare the panel
4) compare POLS model and RE model -xttest0-.
5) compare FE model and RE model -hausman-.
The test says FE model is appropriate
6) After the Hausman test, I invoke non-default standard errors for RE model: -xi: xtreg, re vce(robust)- and -xtoverid-
The test says the null is rejected, so I go with the robust FE model.
7) check heteros in the FE model
8) check autocorrelation in the FE model
9) since both the heteros and autocorrelation tests are rejected, I invoke non-default standard errors into the FE model
I have a panel (N = 3584 and T = 14) after omitting observations with any missing values. What I’m trying to analyze if the impact of business practices on firm performance.
With my decent knowledge on statistics, I have been reading various posts and come up with the following steps to perform the analysis for my paper.
However, I'm unsure with the following points.
Question 1) if my steps are econometrically sound
Question 2) what to do after I perform the last step
I appreciate your advice.
The following part describes the steps I took.
1) create the initial regression model
2) check multicollinearity of 1) by -vif- and omit some dummy variables whose vif > 5
3) declare the panel
4) compare POLS model and RE model -xttest0-.
Code:
xtreg ln_tq x1 x2 x3 x4 x5 size re exps i.year, re estimates store re_model xttest0
Code:
Breusch and Pagan Lagrangian multiplier test for random effects
ln_tq[co_cik,t] = Xb + u[co_cik] + e[co_cik,t]
Estimated results:
| Var SD = sqrt(Var)
---------+-----------------------------
ln_tq | .2866833 .5354281
e | .091313 .3021804
u | .1865672 .4319343
Test: Var(u) = 0
chibar2(01) = 4595.75
Prob > chibar2 = 0.0000
The test says FE model is appropriate
Code:
xtreg ln_tq x1 x2 x3 x4 x5 size re exps i.year, fe estimates store fe_model hausman fe_model re_model
Code:
F test that all u_i=0: F(459, 3103) = 15.11 Prob > F = 0.0000
Code:
Test of H0: Difference in coefficients not systematic
chi2(19) = (b-B)'[(V_b-V_B)^(-1)](b-B)
= 90.29
Prob > chi2 = 0.0000
(V_b-V_B is not positive definite)
6) After the Hausman test, I invoke non-default standard errors for RE model: -xi: xtreg, re vce(robust)- and -xtoverid-
The test says the null is rejected, so I go with the robust FE model.
Code:
xi: xtreg ln_tq x1 x2 x3 x4 x5 size re exps i.year, re vce(robust) xtoverid
Code:
Test of overidentifying restrictions: fixed vs random effects Cross-section time-series model: xtreg re robust cluster(co_cik) Sargan-Hansen statistic 134.156 Chi-sq(21) P-value = 0.0000
Code:
xtreg ln_tq x1 x2 x3 x4 x5 size re exps i.year, fe xttest3
Code:
Modified Wald test for groupwise heteroskedasticity in fixed effect regression model H0: sigma(i)^2 = sigma^2 for all i chi2 (460) = 1.4e+35 Prob>chi2 = 0.0000
Code:
xtserial ln_tq x1 x2 x3 x4 x5 size re exps
Code:
. xtserial ln_tq x1 x2 x3 x4 x5 size re exps
Wooldridge test for autocorrelation in panel data
H0: no first order autocorrelation
F( 1, 369) = 65.771
Prob > F = 0.0000
Code:
xtreg ln_tq x1 x2 x3 x4 x5 size re exps i.year, fe robust
Code:
Fixed-effects (within) regression Number of obs = 3,584
Group variable: co_cik Number of groups = 460
R-squared: Obs per group:
Within = 0.1323 min = 1
Between = 0.0000 avg = 7.8
Overall = 0.0162 max = 14
F(21, 459) = 17.14
corr(u_i, Xb) = -0.1523 Prob > F = 0.0000
(Std. err. adjusted for 460 clusters in co_cik)
------------------------------------------------------------------------------
| Robust
ln_tq | Coefficient std. err. t P>|t| [95% conf. interval]
-------------+----------------------------------------------------------------
x1 | -.0101627 .011002 -0.92 0.356 -.0317832 .0114579
x2 | -.0288111 .0102505 -2.81 0.005 -.0489549 -.0086673
x3 | -.0065449 .0089465 -0.73 0.465 -.0241262 .0110364
x4 | -.0000741 .0018472 -0.04 0.968 -.003704 .0035559
x5 | -.0085098 .0197696 -0.43 0.667 -.0473599 .0303404
size | -.0097784 .0046018 -2.12 0.034 -.0188217 -.0007352
re | -8.93e-09 6.30e-07 -0.01 0.989 -1.25e-06 1.23e-06
exps | .0000149 6.68e-06 2.23 0.026 1.76e-06 .000028
|
year |
2011 | -.0341651 .0178455 -1.91 0.056 -.0692341 .000904
2012 | -.0296391 .0218184 -1.36 0.175 -.0725153 .0132372
2013 | .1215146 .0255803 4.75 0.000 .0712456 .1717835
2014 | .1738158 .0273033 6.37 0.000 .1201609 .2274708
2015 | .1637915 .0311645 5.26 0.000 .1025487 .2250343
2016 | .1830113 .0288137 6.35 0.000 .1263881 .2396345
2017 | .2382127 .0320691 7.43 0.000 .1751923 .3012331
2018 | .1821995 .0352102 5.17 0.000 .1130064 .2513926
2019 | .2392736 .0354988 6.74 0.000 .1695133 .3090338
2020 | .3039198 .0380133 8.00 0.000 .2292181 .3786215
2021 | .3863713 .0370685 10.42 0.000 .3135263 .4592163
2022 | .1891405 .039879 4.74 0.000 .1107725 .2675085
2023 | .1689371 .0426209 3.96 0.000 .0851809 .2526934
|
_cons | .8312923 .0506949 16.40 0.000 .7316695 .9309151
-------------+----------------------------------------------------------------
sigma_u | .50856322
sigma_e | .30218043
rho | .73906807 (fraction of variance due to u_i)
------------------------------------------------------------------------------

Comment