Dear honored Statalist experts,
I already hat the chance to read a lot of your interesting discussions, posts and ideas. Now as I don't know how to overcome the misspecification of my stata model I write to you in seek of advice.
I am evaluating the impact of gold price on the capital structure of 63 gold mining companies from Q1 2003 - Q4 2017. The panel data set is unbalanced because e.g. not all companies reported their Q4 2017.:
Steps conducted:

I already hat the chance to read a lot of your interesting discussions, posts and ideas. Now as I don't know how to overcome the misspecification of my stata model I write to you in seek of advice.
I am evaluating the impact of gold price on the capital structure of 63 gold mining companies from Q1 2003 - Q4 2017. The panel data set is unbalanced because e.g. not all companies reported their Q4 2017.:
Variable | Def | Obs | Mean | Std. Dev. | Min | Max |
id | Gold miners | 3780 | 131 | 18.18665 | 100 | 162 |
gold | ln(gold price) | 3780 | 6.813 | 0.4902495 | 5.85 | 7.45 |
tang_n | Total Fixed Assets/ Total Assets | 3729 | 0.5399061 | 0.2334447 | 0 | 0.99 |
prof_n | EBIT/Total Assets | 3729 | 0.0046581 | 0.0717691 | -0.89 | 0.37 |
lev_n | Total Debt/Total Assets | 3726 | 0.145314 | 0.152689 | -0.29 | 1.13 |
growth_n | (Total Assetst-Total Assetst-1) / Total Assetst-1 | 3713 | 0.0828764 | 0.4107697 | -0.91 | 11.36 |
risk_n | (EBITt-EBITt-1)/EBITt-1 | 3715 | 0.194498 | 13.30551 | -279.71 | 321.22 |
sizeii_n | ln(Total Assets) | 3729 | 6.050724 | 2.141712 | -1.69 | 11.2 |
- After intensive literature research I decided to build a basic model concluding all by literature repeatedly tested independent variables and than adding gold
- Checked for outliers by using scatter plots --> There are a few but all seem reasonable which is the reason I decided not to drop them or the entire individum
- Hausman test for fixed versus random effects model --> Rejected H0 -> FE model
- Breusch-Pagan LM test for random effects versus OLS --> Rejected H0 -> RE model to be favoured over FE model
- Wooldridge test for autocorrelation --> Rejected H0 -> Existency of autocorrelation
- Modified Wald test for heteroskedasticity -->Rejected H0 -> Existency of heteroskedasticity
- Friedman's as well as Pesaran's test for cross sectional dependence --> cross-sectional dependence
- Decided to use Driscoll-Kraay standard errors (.xtscc) to overcome autocorrelation, heteroskedasticity and most important cross-sectional dependence
- Further I make use of time & indivium invariant dummy variables to adjust for potential omitted variables
Comment