Using Stata I ran regressions with n=1101 observations. ID is a unique firm identifier and Year is a year variable.
I ran two different regressions, simple OLS and Company+Year fixed effects and clustered standard errors.
regress Y1 Dummy1 Dummy2 Dummy3 Dummy4 Dummy5 Dummy6 X1 X2 X3 PROBLEM1 COUNTRY
reghdfe Y1 Dummy1 Dummy2 Dummy3 Dummy4 Dummy5 Dummy6 X1 X2 X3 PROBLEM1 COUNTRY, absorb(ID Year) cluster(ID Year)
If I run the simple OLS regression, everything looks fine.
But when I include fixed effects and standard error clustering, Stata warns me about:
Problem is variable that is equal to 1 if the firm is audited by a BIG4 company, a variable that usually is not changing across time.
Country is equal to 1 if the firm is located in a specific country. From all the 1101 observations, 420 are coded with COUNTRY=1. How can I fix my design to not exclude these variables?
2. Is it a big issue that I included so many dummy variables, apart from interpretation?
3. Alternatively, I could include industry fixed effects, but I would love to include Year and Firm fixed effects/Understand why my model right now is not working.
Best regards
I ran two different regressions, simple OLS and Company+Year fixed effects and clustered standard errors.
regress Y1 Dummy1 Dummy2 Dummy3 Dummy4 Dummy5 Dummy6 X1 X2 X3 PROBLEM1 COUNTRY
reghdfe Y1 Dummy1 Dummy2 Dummy3 Dummy4 Dummy5 Dummy6 X1 X2 X3 PROBLEM1 COUNTRY, absorb(ID Year) cluster(ID Year)
If I run the simple OLS regression, everything looks fine.
But when I include fixed effects and standard error clustering, Stata warns me about:
- dropped 6 singleton observations
- PROBLEM1 is probably collinear with the fixed effects (all partialled-out values are close to zero; tol = 1.0e-09)
- COUNTRY is probably collinear with the fixed effects (all partialled-out values are close to zero; tol = 1.0e-09)
- Warning: VCV matrix was non-positive semi-definite; adjustment from Cameron, Gelbach & Miller applied.
- missing F statistic; dropped variables due to collinearity or too few clusters
Problem is variable that is equal to 1 if the firm is audited by a BIG4 company, a variable that usually is not changing across time.
Country is equal to 1 if the firm is located in a specific country. From all the 1101 observations, 420 are coded with COUNTRY=1. How can I fix my design to not exclude these variables?
2. Is it a big issue that I included so many dummy variables, apart from interpretation?
3. Alternatively, I could include industry fixed effects, but I would love to include Year and Firm fixed effects/Understand why my model right now is not working.
Best regards
Comment