Currently, I'm working on my thesis working doing a Fixed Effects regression in a dataset consisting of 1,301 observations with 7 variables. I'm using Stata 16.0. The dependent variable is CO2 emissions per capita, whereas the independent variables are government ideology (categorial; -1 right-wing; 0 center; 1 left-wing government), Herfindahl index (from 0 to 1), Polity 2 score (from -10 to 10), Urban pop (% urban pop of total pop), trade openness (% trade of total gdp), log of gdp_per_capita and gdp^2 (in millions). The Fixed Effects model came as most appropriate by doing the Hausman test.
From previous post on the forum, I managed to structure the data and do a -xtreg, fe- regression resulting in the following results by entering the following command:
The summary of my variables:
My questions are the following:
1) For the GDP per capita^2, I had to divide the variable by 1,000,000 to get results from the regression. Is this normal?
2) My overall regression seems significant whereas my variable of interest, government ideology (execrlc) is not. Are there ways to fix this? Should I do another regression method or include/exclude some variables?
3) Should I include interaction terms or should other variables be taken the log of (or undo the log of some)?
I'm quite uncertain on what to do as I can't figure out what the next step should be and where to go. Hopefully, you guys can help me on this
From previous post on the forum, I managed to structure the data and do a -xtreg, fe- regression resulting in the following results by entering the following command:
Code:
xtreg co2_per_capita execrlc herfgov polity2 urban_pop trade_open log_gdp gdp2, fe robust
Code:
Fixed-effects (within) regression Number of obs = 995 Group variable: panel_id Number of groups = 41 R-sq: Obs per group: within = 0.4099 min = 1 between = 0.5192 avg = 24.3 overall = 0.4376 max = 44 F(7,40) = 48.18 corr(u_i, Xb) = 0.2276 Prob > F = 0.0000 (Std. Err. adjusted for 41 clusters in panel_id) ------------------------------------------------------------------------------ | Robust co2_per_ca~a | Coef. Std. Err. t P>|t| [95% Conf. Interval] -------------+---------------------------------------------------------------- execrlc | .0389004 .0468245 0.83 0.411 -.0557355 .1335363 herfgov | .0126363 .1187266 0.11 0.916 -.2273191 .2525917 polity2 | -.0325272 .0187234 -1.74 0.090 -.0703687 .0053142 urban_pop | .0361227 .0145646 2.48 0.017 .0066866 .0655589 trade_open | .0065079 .0039343 1.65 0.106 -.0014437 .0144595 gdp_per_ca~a | .0000583 .0000341 1.71 0.095 -.0000106 .0001273 gdp2 | -.0005166 .0009303 -0.56 0.582 -.0023969 .0013636 _cons | -.363398 .7462756 -0.49 0.629 -1.871677 1.144881 -------------+---------------------------------------------------------------- sigma_u | 1.5154859 sigma_e | .36628114 rho | .94480887 (fraction of variance due to u_i)
Code:
Variable | Obs Mean Std. Dev. Min Max -------------+--------------------------------------------------------- panel_id | 1,301 163.6595 107.1012 6 377 country | 0 code | 0 year | 1,301 1997.003 11.61816 1975 2018 co2_per_ca~a | 995 2.042711 1.634142 .041 6.496 -------------+--------------------------------------------------------- co2_per_gdp | 1,215 .3726897 .2829472 .038 2.61 execrlc | 1,301 .2221368 .8973476 -1 1 herfgov | 1,301 .8013089 .2756591 .0743667 1 polity2 | 1,301 4.68947 6.330302 -9 10 urban_pop | 1,301 58.92987 20.57164 7.834 97.403 -------------+--------------------------------------------------------- trade_open | 1,301 61.76426 30.59038 8.384615 152.5161 gdp_per_ca~a | 1,301 6184.989 8368.914 104.2722 38542.72 gdp2 | 1,301 108.239 262.4872 .0108727 1485.541 left | 1,301 .5380477 .498742 0 1 right | 1,301 .3159108 .4650564 0 1 -------------+--------------------------------------------------------- log_gdp | 1,301 7.901027 1.356007 4.647005 10.55952
1) For the GDP per capita^2, I had to divide the variable by 1,000,000 to get results from the regression. Is this normal?
2) My overall regression seems significant whereas my variable of interest, government ideology (execrlc) is not. Are there ways to fix this? Should I do another regression method or include/exclude some variables?
3) Should I include interaction terms or should other variables be taken the log of (or undo the log of some)?
I'm quite uncertain on what to do as I can't figure out what the next step should be and where to go. Hopefully, you guys can help me on this

Comment