Hello everyone,
I am studying a data series defined by :
tsset CountyCode year
I have 3 other variables : var1, var2 and var3.
I want to compare the results of 3 different regression methods :
1) First regression method :
xi:areg var1 var2 var3 i.year, absorb(CountyCode)
2) Second regression method using the residuals of the regression of var1, var2 and var3 respectively on year and county fixed effects (as in the Frisch-Waugh-Lovell theorem) and doing the final regression with reg :
xi:areg var1 i.year, absorb(CountyCode)
predict var1_res, residuals
xi:areg var2 i.year, absorb(CountyCode)
predict var2_res, residuals
xi:areg var3 i.year, absorb(CountyCode)
predict var3_res, residuals
reg var1_res var2_res var3_res
3) Third regression method using the residuals of the regression of var1, var2 and var3 respectively on year and county fixed effects and doing the final regression with xtreg :
xi:areg var1 i.year, absorb(CountyCode)
predict var1_res, residuals
xi:areg var2 i.year, absorb(CountyCode)
predict var2_res, residuals
xi:areg var3 i.year, absorb(CountyCode)
predict var3_res, residuals
xtreg var1_res var2_res var3_res, re
I obtain the following coefficients :
1) First regression :
var2 : 0.004***
var3 : 0.07***
2) Second regression :
var2_res : 0.02***
var3_res : 0.06*
3) Third regression :
var2_res : 0.005***
var3_res : 0.08***
My questions are the following :
- According to the Frisch-Waugh-Lovell theorem, I expected to get the same results from the regressions 1) and 2). Why is not it the case ?
- Is it only by chance that the regressions 1) and 3) give similar results ?
- If no, why is it important to use a random variable for the county-specific effects (i.e. to use xtreg..., re) whereas these effects seem to me to have been suppressed in the previous regressions (of type xi:are var1 i.year, absorb(CountyCode)) ?
- If yes, what regressions should I use in the method 2) to obtain the same result as in regression 1) ?
I thank you in advance for your answers.
I am studying a data series defined by :
tsset CountyCode year
I have 3 other variables : var1, var2 and var3.
I want to compare the results of 3 different regression methods :
1) First regression method :
xi:areg var1 var2 var3 i.year, absorb(CountyCode)
2) Second regression method using the residuals of the regression of var1, var2 and var3 respectively on year and county fixed effects (as in the Frisch-Waugh-Lovell theorem) and doing the final regression with reg :
xi:areg var1 i.year, absorb(CountyCode)
predict var1_res, residuals
xi:areg var2 i.year, absorb(CountyCode)
predict var2_res, residuals
xi:areg var3 i.year, absorb(CountyCode)
predict var3_res, residuals
reg var1_res var2_res var3_res
3) Third regression method using the residuals of the regression of var1, var2 and var3 respectively on year and county fixed effects and doing the final regression with xtreg :
xi:areg var1 i.year, absorb(CountyCode)
predict var1_res, residuals
xi:areg var2 i.year, absorb(CountyCode)
predict var2_res, residuals
xi:areg var3 i.year, absorb(CountyCode)
predict var3_res, residuals
xtreg var1_res var2_res var3_res, re
I obtain the following coefficients :
1) First regression :
var2 : 0.004***
var3 : 0.07***
2) Second regression :
var2_res : 0.02***
var3_res : 0.06*
3) Third regression :
var2_res : 0.005***
var3_res : 0.08***
My questions are the following :
- According to the Frisch-Waugh-Lovell theorem, I expected to get the same results from the regressions 1) and 2). Why is not it the case ?
- Is it only by chance that the regressions 1) and 3) give similar results ?
- If no, why is it important to use a random variable for the county-specific effects (i.e. to use xtreg..., re) whereas these effects seem to me to have been suppressed in the previous regressions (of type xi:are var1 i.year, absorb(CountyCode)) ?
- If yes, what regressions should I use in the method 2) to obtain the same result as in regression 1) ?
I thank you in advance for your answers.
Comment