Hi all,
I want to estimate the effects of agricultural innovation subsidies on firm's R&D activities.
My dataset contains innovative agricultural firms, of which some received a subsidy (treated) and some did not (untreated)
I observe the activities (in number of R&D personnel) and other variables of these firms in 2007-2011 (balanced panel)
Grants were provided in 2009 and 2010 and the dataset contains the exact value of the subsidy
I want to use a difference-in-difference approach to estimate the effects of the subsidy on R&D activity.
As the subsidy relates to innovation projects with a minimum duration of 3 years, I assume the subsidy will have effect in all of these years.
Yet, I am not sure which equation i should use to estimate these effects. I think I need to estimate the following equation with fixed effects:
yit = ai + λt + β*Subsidyt + μ*Xit + eit
where i = firm, t = year, X = other covariates, where 'Subsidy' has value 0 in pre-treatment years for both control and treatment firms and value of the subsidy in post-treatment years for treated firms.
i.e. in Stata (in the case without covariates):
xtset firm year
xtreg r_d_personnel i.year subsidy, fe cluster(firm)
I wanted to replicate my results by estimating a standard 'DID' in a regression equation and estimated the following equation:
regress r_d_personnel i.year treated subsidy, vce(cluster firm)
where treated is a dummy variable equal to one when a firm is in the treamtent group (received subsidy in 2009 or 2010) and zero when a firm did not receive a subsidy in 2007-2011.
Yet, the models do not produce the same results and I know it has something to do with the treatment being a continuous variable, because if I estimate the same equation with a dummy variable for treatment (which is equal to 1 for treated firms in post-treatment years and zero otherwise) I do get exactly the same results. I do however not know which equation is right and why.
Furthermore, when I include (time-varying) control variables I also obtain different results, even with the treatment being a dummy variable.
I hope you can answer my question(s)
Thanks in advance.
Kind regards,
John
I want to estimate the effects of agricultural innovation subsidies on firm's R&D activities.
My dataset contains innovative agricultural firms, of which some received a subsidy (treated) and some did not (untreated)
I observe the activities (in number of R&D personnel) and other variables of these firms in 2007-2011 (balanced panel)
Grants were provided in 2009 and 2010 and the dataset contains the exact value of the subsidy
I want to use a difference-in-difference approach to estimate the effects of the subsidy on R&D activity.
As the subsidy relates to innovation projects with a minimum duration of 3 years, I assume the subsidy will have effect in all of these years.
Yet, I am not sure which equation i should use to estimate these effects. I think I need to estimate the following equation with fixed effects:
yit = ai + λt + β*Subsidyt + μ*Xit + eit
where i = firm, t = year, X = other covariates, where 'Subsidy' has value 0 in pre-treatment years for both control and treatment firms and value of the subsidy in post-treatment years for treated firms.
i.e. in Stata (in the case without covariates):
xtset firm year
xtreg r_d_personnel i.year subsidy, fe cluster(firm)
I wanted to replicate my results by estimating a standard 'DID' in a regression equation and estimated the following equation:
regress r_d_personnel i.year treated subsidy, vce(cluster firm)
where treated is a dummy variable equal to one when a firm is in the treamtent group (received subsidy in 2009 or 2010) and zero when a firm did not receive a subsidy in 2007-2011.
Yet, the models do not produce the same results and I know it has something to do with the treatment being a continuous variable, because if I estimate the same equation with a dummy variable for treatment (which is equal to 1 for treated firms in post-treatment years and zero otherwise) I do get exactly the same results. I do however not know which equation is right and why.
Furthermore, when I include (time-varying) control variables I also obtain different results, even with the treatment being a dummy variable.
I hope you can answer my question(s)
Thanks in advance.
Kind regards,
John
Comment