Dear Mr Kripfganz,
Thank you for your reply, I believe I understand your point now.
Digging deeper into the System GMM framework, there has surfaced more questions related to this estimator and my specific data set. I hope you have the time and effort to reply.
My N is still 288 and my T spans from 2003-2014 (year dummies corresponding to dyy5-dyy16).
I have an endogenous variable (a lagged dependent variable) in my gmmstyle parenthesis and treat remaining variables as exogenous. My code look like following:
dzone* are a set of dummy variables for vegetation zone each municipality belongs to (time invariant) dyy8-dyy16 are a set of year dummies where dyy8=2006. [aweight=CAB_ipo] are analytical weights.
Exclusion of year dummies and Orthogonality:
In your post #12 in this Stata thread (https://www.statalist.org/forums/for...r-panel-models) your are suggesting that the user drops the first two years due to lags and one more year to avoid the "dummy trap". However, when using the SystemGMM I make use of the fact that I use lagged differences as instruments for levels. Since I have L.y as endogenous, I would then have D.L2.y as instrument for levels (D.L2.y = L2.y - L3.y). Would this not imply that my first T yo include as a year dummy is T=4?
I am using the orthogonality option, since my panel has gaps in my "main" explanatory variable. Does the orthogonality option change anything about which years I should exclude, considering it makes use of all available future observations?
Eq(diff) or eq(level)
What is the rationale behind using one or the other (or both) of these options within the gmm(.) parenthesis? I have read David Roodman's (2009) paper (http://www.stata-journal.com/sjpdf.h...iclenum=st0159) extensively, especially pp123-125, but I can not understand the difference in analysis or interpretation when using gmm(L.x, eq(diff)) or gmm(L.x, eq(level)) instead of gmm(L.x). Could you possibly elaborate a bit on this? And are there any significant aspects of these options when using the SystemGMM with orthogonality option?
H(#) option when using System GMM with orthogonality condition.
When reading David Roodman's (2009) paper (http://www.stata-journal.com/sjpdf.h...��) p. 117 and p. 123 I understand that h(3) is the default for System GMM in xtabond2, but that h(3) differs slightly from the matrix in the orthogonal deviations case (ibid p.117). So, which h(#) is to be used when using the orthogonality option? I do not wish to use h(1) due to the panel's heteroskedasticity. I see that h(2) imitates DPD for Ox according to Roodman (ibid, p. 123) and I believe that h(2) was the matrix used by Blundell, Bond and Windmeijer (2001) (according to your post in https://www.statalist.org/forums/for...r-panel-models) but what are the implications of using h(2) in my case? I read in your same post that h(3) is not optimal if there are unobserved unit-specific effects, which I believe there are in my case.
Robust standard errors on municipality level or region level?
This is maybe a quite general question, and not specifically pertaining to GMM, but here it goes. My dependent variable is on county level (N=21) and my explanatory variables are on municipality level (N=288), county level, and also on national level. Now, which level do I cluster my standard errors on? I believe that there is correlation between municipalities within the same county (which speaks for the use of clustering SE:s on county level). However, my data also suffers from serial correlation within each municipality, which speaks for the use of clustering SE:s on municipality level. One source of heteroscedasticity on county level is the fact that each county has different numbers of farmers, which means that counties with a smaller farmer's population are more susceptible for shocks compared to those with more farmers. Therefore, I want to use analytic weights, using the square root of number of farmers in each county. Does it make sense to cluster standard errors on municipality level (to account for serial correlation) and to use aweights to address heteroscedasticity on county level?
Best regards
-Hanna
Thank you for your reply, I believe I understand your point now.
Digging deeper into the System GMM framework, there has surfaced more questions related to this estimator and my specific data set. I hope you have the time and effort to reply.
My N is still 288 and my T spans from 2003-2014 (year dummies corresponding to dyy5-dyy16).
I have an endogenous variable (a lagged dependent variable) in my gmmstyle parenthesis and treat remaining variables as exogenous. My code look like following:
Code:
xtabond2 y L.y L.x1 x2 dzone* x3 x4 L.x5 x6 x7 x8 dyy8-dyy16 [aweight=CAB_ipo], gmmstyle(L.y, laglimit(1 5)) ivstyle(L.x1 x2 x3 x4 L.x5 x6 x7 x8) ivstyle(dyy8-dyy16 dzone*, eq(level)) artests(3) twostep cluster(lk) h(2) orthogonal
Exclusion of year dummies and Orthogonality:
In your post #12 in this Stata thread (https://www.statalist.org/forums/for...r-panel-models) your are suggesting that the user drops the first two years due to lags and one more year to avoid the "dummy trap". However, when using the SystemGMM I make use of the fact that I use lagged differences as instruments for levels. Since I have L.y as endogenous, I would then have D.L2.y as instrument for levels (D.L2.y = L2.y - L3.y). Would this not imply that my first T yo include as a year dummy is T=4?
I am using the orthogonality option, since my panel has gaps in my "main" explanatory variable. Does the orthogonality option change anything about which years I should exclude, considering it makes use of all available future observations?
Eq(diff) or eq(level)
What is the rationale behind using one or the other (or both) of these options within the gmm(.) parenthesis? I have read David Roodman's (2009) paper (http://www.stata-journal.com/sjpdf.h...iclenum=st0159) extensively, especially pp123-125, but I can not understand the difference in analysis or interpretation when using gmm(L.x, eq(diff)) or gmm(L.x, eq(level)) instead of gmm(L.x). Could you possibly elaborate a bit on this? And are there any significant aspects of these options when using the SystemGMM with orthogonality option?
H(#) option when using System GMM with orthogonality condition.
When reading David Roodman's (2009) paper (http://www.stata-journal.com/sjpdf.h...��) p. 117 and p. 123 I understand that h(3) is the default for System GMM in xtabond2, but that h(3) differs slightly from the matrix in the orthogonal deviations case (ibid p.117). So, which h(#) is to be used when using the orthogonality option? I do not wish to use h(1) due to the panel's heteroskedasticity. I see that h(2) imitates DPD for Ox according to Roodman (ibid, p. 123) and I believe that h(2) was the matrix used by Blundell, Bond and Windmeijer (2001) (according to your post in https://www.statalist.org/forums/for...r-panel-models) but what are the implications of using h(2) in my case? I read in your same post that h(3) is not optimal if there are unobserved unit-specific effects, which I believe there are in my case.
Robust standard errors on municipality level or region level?
This is maybe a quite general question, and not specifically pertaining to GMM, but here it goes. My dependent variable is on county level (N=21) and my explanatory variables are on municipality level (N=288), county level, and also on national level. Now, which level do I cluster my standard errors on? I believe that there is correlation between municipalities within the same county (which speaks for the use of clustering SE:s on county level). However, my data also suffers from serial correlation within each municipality, which speaks for the use of clustering SE:s on municipality level. One source of heteroscedasticity on county level is the fact that each county has different numbers of farmers, which means that counties with a smaller farmer's population are more susceptible for shocks compared to those with more farmers. Therefore, I want to use analytic weights, using the square root of number of farmers in each county. Does it make sense to cluster standard errors on municipality level (to account for serial correlation) and to use aweights to address heteroscedasticity on county level?
Best regards
-Hanna
Comment