Dear Prof. Kripfganz,
thank you very much for your kind reply it was indeed very helpful.
I have a few further doubts and it would be very useful if you could illuminate me on these.
My model which I have now refined looks something like this:
Where: y is my dependent variable, x my variable of interest, $controls_lag a set of relevant controls and i.Year are year dummies. I then use lags instruments for all of my endogenous variables (y x $endogen_controls) and use normal instruments for my exogenous variables and year dummies. Do note that while endogenous regressors are not lagged, exogenous regressors are.
my questions are the following:
1) Is my specification for the exogenous variables instrument correct or should i specify iv(l.$exogen_controls i.Year, model(fodev)) or iv(l.$exogen_controls i.Year, d)? As they are exogenous variables I am satisfied if they simply instrument themself. However, I would ideally make sure that my instruments are uncorrelated with unobserved unit-specific heterogeneity.
2) I would also like to estimate the same model specified using an iterated System GMM. Excluding the SYS GMM assumption, the model should be the same. Would this be the correct specification?
3) What are the consequences of specifying vce(r, model(fodev))? I would like to make my SE robust to intragroup correlation is this the specification to use?
4) I have an unbalanced panel. There are over 93 countries with some observations on the dependent variable, but some years are missing. For some reasons however, the number of groups reduces to 76 when I estimate the model. Do you have any idea of why this could be the case?
5) Using the same unbalanced dataset, the estimated model reports negative t statistics for some of the variables (with the p values close to one (.999, .998)) is this correct? Should I be concerned?
6) Finally, I have a general question that relates to lag length and the appropriateness of GMM. Another possibility to study address my research question would be to use a panel with 43 countries over 28 years. Do you believe it is credible that, with such a small number of groups and a relatively large number of time points, the assumption of GMM work in this context? Which alternative approach you would suggest? The follow-up question is how to deal with lag length. With such a small number of groups, there is a very limited number of lags that we can use to avoid instrument proliferation (1 or 2). Do you have any suggestion to determine lag length when you do not have strong theoretical reasons?
I thank in advance very much for your great helpfulness
Best regards
thank you very much for your kind reply it was indeed very helpful.
I have a few further doubts and it would be very useful if you could illuminate me on these.
My model which I have now refined looks something like this:
Code:
xtdpdgmm L(0/1).y l.x $controls_lag i.Year, model(fodev) collapse gmm(y x $endogen_controls, lag(1 4)) iv(l.$exogen_controls i.Year) igmm vce(r) small noconstant igmmiterate(200)
my questions are the following:
1) Is my specification for the exogenous variables instrument correct or should i specify iv(l.$exogen_controls i.Year, model(fodev)) or iv(l.$exogen_controls i.Year, d)? As they are exogenous variables I am satisfied if they simply instrument themself. However, I would ideally make sure that my instruments are uncorrelated with unobserved unit-specific heterogeneity.
2) I would also like to estimate the same model specified using an iterated System GMM. Excluding the SYS GMM assumption, the model should be the same. Would this be the correct specification?
Code:
xtdpdgmm y l.x $controls_lag i.Year, model(fodev) collapse gmm(y x $endogen_controls, lag(1 4)) iv(l.$exogen_controls i.Year) gmm(y x $endogen_controls, lag(1 1) diff model(level)) gmm(l.$exogen_controls i.Year, lag(0 0) diff model(level)) igmm vce(r)
4) I have an unbalanced panel. There are over 93 countries with some observations on the dependent variable, but some years are missing. For some reasons however, the number of groups reduces to 76 when I estimate the model. Do you have any idea of why this could be the case?
5) Using the same unbalanced dataset, the estimated model reports negative t statistics for some of the variables (with the p values close to one (.999, .998)) is this correct? Should I be concerned?
6) Finally, I have a general question that relates to lag length and the appropriateness of GMM. Another possibility to study address my research question would be to use a panel with 43 countries over 28 years. Do you believe it is credible that, with such a small number of groups and a relatively large number of time points, the assumption of GMM work in this context? Which alternative approach you would suggest? The follow-up question is how to deal with lag length. With such a small number of groups, there is a very limited number of lags that we can use to avoid instrument proliferation (1 or 2). Do you have any suggestion to determine lag length when you do not have strong theoretical reasons?
I thank in advance very much for your great helpfulness
Best regards
Comment