Dear All,
my question here is probably methodological rather than Stata-technical.
I want to estimate a quite simple regression model that boils down to this:
Y = x + i.country
where X is my variable of interest and country1... country20 are country dummies.
To check robustness of my results and see whether any specific country is "driving" my estimates for X, my idea was to obtain jackknife estimates (first time for me!) for X, dropping one country at a time. That is, in Stata I write:
jackknife coef_x = _b[x], eclass cluster(country): reg y x i.country
which replicates the regression 20 times, dropping one cluster (country) at a time.
Stata won't let me do this because by dropping one country at a time, in each replication the set of coefficients to be estimated is different. The precise error message I get is:
"collinearity in replicate sample is not the same as the full sample, posting missing values"
(after each replication)
and
"insufficient observations to compute jackknife standard errors
no results will be saved
r(2000);"
(at the end of the command execution)
So I understand that the "real" jackknifing does not really support this type of application, because it requires homogeneous sets of coefficients to be estimated across replication subsamples.
What I could do is run my model by hand, dropping one country at a time, and present the 20 different estimates for X. But is this the fairest / most useful / concise thing to do here? There WILL be differences in the estimates of X across the 20 subsamples - but is this telling me something useful? Is there no way to deliver a "summary" measure of stability of my coefficient?
I like the jackknife approach because it delivers ONE "robust" coefficient estimate - obtained (as far as I understand) by averaging the different replication estimates.
Should I then average the 20 different estimates for X that I get by manually dropping one country at a time, and present that result as a "jackknife" estimate? And how do I obtain standard errors for it?
Thank you in advance, anyone who has thoughts on this!!
Zelda
my question here is probably methodological rather than Stata-technical.
I want to estimate a quite simple regression model that boils down to this:
Y = x + i.country
where X is my variable of interest and country1... country20 are country dummies.
To check robustness of my results and see whether any specific country is "driving" my estimates for X, my idea was to obtain jackknife estimates (first time for me!) for X, dropping one country at a time. That is, in Stata I write:
jackknife coef_x = _b[x], eclass cluster(country): reg y x i.country
which replicates the regression 20 times, dropping one cluster (country) at a time.
Stata won't let me do this because by dropping one country at a time, in each replication the set of coefficients to be estimated is different. The precise error message I get is:
"collinearity in replicate sample is not the same as the full sample, posting missing values"
(after each replication)
and
"insufficient observations to compute jackknife standard errors
no results will be saved
r(2000);"
(at the end of the command execution)
So I understand that the "real" jackknifing does not really support this type of application, because it requires homogeneous sets of coefficients to be estimated across replication subsamples.
What I could do is run my model by hand, dropping one country at a time, and present the 20 different estimates for X. But is this the fairest / most useful / concise thing to do here? There WILL be differences in the estimates of X across the 20 subsamples - but is this telling me something useful? Is there no way to deliver a "summary" measure of stability of my coefficient?
I like the jackknife approach because it delivers ONE "robust" coefficient estimate - obtained (as far as I understand) by averaging the different replication estimates.
Should I then average the 20 different estimates for X that I get by manually dropping one country at a time, and present that result as a "jackknife" estimate? And how do I obtain standard errors for it?
Thank you in advance, anyone who has thoughts on this!!
Zelda
Comment