All,
I have a model estimation issue the cause of which I can't determine and therefore can't remedy. I am running GEE logistic models with and without multiple imputation at the behest of a reviewer on a revise and resubmit. He insisted even though there is an N of 2 for the cluster variable. The following command runs fine when multiple imputation is not used:
As shown, I am requesting robust standard errors and the nmp correction for the small number of clusters. All seems well until I get to the multiple imputation phase. The N is 258 prior to imputation and 274 after. So there is not that much missing data. But to keep the reviewer and editor happy, we went with it. Using MICE and 100 imputations (overkill I suppose but the "how_many_imputations" user add-on that I have relied on in the past seems not to work with Stata V18), all missing data are successfully imputed. The xtgee command run after running MICE is almost identical to the prior command but with the MI additions:
A portion of the output returned looks like this:
data:image/s3,"s3://crabby-images/69d23/69d23f5c81619e7df880c404786d3418c6db9571" alt="Click image for larger version
Name: Screenshot 2024-03-09 at 11.35.25 AM.png
Views: 1
Size: 58.6 KB
ID: 1746058"
If I remove the vce(robust) option, the model runs and produces all of the above missing estimates. However, I can also get all estimates if I remove one of the predictors (insurance status) and leave the vce(robust) option in. It gets weirder because if I add an additional predictor that was not in the model, homelessness, and keep insurance status and the vce(robust) option, it also works and produces estimates. The variable insurance status, has only ten missing cases to be imputed and had a distribution of 10% yes, 90% no. I did not think that would cause a problem and maybe it isn't the problem given this confusing pattern of results. I do note that with robust standard errors more of the predictors are significant and in a way that makes substantive sense. Without the robust standard errors, fewer predictors are significant. So my preference is to use the model reporting the robust standard errors and associated significance tests but I don't want to misrepresent the findings if this kind of error points to model misspecification or something I am doing wrong with MICE.
Any thoughts on debugging/troubleshooting or what might be causing this issue would be appreciated. Thanks in advance.
I have a model estimation issue the cause of which I can't determine and therefore can't remedy. I am running GEE logistic models with and without multiple imputation at the behest of a reviewer on a revise and resubmit. He insisted even though there is an N of 2 for the cluster variable. The following command runs fine when multiple imputation is not used:
Code:
xtgee linkagein14days i.(arm block gender_id1 latinx employment homeless insurance_any /// coip appointmentwithin48hours) c.(age withdrawal) appointmentwithin48hours#arm, family(binomial) /// link(logit) ef vce(robust) nmp
Code:
mi estimate, eform: xtgee linkagein14days i.(arm block gender_id1 latinx employment insurance_any homeless /// coip appointmentwithin48hours) c.(age withdrawal) appointmentwithin48hours#arm, family(binomial) /// link(logit) vce(robust) nmp
If I remove the vce(robust) option, the model runs and produces all of the above missing estimates. However, I can also get all estimates if I remove one of the predictors (insurance status) and leave the vce(robust) option in. It gets weirder because if I add an additional predictor that was not in the model, homelessness, and keep insurance status and the vce(robust) option, it also works and produces estimates. The variable insurance status, has only ten missing cases to be imputed and had a distribution of 10% yes, 90% no. I did not think that would cause a problem and maybe it isn't the problem given this confusing pattern of results. I do note that with robust standard errors more of the predictors are significant and in a way that makes substantive sense. Without the robust standard errors, fewer predictors are significant. So my preference is to use the model reporting the robust standard errors and associated significance tests but I don't want to misrepresent the findings if this kind of error points to model misspecification or something I am doing wrong with MICE.
Any thoughts on debugging/troubleshooting or what might be causing this issue would be appreciated. Thanks in advance.
Comment