Hi fellow Statalisters,
I am currently trying to take a deep dive into Multiple Imputation, and while doing so, some rather specific questions have come up that I'd be super grateful to get some advice on.
1. Question: How to impute a newly created dichotomous variable
I have created a dichotomous variable "dieting vs. no dieting" within the past year from a categorical item describing how many times a person has dieted within the past year. If I understand the documentation correctly, any variable than is created via gen/egen from an existing variable is supposed to be registered via
and then imputed via
. However, I have only found examples in which the imputed variable was created based on two or more variables, e.g.,
. I have not yet found a way how to implement the creation of my dichotomous diet-Variable, which is after all only one variable, and it is categorical as well. Does anyone know who to procede here?
2. Question: Dealing with winsorized variables
There is another continuous variable I would like to impute. With this variable, I would like to conduct sensitivity analyses after winsorizing outliers in the observed, non-missing values. I am really uncertain how to implement this in the context of imputation: Am I supposed to conduct imputation based on the winsorized observed values, e.g.,
or is there even a way to winsorize imputed values? Or is there a different procedure or shouldn't I use winsorizing in this context at all? Any insight is appreciated.
3. Question: Augment function in the case of "mi impute logit: perfect predictor(s) detected"
With one binary predictor, I have encountered the error message "mi impute logit: perfect predictor(s) detected". I have been able to resolve the issue by including the option
:
However, in the documentation this procedure is called "ad hoc" and it results in th warning message
which I am unsure how to interpret this. Does anyone have advice on whether using the
-function is appropriate here?
Sorry for asking all these questions, I'm very new to MI
any help is appreciated, thank you very much in advance!
I am currently trying to take a deep dive into Multiple Imputation, and while doing so, some rather specific questions have come up that I'd be super grateful to get some advice on.
1. Question: How to impute a newly created dichotomous variable
I have created a dichotomous variable "dieting vs. no dieting" within the past year from a categorical item describing how many times a person has dieted within the past year. If I understand the documentation correctly, any variable than is created via gen/egen from an existing variable is supposed to be registered via
HTML Code:
mi register passive
HTML Code:
mi passive: generate
HTML Code:
mi passive: generate new_Var = Var_1 + Var_2
2. Question: Dealing with winsorized variables
There is another continuous variable I would like to impute. With this variable, I would like to conduct sensitivity analyses after winsorizing outliers in the observed, non-missing values. I am really uncertain how to implement this in the context of imputation: Am I supposed to conduct imputation based on the winsorized observed values, e.g.,
HTML Code:
mi register imputed Var_1 Var_2 Var_3_winsorized mi impute chained (ologit) Var_1, (logit) Var_2 (regress) Var_3_winsorized, add(15) savetrace(trace1,replace)
3. Question: Augment function in the case of "mi impute logit: perfect predictor(s) detected"
With one binary predictor, I have encountered the error message "mi impute logit: perfect predictor(s) detected". I have been able to resolve the issue by including the option
HTML Code:
augment
HTML Code:
mi impute chained (ologit) Var_1, (logit) Var_2 (regress) Var_3_winsorized, add(15) augment savetrace(trace1,replace)
Warning: the sets of predictors of the imputation model vary across imputations or iterations
HTML Code:
augment
Sorry for asking all these questions, I'm very new to MI

Comment