Why does registering a variable as passive in an MI dataset change the results of an estimation (and apparently the data itself)?
I am working with a large, publicly available dataset that contains several variables (calculated by the survey team) that depend on imputed variables. As I understand it, this meets Stata's definition of a "passive" variable. Based on the documentation, I do not expect registering these variables as passive to change the data internally or to affect the results of estimation commands. In my data, however, it clearly does both. I run the following commands:
use DATASET
mi est: mean VAR [gives result X, based on N_1 observations]
mi register passive VAR [reports "N_2 observations of passive variable VAR in m>0 updated to match values in m=0"]
mi est: mean VAR [gives result Y, based on N_3 observations]
There are two things I do not understand: 1) why mi register passive updates my imputed data (m>0) to match the non-imputed data (m=0), seemingly wiping out the imputation; and 2) why, specifically, result X is based on a different number of observations than result Y. I note that N_1 + N_2 does not equal N_3, so the change in the number of observations between the first and second estimation does not simply equal the number of observations that mi register updated.