Hi,
I recently worked with HIV dataset. I have a question: how to handle Missing Values for Skipped or logic Questions? I searched in Google, but still not find the answer for this spefic question. The detailed description of this question as followed.
Two questions:
1. Eversex: Have you ever had sexual intercourse?
Yes............................................... .......... 1
No................................................ .......... 0
No response 9
2. Condom_use: Did you and/or your partner use condoms at last sex?
Yes............................................... .......... 1
No................................................ .......... 0
Only when Eversex=1, participants were asked the question of Condom_use.
So there are many missing data in Condom_use, If I delete the missing data, I will lose great than 70% samples, which will decrease the statistical power.
I need to put Eversex and Condom_use in one model, but when I put them in one model, the Stata will report error if Ido not handle the missing values in Condom_use.
I heard about some missing data techniques, like Multiple imputation, but I do not know in this case, what is the best method to handle the missing data showed in Condom_use due to skipped or logic question.
Can I just recode the missing=2 in Condom_use, and then I choose to ignore the OR about Condom_use=2, only focused on the OR about Condom_use=1
If I leave them as missing, it still have problem when I fit Condom_use in logistic models.
The biggest question is that in my Multivariable logistic regression model:
logistic Eversex Age Condom_use x1 x2 x3 x4 x5
Stata reported wrong when I execute this model.
The second question is that if I put Condom_use in any other models, the Number of obs = 215 out of 760.
Could you please give me some advise?
Thanks very much.
I recently worked with HIV dataset. I have a question: how to handle Missing Values for Skipped or logic Questions? I searched in Google, but still not find the answer for this spefic question. The detailed description of this question as followed.
Two questions:
1. Eversex: Have you ever had sexual intercourse?
Yes............................................... .......... 1
No................................................ .......... 0
No response 9
2. Condom_use: Did you and/or your partner use condoms at last sex?
Yes............................................... .......... 1
No................................................ .......... 0
Only when Eversex=1, participants were asked the question of Condom_use.
So there are many missing data in Condom_use, If I delete the missing data, I will lose great than 70% samples, which will decrease the statistical power.
I need to put Eversex and Condom_use in one model, but when I put them in one model, the Stata will report error if Ido not handle the missing values in Condom_use.
I heard about some missing data techniques, like Multiple imputation, but I do not know in this case, what is the best method to handle the missing data showed in Condom_use due to skipped or logic question.
Can I just recode the missing=2 in Condom_use, and then I choose to ignore the OR about Condom_use=2, only focused on the OR about Condom_use=1
If I leave them as missing, it still have problem when I fit Condom_use in logistic models.
The biggest question is that in my Multivariable logistic regression model:
logistic Eversex Age Condom_use x1 x2 x3 x4 x5
Stata reported wrong when I execute this model.
The second question is that if I put Condom_use in any other models, the Number of obs = 215 out of 760.
Could you please give me some advise?
Thanks very much.
Comment