Hi All:
Hoping that I can get some suggestions on the following:
I am using a continuous income variable as one of my predictors of a binary dependent variable. Nearly 4 percent of my cases have a value of 0 for the income variable. I think that using logged income is the best approach. I am using the svy: logistic command with survey data.
I have done some reading but am still not sure whether it is better to:
1) Use a logged income variable that drops the 3.4 percent of cases as missing (N=981)
or
2) Change the value of income to $1 (or something else) and then do the log transformation. (N=1013)
I have tried both in logistic regression analyses with different results, keeping all other variables exactly the same. With option #1, the odds ratio of the income variable is large and significant; another critical variable that I'm more interested in is not significant. With option #2, odds ratio close to 1 and insignificant, but the same other critical variable is significant.
Any suggestions? I've read about creating a dummy variable for the cases where income is $0 and using in conjunction with option #1, but haven't tried that yet.
Thanks so much!
Edie
Examples of what I've read:
http://www.stata.com/statalist/archi.../msg00018.html
http://www.stata.com/statalist/archi.../msg00629.html
Hoping that I can get some suggestions on the following:
I am using a continuous income variable as one of my predictors of a binary dependent variable. Nearly 4 percent of my cases have a value of 0 for the income variable. I think that using logged income is the best approach. I am using the svy: logistic command with survey data.
I have done some reading but am still not sure whether it is better to:
1) Use a logged income variable that drops the 3.4 percent of cases as missing (N=981)
or
2) Change the value of income to $1 (or something else) and then do the log transformation. (N=1013)
I have tried both in logistic regression analyses with different results, keeping all other variables exactly the same. With option #1, the odds ratio of the income variable is large and significant; another critical variable that I'm more interested in is not significant. With option #2, odds ratio close to 1 and insignificant, but the same other critical variable is significant.
Any suggestions? I've read about creating a dummy variable for the cases where income is $0 and using in conjunction with option #1, but haven't tried that yet.
Thanks so much!
Edie
Examples of what I've read:
http://www.stata.com/statalist/archi.../msg00018.html
http://www.stata.com/statalist/archi.../msg00629.html
Comment