Hi all,
Thanks for the help in advance.
I want to run a logistic regression. My dependent variable is binary (majority / minority). The model includes two independent variables: independent variable 1, which can take any value between 1 and 100, and independent variable 2, which can take any value between 0 and 7. I also include an interaction (moderation) term between the two independent variables, plus three control variables. Two of the controls are categorical, with 3 and 4 categories respectively. The third control can again take any value between 1 and 100.
In Stata, I fit the model with a logistic regression and enter the categorical variables using factor-variable notation (e.g. i.categorical1). I then obtain the VIF scores using vif, uncentered. I get extremely high VIF scores (80 and 100) for independent variable 1 and for the control variable that ranges from 1 to 100. The correlation matrix shows no extreme correlations; the highest is 0.67.
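A rough sketch of the commands described above, in case it helps to see the setup written out. The variable names (y, iv1, iv2, cat1, cat2, ctrl3) are placeholders rather than the real ones, and writing the moderation term with the ## factor-variable operator is an assumption about how the interaction was specified.

```stata
* Placeholder variable names; substitute the real ones.
* y: binary outcome (majority / minority)
* iv1 (1-100) and iv2 (0-7): the two independent variables
* cat1 (3 categories), cat2 (4 categories), ctrl3 (1-100): controls
logistic y c.iv1##c.iv2 i.cat1 i.cat2 c.ctrl3

* Uncentered VIFs, as described in the post
vif, uncentered
```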
How do I go about this? Am I doing it right? Am I calculating the VIF scores correctly? Should I handle the categorical variables differently? The two independent variables are needed for my research question. Should I then drop my control variable?
In the models where independent variable 1 is not present, no high VIF scores were reported (all were under 3).
Many thanks for the help already.