I am trying to run a regression with explanatory variables both continous and dummy. dependent variable continous.
I notice that one of my dummy variables is correlated to one of the continous variables and that seems to couse collinearity. I checked Point-biserial r is about 0.74.
Variable background:
reg y x1 x2 x3 x4, fe
the dummy variable is created as x1=1 if x5>threshold
x5 is not part of my regression itself. but x5 is highly correlated to x2 (pearson 0.9).
Conceptually both x1 and x2 are relevant in the analysis so I would prefer not to drop the dummy.
Any idea on how I could deal with multicollinearity in this case? I checked Factor Analysis (princeton.edu) , but couldn't understand how/ if I could use this to solve multicollinearity issue in my case.
Any suggestion is very much appreciated! Thank you!
I notice that one of my dummy variables is correlated to one of the continous variables and that seems to couse collinearity. I checked Point-biserial r is about 0.74.
Variable background:
reg y x1 x2 x3 x4, fe
the dummy variable is created as x1=1 if x5>threshold
x5 is not part of my regression itself. but x5 is highly correlated to x2 (pearson 0.9).
Conceptually both x1 and x2 are relevant in the analysis so I would prefer not to drop the dummy.
Any idea on how I could deal with multicollinearity in this case? I checked Factor Analysis (princeton.edu) , but couldn't understand how/ if I could use this to solve multicollinearity issue in my case.
Any suggestion is very much appreciated! Thank you!
Comment