I am running a Cox regression in Stata. One of the things I would like to show with data is how adoption and abandonments of Zoom in different branches of a multinational company impact the adoption. The diagnostic statistics indicate that two variables of interest are highly multicollinear (0.72), I use
I tried to add each variable separately to the model, but the results do not match with the fully saturated model.
Specifically, if I run;
The coefficient on NoAdoption is positive and the coefficient on NoAbandonment is negative
If I run:
The coefficient on NoAdoption is positive, which is consistent with the full model above
If I run:
The coefficient on NoAbandonment is positive, which is not consistent with the full model above and not consistent what the theory would predict.
Is there a way to deal with Multicollinearity in Cox models (e.g. L1 or L2 regularization) ?
I thought about Orthogonalization but learned that it is not considered to be a good practice in my field (or maybe in econometrics in general).
Code:
estat vce, corr
Specifically, if I run;
Code:
stcox Age Size NoAdoption NoAbandonment i.date, strata(company) vce(cluster company)
If I run:
Code:
stcox Age Size NoAdoption i.date, strata(company) vce(cluster company)
If I run:
Code:
stcox Age Size NoAbandonment i.date, strata(company) vce(cluster company)
Is there a way to deal with Multicollinearity in Cox models (e.g. L1 or L2 regularization) ?
I thought about Orthogonalization but learned that it is not considered to be a good practice in my field (or maybe in econometrics in general).
Comment