I am conducting a discrete-time survival analysis and I am suspecting an endogeneity problem. To address it, I am using the 2-stage residual inclusion method since my exposure is binary. Therefore, to conduct the first stage, I am predicting the probability of receiving treatment X using a logistic regression model. My question is : which residual should I use. I searched the web and Dr. Hilbe's logistic regression book and there is no straight answer. In Stata,there are multiple options after logit post-estimation including:
dev Deviance residual
res Pearson residuals; adjusted for number sharing covariate pattern
rs Standardized Pearson residuals; adjusted for number sharing covariate pattern
In addition, I found other ways on the web that used the probability of a positive outcome and the linear prediction to calculate the residuals as follow:
xb linear prediction
pr probability of a positive outcome
Logit X Z C1 C2
Predict xb, xb
gen expxb=exp(xb)
gen resid1=X-expxb
Predict pr, pr
gen resid2=X-pr
gen resid3=1-pr
Would you please direct me toward the right choice of residuals that I should include in the 2nd stage. I highly appreciate your help.
dev Deviance residual
res Pearson residuals; adjusted for number sharing covariate pattern
rs Standardized Pearson residuals; adjusted for number sharing covariate pattern
In addition, I found other ways on the web that used the probability of a positive outcome and the linear prediction to calculate the residuals as follow:
xb linear prediction
pr probability of a positive outcome
Logit X Z C1 C2
Predict xb, xb
gen expxb=exp(xb)
gen resid1=X-expxb
Predict pr, pr
gen resid2=X-pr
gen resid3=1-pr
Would you please direct me toward the right choice of residuals that I should include in the 2nd stage. I highly appreciate your help.
Comment