Hi everyone, as a pre-cursor, I know parts of my enquiry have been answered in other forum posts. However, I am hoping that this question brings together some of those posts and people might be able to provide some overarching advice on the use of non-linear regression models for binary outcome variables!
I want to run a series of non-linear (probably logit) regression models to predict binary outcome variables. I have been reading about how to deal with heteroskedasticity in the errors and it has become clear that I shouldn't be using "robust" standard errors as I would under a linear model (because if there is heteroskedasticity, the parameters are also inconsistent).
However, others have made note that heteroskedasticity is inherent and to be expected in the MLE of logit and other non-linear regression models (and we shouldn't be concerned about this).
Given those pieces of advice, what are people's recommendations on running logit regression models?
i.e.
Should I be accounting for heteroskedasticity somehow (and if so, how)?
Are there key diagnostics for these non-linear models that will show how appropriate the model is (and how unbiased/consistent the estimators are)?
Could I use a linear probability model (LPM) instead if the model doesn't give significant out-of-bounds predictions? I know this raises other issues too about the best way to model binary outcome variables!
Thanks in advance for the help!
I want to run a series of non-linear (probably logit) regression models to predict binary outcome variables. I have been reading about how to deal with heteroskedasticity in the errors and it has become clear that I shouldn't be using "robust" standard errors as I would under a linear model (because if there is heteroskedasticity, the parameters are also inconsistent).
However, others have made note that heteroskedasticity is inherent and to be expected in the MLE of logit and other non-linear regression models (and we shouldn't be concerned about this).
Given those pieces of advice, what are people's recommendations on running logit regression models?
i.e.
Should I be accounting for heteroskedasticity somehow (and if so, how)?
Are there key diagnostics for these non-linear models that will show how appropriate the model is (and how unbiased/consistent the estimators are)?
Could I use a linear probability model (LPM) instead if the model doesn't give significant out-of-bounds predictions? I know this raises other issues too about the best way to model binary outcome variables!
Thanks in advance for the help!
Comment