Dear STATA community,
I am running a Logit Model, and I am facing a problem that had been discussed here already: variable != 0 predicts failure perfectly. However, the discussions around it have not yet solved my problem.
When including country fixed effects, I face the aforementioned problem which leads to a massive drop of my observations (nearly half of it). Some answers in this forum were suggesting to use -firthlogit- or -exlogistic-, however these commands do not work in my case as I am using time-series operators and factor variables. Other answers were suggesting to keep the model as it is, since the data does not provide enough information about how changes in the predictor variables are associated with changes in the outcome.
Before this model, I run three other models (starting with a bivariate, then including control variables, then time fixed effects). In these three models, I do not face the problem and my observations are not dropped. I want to compare my results across these models, and I am wondering if this is at all possible since so many observations were dropped?
Here you can see the observations numbers:
Bivariate Model: 18488 | Model with all controls: 9981| Time Fixed effects: 9840 |Time and Country Fixed effects: 4353
And here you can see the “error” message from STATA (as an example):
note: 41.country_id != 0 predicts failure perfectly; 41.country_id omitted and 29 obs not used.
Country_id is a unique ID designed for every country.
Notably, a few years have also been deleted: note: 1901.year != 0 predicts failure perfectly; 1901.year omitted and 35 obs not used.
My idea right now is to include country&time fixed effects for all models so I have a comparable observation number for all models.
Thank you very much for your help in advance!
I am running a Logit Model, and I am facing a problem that had been discussed here already: variable != 0 predicts failure perfectly. However, the discussions around it have not yet solved my problem.
When including country fixed effects, I face the aforementioned problem which leads to a massive drop of my observations (nearly half of it). Some answers in this forum were suggesting to use -firthlogit- or -exlogistic-, however these commands do not work in my case as I am using time-series operators and factor variables. Other answers were suggesting to keep the model as it is, since the data does not provide enough information about how changes in the predictor variables are associated with changes in the outcome.
Before this model, I run three other models (starting with a bivariate, then including control variables, then time fixed effects). In these three models, I do not face the problem and my observations are not dropped. I want to compare my results across these models, and I am wondering if this is at all possible since so many observations were dropped?
Here you can see the observations numbers:
Bivariate Model: 18488 | Model with all controls: 9981| Time Fixed effects: 9840 |Time and Country Fixed effects: 4353
And here you can see the “error” message from STATA (as an example):
note: 41.country_id != 0 predicts failure perfectly; 41.country_id omitted and 29 obs not used.
Country_id is a unique ID designed for every country.
Notably, a few years have also been deleted: note: 1901.year != 0 predicts failure perfectly; 1901.year omitted and 35 obs not used.
My idea right now is to include country&time fixed effects for all models so I have a comparable observation number for all models.
Thank you very much for your help in advance!
Comment